Slides 13, 14 ΰ³ Flashcards
What is the research question regarding prices on Amazon and the UCLA bookstore?
Are the prices on Amazon different than the prices at the UCLA bookstore?
How many UCLA courses were sampled for the price comparison?
201 UCLA courses.
How many paired data points were found for the price comparison?
68 paired data points.
What is the formula for calculating the price difference in the dataset?
UCLA Bookstore price - Amazon price
True or False: Consistency matters when analyzing paired data.
True.
What is the importance of subtracting using a consistent order in paired data analysis?
Your results wonβt make sense if the order of subtraction is inconsistent.
What does the term βdifference of two unpaired group meansβ refer to?
It refers to comparing the average of two groups without creating a new column of data.
What must be checked separately when working with unpaired data?
The Central Limit Theorem (CLT) conditions.
What is the research question regarding newborns from smoking and non-smoking mothers?
Is there convincing evidence that newborns from mothers who smoke have a different average birth weight than newborns from mothers who donβt smoke?
How many cases are included in the smoking group and the non-smoking group in the dataset?
Smoking group: 50 cases; Non-smoking group: 100 cases.
What are the two conditions that must be checked for the CLT when using unpaired data?
- Independence
- Normality
What does the T statistic represent when the population standard deviations are unknown?
It is used for inference on the difference of two means.
What is represented by ΞΌn and ΞΌs in the context of birth weights?
- ΞΌn = population mean of nonsmoking mothers
- ΞΌs = population mean of smoking mothers
What is the significance level used in the hypothesis test for birthweights?
Ξ± = 0.05
What should be used to calculate degrees of freedom when working with more than one sample?
The smaller sample size.
What is the purpose of calculating the standard error in the analysis?
To estimate the variability of the point estimate of the population difference.
What statistical method is used to model the difference of sample means?
t-distribution.
What is required to conclude the hypothesis test based on the t-score?
Determine whether to use pt() or 1 - pt() based on the t-score.
Fill in the blank: The weight variable represents the weights of the _______.
newborns.
What type of data is represented in the ncbirths dataset?
A random sample of mothers and their newborns in North Carolina.
What is the first condition for the Central Limit Theorem in unpaired data?
The observations are independent, both within and between samples.
What is a common method to check for outliers in data?
Visual inspection of the data distribution.
What is needed to find the area in the tails (the p-value) using R?
The degrees of freedom (df)
If working with more than one sample, use the smaller sample size to calculate df.
What does a p-value indicate about the strength of evidence?
The evidence may be too weak to detect a real difference
P-value is linked to sample sizes.