Exam 3 Flashcards
What is important to remember about the sampling distribution of means?
A population with a normal distribution has a distribution of sample means that is normal
What statistic can you use to test a distribution of sample means? (first one)
- Z standardization, however you rarely know the population standard deviation so substitute and use a students t-distribution
Describe the student’s t distribution.
- similar to standard normal distribution (z) but with fatter tails
- as the sample size increases, the t distribution becomes more like the standard normal distribution
What can the t-distribution be used for?
- It can be used to accurately calculate a confidence interval for the mean of a population with a normal distribution
- (population mean) - (Tcritical value x SE(standard error of mean) )< (actual mean) < (population mean) + (Tcritical value x SE(standard error of mean) )
What is the standard error of the mean?
SE (y) = s / sqrt(n)
What is a one sample t-test?
compares the mean of a random sample from a normal population with the population mean proposed in a null hypothesis
What are the hypotheses and test statistic in a one sample t test?
H0 - true mean equals u0
Ha - true mean does not equal u0
How do you interpret the t-statistic in a one sample t-test?
- compute the p value: probability of this t-statistic or more extreme given the null hypothesis is true
- if p value is >.05 then you fail to reject the null hypothesis
How does increasing sample size affect a one sample t test?
- increasing sample size reduces the standard error of the mean
- increase the probability of rejecting a false null hypothesis (power)
What are the assumptions of a one-sample t-test?
- data are a random sample from the population
- variable is normally distributed in the population (robust to departures)
What are the confidence intervals for variance and standard deviation? And what are the assumptions of these statistics?
Assumptions: random sample from the population, variable must have a normal distribution (formulas are NOT robust to departures from normality)
What are two different study designs when comparing two means?
two-sample and paired designs
What is a two sample design?
- two groups
- each group is composed of independent sample of units
What is a paired designs?
- two groups
- each sampled unit receives both treatments
- paired designs are usually more powerful because of control for variation among sampling units
How are paired designs treated?
- paired measurements are converted to a single measurement by taking the difference between them
What is a paired t-test?
- used to test the null hypothesis that the mean difference of paired measurements equals a specific value
- null is often that the difference (change) is zero before and after treatment
How does a paired t-test compare to a one sample t-test?
- The same process except the calculation of the test statistic occurs on the difference value (d)
What does the p-value indicate in a paired t-test statistic?
- P-value >0.05
- Fail to reject the null hypothesis that the mean change is zero
- P-value <0.05
- reject the null hypothesis that the mean change is zero
What are the assumptions of a paired t-test?
- sampling units are randomly sampled from the population
- paired differences have a normal distribution in the population
What test is a formal test of normality?
the shapiro-wilk test
What are the hypotheses of the shapiro-wilk test? Why should it be used with caution?
- H0 = sample has normal distribution
- Ha = sample does not have normal distribution
- Should be used with caution:
- small sample sizes lack power to reject a false null (Type 2 error)
- large sample sizes can reject null when the departure from normality is minimal and would not affect methods that assume normality
Under the null hypothesis, the sampling distribution of the one-sample t statistic follows a _________
t distribution with n-1 DOF
Describe the t distribution.
-The area under the curve to the left (lower tail) of -t is the same as the area to the right (upper tail) of t
- t distribution is symmetrical around the mode of zero
What does a Shapiro Wilk test evaluate?
- evaluates the goodness of fit of a normal distribution to a set of data randomly sampled from a population