One-Way ANOVA Flashcards
What method tests significant difference between two independent samples?
t-test
What is the formula for determining how many t-tests are required to compare n samples? How many samples are required for 10 samples?
- Number of t-tests with n samples = n! / (2!(n-2)!)
- 10 samples:
- 10! / 2! * 8!
- 10 * 9 / 2
- 45
For three or more samples, how do you find the distance/variability between means?
- Find the average squared deviation of each sample mean from the total mean.
- This “total mean” is known as the Grand Mean
- xbarG
When it comes to the Grand Mean, if the sample sizes are the same for each sample group, how can the Grand Mean be determined?
- If each sample size is the same, then the “mean of means” can be used.
- For example:
- Samples X, Y, and Z
- Each sample is the same size (n)
- The mean can then be determined by:
- Adding each average and dividing by the total number of samples (3)
- (xbar + ybar + zbar) / 3
- When does this approach not work?
- When the sizes of the sample are not the same.
- Then you have to add the average of all samples and divide by the total number of values (sample size of each sample group, added together)
- What conclusions can we draw from the deviation of each sample mean from the mean of means?
- What is this known as?
- What conclusions can we draw from the variability of each sample mean from the mean of means?
- What is this known as?
- Between-group variability
- The smaller the distance between sample means, the less likely population means will differ significantly
- The greater the distance between sample means, the more likely the population means will differ significantly.
- Within-group variability
- The greater the variability of each individual sample, the less likely population means will differ significantly
- The smaller the variability of each individual sample, the more likely population means will differ significantly.
ANOVA
- Analysis of Variance
- a collection of statisitcal models and their associated procedures (such as “variation” among and between groups) used to analyze the differences among group means.
- In its simplest form, ANOVA provids a statistical test of whether or not the means of several groups are equal, and therefore generalizes the t-test to more than two groups.
- ANOVA is useful for comparing (testing) three or more means (groups or variables) for statistical significance.
- Hypothesis testing:
- Ho: M1 = M2 = M3
- HA: At least one pair of samples is significantly different
What does it mean if you get a large statistic during an ANOVA test?
- Two means are causing between subject variability
- You will reject the null hypothesis (accept the alternative hypothesis)
- You will need to do an additional step to find out which means are different from each other.
- These additional tests are called multiple comparison tests
During an ANOVA test, if the variance of a “within-group” individual sample becomes bigger (all else held constant) what does this mean?
- The between sample means are not significantly different
- We accept the null hypothesis
- Our test statistic will be smaller because there is a larger within group variability
During an ANOVA test, if the between group variability increases (the sample means get further apart from each other), what does this mean?
- At least one pair of samples is significantly different
- We accept the alternative hypothesis (reject the null hypothesis)
What is the statistic for the ANOVA test?
- F statistic
- F = between-group variability / within-group variability
- Reasoning:
- Increases in “between-group” variability means accepting the alternative hypothesis
- Having “between-group” in the numerator will make a large F statistic when there are increases in “between-group” variability
- Increases in “within-group” variability means accepting the null hypothesis
- Having “within-group” variability in the denominator means a small F statistic when there are increases in “within-group” variability
- Increases in “between-group” variability means accepting the alternative hypothesis
What is the formula for ANOVA?
- F = between-group variability / within-group variability
- F = ( SSbetween / dfbetween ) / ( SSwithin / dfwithin )
- F = MSbetween / MSwithin
The F-statistic is _________ negative.
- never
SStotal
- SStotal = SSbetween + SSwithin
- SStotal = sum(xi - xbarG)2
dftotal
- dftotal = dfbetween + dfwithin = N -1
What does the F-distribution look like?
- Righ (positive) skewed
- Peaks at ‘1’
- This is due to “no change” in the numerator and “no change” in the denominator being 1 to signify there was not change due to the treatment
- One critical region in the one tail
- Critical value and alpha just like t-tests