One-Way ANOVA Flashcards

Question 1

Q

What method tests significant difference between two independent samples?

Question 2

Q

What is the formula for determining how many t-tests are required to compare n samples? How many samples are required for 10 samples?

Answer

A

Number of t-tests with n samples = n! / (2!(n-2)!)
10 samples:
- 10! / 2! * 8!
- 10 * 9 / 2
- 45

Question 3

Q

For three or more samples, how do you find the distance/variability between means?

Answer

A

Find the average squared deviation of each sample mean from the total mean.
This “total mean” is known as the Grand Mean
xbar_G

Question 4

Q

When it comes to the Grand Mean, if the sample sizes are the same for each sample group, how can the Grand Mean be determined?

Answer

A

If each sample size is the same, then the “mean of means” can be used.
For example:
- Samples X, Y, and Z
- Each sample is the same size (n)
- The mean can then be determined by:
  - Adding each average and dividing by the total number of samples (3)
  - (xbar + ybar + zbar) / 3
When does this approach not work?
- When the sizes of the sample are not the same.
- Then you have to add the average of all samples and divide by the total number of values (sample size of each sample group, added together)

Question 5

Q

What conclusions can we draw from the deviation of each sample mean from the mean of means?
- What is this known as?
What conclusions can we draw from the variability of each sample mean from the mean of means?
- What is this known as?

Answer

A

Between-group variability
- The smaller the distance between sample means, the less likely population means will differ significantly
- The greater the distance between sample means, the more likely the population means will differ significantly.
Within-group variability
- The greater the variability of each individual sample, the less likely population means will differ significantly
- The smaller the variability of each individual sample, the more likely population means will differ significantly.

Question 6

Q

ANOVA

Answer

A

Analysis of Variance
a collection of statisitcal models and their associated procedures (such as “variation” among and between groups) used to analyze the differences among group means.
In its simplest form, ANOVA provids a statistical test of whether or not the means of several groups are equal, and therefore generalizes the t-test to more than two groups.
ANOVA is useful for comparing (testing) three or more means (groups or variables) for statistical significance.
Hypothesis testing:
- H_o: M₁ = M₂ = M₃
- H_A: At least one pair of samples is significantly different

Question 7

Q

What does it mean if you get a large statistic during an ANOVA test?

Answer

A

Two means are causing between subject variability
You will reject the null hypothesis (accept the alternative hypothesis)
You will need to do an additional step to find out which means are different from each other.
- These additional tests are called multiple comparison tests

Question 8

Q

During an ANOVA test, if the variance of a “within-group” individual sample becomes bigger (all else held constant) what does this mean?

Answer

A

The between sample means are not significantly different
We accept the null hypothesis
Our test statistic will be smaller because there is a larger within group variability

Question 9

Q

During an ANOVA test, if the between group variability increases (the sample means get further apart from each other), what does this mean?

Answer

A

At least one pair of samples is significantly different
We accept the alternative hypothesis (reject the null hypothesis)

Question 10

Q

What is the statistic for the ANOVA test?

Answer

A

F statistic
F = between-group variability / within-group variability
Reasoning:
- Increases in “between-group” variability means accepting the alternative hypothesis
  - Having “between-group” in the numerator will make a large F statistic when there are increases in “between-group” variability
- Increases in “within-group” variability means accepting the null hypothesis
  - Having “within-group” variability in the denominator means a small F statistic when there are increases in “within-group” variability

Question 11

Q

What is the formula for ANOVA?

Answer

A

F = between-group variability / within-group variability
F = ( SS_between / df_between ) / ( SS_within / df_within )
F = MS_between / MS_within

Question 12

Q

The F-statistic is _________ negative.

Question 13

Q

SS_total

Answer

A

SS_total = SS_between + SS_within
SS_total = sum(x_i - xbar_G)²

Question 14

Q

df_total

Answer

A

df_total = df_between + df_within = N -1

Question 15

Q

What does the F-distribution look like?

Answer

A

Righ (positive) skewed
Peaks at ‘1’
- This is due to “no change” in the numerator and “no change” in the denominator being 1 to signify there was not change due to the treatment
One critical region in the one tail
- Critical value and alpha just like t-tests

Question 16

Q

Clothing Example

Given the following datasets, calculate the individual means and the grand mean
- Snapzi: 15, 12, 14, 11
- Irisa: 39, 45, 48, 60
- LolaMoon: 65, 45, 32, 38

Answer

A

Snapzi
- xbar_s = 13
Irisa
- xbar_I = 48
LolaMoon
- xbar_L = 45
Grand Mean
- xbar_G = 35.33

Question 17

Q

Clothing Example

Given the following mean values, calculate the SS_between
- xbar_s = 13
- xbar_I = 48
- xbar_L = 45
- xbar_G = 35.33

Answer

A

SS_between = n*sum(xbar_k - xbar_G)²
SS_between = 4( (13-35.33)²+ (48-35.33)²+ (45-35.33)²)
SS_between = 4(498.63 + 160.528 + 93.508)
SS_between = 3010.67

Question 18

Q

Clothing Example

Given the following mean and sample values, calculate the SS_within
- xbar_s = 13
  - Snapzi: 15, 12, 14, 11
- xbar_I = 48
  - Irisa: 39, 45, 48, 60
- xbar_L = 45
  - LolaMoon: 65, 45, 32, 38
- xbar_G = 35.33

Answer

A

Snapzi
- 15
  - x_i - xbar_s = 15 - 13 = 2
  - (x_i - xbar_s)² = 4
- 12
  - x_i - xbar_s = 12 - 13 = -1
  - (x_i - xbar_s)² = 1
- 14
  - x_i - xbar_s = 14 - 13 = 1
  - (x_i - xbar_s)² = 1
- 11
  - x_i - xbar_s = 11 - 13 = 2
  - (x_i - xbar_s)² = 4
- Sum(x_i - xbar_s)² = 10
Repeat for the other two:
- Sum(x_i - xbar_I)² = 234
- Sum(x_i - xbar_L)² = 618
Then calculate SS_within:
- Sum(x_i - xbar_K)² = 10 + 234 + 618 = 862

Question 19

Q

Clothing Example

Given the attached image information, calculate the following
- df_between
- df_within

Answer

A

df_between
- n - 1
- 3 - 1 = 2
df_within
- N - K
- 12 - 3 = 9

Question 20

Q

Clothing Example

Given the following data (calculated in the prior examples) calculate the:
- MS_between
- MS_within
- F-statistic
SS_between= 3010.67
SS_within = 862
df_between= 2
df_within = 9

Answer

A

MS_between = SS_between / df_between
- 3010.67 / 2 = 1505.34
MS_within= SS_within / df_within
- 862 / 9 = 95.7
F-statistic = MS_between/ MS_within
- 1505.34 / 95.7 = 15.72

Question 21

Q

Clothing Example

Given the following information, calculate the F-critical value and decide whether to accept or reject the null hypothesis:
- df_between = 2
- df_within = 9
- F-statistic = 15.72

Answer

A

f-table
- Find the df_between (numerator) along the x-axis = 2
- Find the df_within (denominator) along the y-axis = 9
- F-critical = 4.2565
Accept or reject null hypothesis:
- Since the F-statistic (15.72) is greater than the F-critical value (4.2565), we reject the null hypothesis in favor of the alternative hypothesis

Question 22

Q

As the variability within treatment groups increases, the likelihood of rejecting the null hypothesis _____________ ?

Answer

A

decreases

Question 23

Q

In ANOVA, the differences between treatment groups (between-group variances) contributes to ____________ ?

Answer

A

the numerator of the F-ratio

Question 24

Q

What is NOT a potential source of variation within a treatment group?

Answer

A

Treatment effects
Treatment effects would only increase variability between groups, not within a treatment group.

Question 25

Q

As the variability between treatment groups gets larger and larger (assuming the within-group variability reamins relatively constant), the likelihood of rejecting the null hypothesis ____________ ?

Answer

A

Increases

Question 26

Q

If the null hypothesis is TRUE, then on average the F-ratio for ANOVA is expected to have a value near __________ ?

Question 27

Q

Which combinations of factors is most likely to produce a large F-value?

Answer

A

Large mean differences, small within-group variability

Question 28

Q

What is a Multiple Comparison Test?

What is an example of a Multiple Comparison Test?

Answer

A

When an ANVOA test results in the rejection of the null hypothesis, a Multiple Comparison Test helps determine which group means are different
Tukey’s Honestly Significant Difference

Question 29

Q

Tukey’s Honestly Significant Difference

Answer

A

Compares the differences between any two groups means
Allows us to make pairwise comparisons
THSD = q * sqrt( MS_within / n)

Brainscape's Knowledge GenomeTM

One-Way ANOVA Flashcards

Brainscape's Knowledge Genome^TM