Lecture 8 - Statistical Tests IV: ANOVA, Kruskal-Wallis, ANCOVA Flashcards

1
Q

the three statistical tests used for comparing two groups against a continuous variable:

A

students t-test: both groups are parametric with equal variances

welch’s t-test: both groups parametric but with unequal variance

Mann-Whitney U Test / Wilcoxon Test: non-parametric data (no assumptions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what does ANOVA stand for?

A

ANOVA means “analysis of variance”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what sort of variables are present in ANOVA testing?

A

response variables are continuous and explanatory variables are categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

ANOVA and linear regression are identical except for:

A

the type of explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does ANOVA actually do?

A

ANOVA compares means of factor levels (treatment levels), by analysing variances associated with them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

ANOVA procedure:

A

(1) choose a model

(2) estimate the parameters of the model

(3) model fit: how well does the model describe out data?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

command for creating an ANOVA model in R:

A

> m1<-aov(y.v~x.v)

m1 = model name (can be anything)

> summary.aov(m1)
summary.lm(m1)
tapply(y.v, x.v, mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

ANOVA assumptions:

A
  • residuals are normally distributed
  • variance is associated with the distribution of the residuals is constant (i.e. variation in Y does not differ among the factor levels)
  • individual measurements are independent
  • data comes from a random sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

diagnostic ANOVA plot in R:

A

> plot(m1)

  • we want “sky at night” and a normal-q-q line with data stuck to the line
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

after completing your ANOVA diagnostic plots [plot(m1)] you have results that show non-parametric data, what do you do?

A

if you dont recieve “sky at night” or residuals following the line tightly you must use the non-parametric equivalent of the ANOVA test - the Kruskal-Wallis test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

kruskal-wallis test command:

A

kruskal.test(y-variable ~ x-variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is the difference between the ANOVA and Kruskal-Wallis Test?

A

the difference is that ANOVA testing has assumptions but Kruskal-Wallis has no assumptions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

similarities in ANOVA and linear regression testing:

A
  • both are parametric tests
  • response variable continuous and explanatory variable categorical
  • compares means of factor levels (treatment levels) by analysing variances associated with them
  • assumptions: residuals are parametric, variance homogeneity; individual measurements are independent; data from random sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is ANCOVA?

A

ancova is a analysis of co-variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does ANCOVA combine?

A

ANCOVA combines elements of linear regression and ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

in ANCOVA, what are the response and explanatory variables:

A

response variable: continuous
explanatory varibale(S): categorical and continuous

17
Q

what is the ANCOVA process?

A
  • number of factor levels (or categories) = number of linear regressions
  • estimation of the slope and intercept for each factor level (category)
  • model simplification (principle of parsimony)
18
Q

models should be simplified until:

A

they are minimal adequate

“model should be as simple as possible, but no simpler”

19
Q

what is the only difference between ANOVA and linear regression?

A

linear regression is the relationship between two continuous variables whilst ANOVA is a test for the relationship between a continuous response variable and an explanatory categorical value