General Statistical Facts Flashcards

(24 cards)

1
Q

Assumptions of Independent ANOVAs

A
  • Independence of data points
  • Homogeneity/equality of variance = Levene’s (pass if p>0.05 - if fail = Welch’s correction
  • Normality of residuals = Shapiro-Wilk’s
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Assumptions of a Repeated Measures ANOVA

A
  • Independence of data points = Durbin-Watson
  • Spherical – homogeneity/equality of difference between groups = Mauchy’s (pass if p>0.05) - if fail – Greenhouse Geisser correction
  • Normality of residuals = Shapiro-Wilk’s
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Assumptions of Multiple Linear Regression

A
  • Linear relationship
  • Independence of data points
  • No autocorrelation occurring = Durbin-Watson
  • No multicolineraity occurring = VIF < 10
  • Homoscedastic relationship = look at scatterplot of residuals
  • Normal distribution of residuals = Shapiro-Wilks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Assumptions of Simple Linear Regression

A
  • Linear relationship
  • Independence of data points
  • No autocorrelation occurring = Durbin-Watson
  • Homoscedastic relationship = look at scatterplot of residuals
  • Normal distribution of residuals = Shapiro-Wilks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Standardised Residuals

A

A measure of the strength between the observed and expected values
If more than 5% are >2, may indicate a poor model
Should not get >3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cook’s Distance

A

A measure of the overall influence of a date point on the overall model
>1 = cause for concern

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Leverage Value

A

The influence of the data point on the predicted values (slope) - how far the data point is from the centre of gravity
0 (no influence) - 1 (complete influence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Standardised DFFit

A

How the fit of the model would change if the data point were to be removed
>1 = substantial influence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Standardised DFBeta

A

How the slope of the model (expected data points) would change if the data point were to be removed
>1 = substantial influence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Simple linear regression has the same output as…

A

Unpaired t-test

SLR - one of the variables is categorical using dummy variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Multiple linear regression has the same output as…

A

One-Way ANOVA - when MLR uses dummy variables!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Helmert

A

Orthogonal

Each category except the last is compared to the mean of the subsequent ones

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Difference

A

Orthogonal

Each category except the first is compared to the mean of the previous categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Deviation

A

Non-Orthogonal - need to perform post-hoc tests
Each category except the first (or last) is compared to the overall experimental effect
ie. 2 vs. 1, 2, 3, 4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Simple

A

Non-Orthogonal - need to perform post-hoc tests
Each category is compared to the first (or last) category
Similar to Dunnet’s post-hoc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Repeated

A

Non-Orthogonal - need to perform post-hoc tests

Each category is compared to the previous category

17
Q

Polynomial

A

Tests for trends in the data

Only makes sense to perform on ANOVAs where there is a logical order to the group

18
Q

Tukey

A

Equal sample sizes - good trade off between type 1 + 2 errors

19
Q

Bonferoni

A

Conservative - lacks power
Conserve type 1 error rate at the expense of the type 2 error rate
Use for repeated measures simple effect analysis

20
Q

Gabriels

A

Slightly different sample sizes

21
Q

Hochberg’s GT2

A

Very different sample sizes

22
Q

Games-Howell

A

Any doubt about the equality of variance assumption

23
Q

Standardised effect sizes

A

Small
R=.1
d= .2

Medium
R=.3
d=.5

Large
R=.5
D=.8

24
Q

Simpson’s Paradox

A

A trend appears in several different groups of data but disappears/reverses when there groups are combined

There is an apparent paradox between treatment outcomes and overall outcome due to hidden confounding variables