General Statistical Facts Flashcards
(24 cards)
Assumptions of Independent ANOVAs
- Independence of data points
- Homogeneity/equality of variance = Levene’s (pass if p>0.05 - if fail = Welch’s correction
- Normality of residuals = Shapiro-Wilk’s
Assumptions of a Repeated Measures ANOVA
- Independence of data points = Durbin-Watson
- Spherical – homogeneity/equality of difference between groups = Mauchy’s (pass if p>0.05) - if fail – Greenhouse Geisser correction
- Normality of residuals = Shapiro-Wilk’s
Assumptions of Multiple Linear Regression
- Linear relationship
- Independence of data points
- No autocorrelation occurring = Durbin-Watson
- No multicolineraity occurring = VIF < 10
- Homoscedastic relationship = look at scatterplot of residuals
- Normal distribution of residuals = Shapiro-Wilks
Assumptions of Simple Linear Regression
- Linear relationship
- Independence of data points
- No autocorrelation occurring = Durbin-Watson
- Homoscedastic relationship = look at scatterplot of residuals
- Normal distribution of residuals = Shapiro-Wilks
Standardised Residuals
A measure of the strength between the observed and expected values
If more than 5% are >2, may indicate a poor model
Should not get >3
Cook’s Distance
A measure of the overall influence of a date point on the overall model
>1 = cause for concern
Leverage Value
The influence of the data point on the predicted values (slope) - how far the data point is from the centre of gravity
0 (no influence) - 1 (complete influence)
Standardised DFFit
How the fit of the model would change if the data point were to be removed
>1 = substantial influence
Standardised DFBeta
How the slope of the model (expected data points) would change if the data point were to be removed
>1 = substantial influence
Simple linear regression has the same output as…
Unpaired t-test
SLR - one of the variables is categorical using dummy variables
Multiple linear regression has the same output as…
One-Way ANOVA - when MLR uses dummy variables!
Helmert
Orthogonal
Each category except the last is compared to the mean of the subsequent ones
Difference
Orthogonal
Each category except the first is compared to the mean of the previous categories
Deviation
Non-Orthogonal - need to perform post-hoc tests
Each category except the first (or last) is compared to the overall experimental effect
ie. 2 vs. 1, 2, 3, 4
Simple
Non-Orthogonal - need to perform post-hoc tests
Each category is compared to the first (or last) category
Similar to Dunnet’s post-hoc
Repeated
Non-Orthogonal - need to perform post-hoc tests
Each category is compared to the previous category
Polynomial
Tests for trends in the data
Only makes sense to perform on ANOVAs where there is a logical order to the group
Tukey
Equal sample sizes - good trade off between type 1 + 2 errors
Bonferoni
Conservative - lacks power
Conserve type 1 error rate at the expense of the type 2 error rate
Use for repeated measures simple effect analysis
Gabriels
Slightly different sample sizes
Hochberg’s GT2
Very different sample sizes
Games-Howell
Any doubt about the equality of variance assumption
Standardised effect sizes
Small
R=.1
d= .2
Medium
R=.3
d=.5
Large
R=.5
D=.8
Simpson’s Paradox
A trend appears in several different groups of data but disappears/reverses when there groups are combined
There is an apparent paradox between treatment outcomes and overall outcome due to hidden confounding variables