Beast of Bias Flashcards

1
Q

what are the four assumptions of the linear model?

A

linearity and additivity, sphericity, homoscedasticity, normality of residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

which assumptions are the most and least important?

A

most important: linearity

least important: normality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is linearity and additivity?

A

the relationship between the outcome and the predictor should be linear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is sphericity?

A

error should be independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is homoscedasticity?

A

equal variances across predictors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is normality of residuals?

A

the residuals should be normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

how do you test for linearity?

A

plot of observed vs predicted should be linear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how do you test for sphericity?

A

plot of ZRESID vs ZPRED / any number >3 or

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

how do you test for homoscedasticity?

A

graph should not be funnel shaped or curved / the Robust F Statistic, welch and Brown Forsythe should share the same conclusion / variance ratio, okay if number is <2 / don’t use Levenes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how do you test for normality?

A

PP and QQ plots, case wise diagnostic, cook’s distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what do you do for a case wise diagnostic?

A

standardised residuals which exceed 2 or 2.5 divided by the total number of cases, multiplied by 100
95% should lie between 2
99% should lie between 2.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what do you do for cook’s distance?

A

any cases >1 are outliers / influencing the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is bootstrapping?

A

constructs a CI based on the data by sampling the amount of scores in the data set with replacement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what are outliers?

A

atypical cases at odds with the data / impact the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly