week 4: Multiple regressions Flashcards

(19 cards)

1
Q

what is a regression?

A

expands on correlation, examining whether we can estimate the value of an outcome variable (Y) on the basis of our predictor (X)
- how does Y change in relation to X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is forced entry regression?

A
  • predictors based on previous research and theory
  • do not state a particular order for the variables to be entered
  • all variables are forced into the model at the same time
  • the ‘enter method’
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is hierarchical regression?

A
  • predictors based on previous research
  • researcher decides the order in which the predictors are entered into the model
  • enter known predictors from prior research first, and then the new predictors
  • new predictors can be entered: all at once, in a hierarchical manner, in a stepwise manner
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is stepwise regression?

A
  • computer programme selects the predictor that best predicts the outcome and enters that into the model first
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is unstandardised beta?

A

change in Y for 1 unit change in X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is standardised beta?

A

change in Y for 1 standard deviation change in X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the R squared value measure?

A

how much variance is accounted for by the model (effect size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the assumptions of a multiple regression?

A
  • sample size
  • all predictor variables should be quantitative (continuous, categorical or ordinal), the outcome variable must be continuous
  • non-zero variance: predictor variables should show variance
  • independence: all values of the outcome variable should be independent
  • linearity: assume that the predictor and outcome variables have a linear relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How many participants are needed for every 1 predictor variable?

A

10 participants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

if you have 2 predictor variables, how many participants are needed for a small, medium and large effect size?

A
  • small: 478
  • medium: 67
  • large: 31
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

if you have 3 predictor variables, how many participants are needed for a small, medium and large effect size?

A
  • small: 543
  • medium: 76
  • large: 36
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

if you have 4 predictor variables, how many participants are needed for a small, medium and large effect size?

A
  • small: 597
  • medium: 84
  • large: 39
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is multicollinearity

A

strong correlation between predictor variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What 2 statistics identify multicollinearity?

A
  • VIF (Variance inflation factor): if the average VIF is much greater than 1, then regression may be biased. if largest VIF is greater than 10, there is definitely a problem
  • Tolerance: if tolerance is below 0.1 there is a serious problem, and a potential problem if below 0.2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is homoscedasticity?

A

at each level of the predictor variable, the variance of the residuals should be constant
- if the variance of the residuals are different, we have heteroscedasticity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what are residuals?

A

distances between the line of best fit and the individual data points

17
Q

What are independent errors?

A

for any two observations (data points) the residual points should not correlate, they should be independent

18
Q

how do you test for independent errors?

A

use a Durbin-Watson test
- tests whether residuals next to each other are correlated
- test statistic varies between 0-4
- a value of 2 means the residuals are uncorrelated
- a value greater than 2 indicates a positive correlation, and less than 2 indicates negative correlation
- Values greater than 3 and less than 1 indicate an issue

19
Q

How do you interpret a multiple regression?

A

model fit:
- R squared value
- ANOVA results (F and p-values)
to examine relationships:
- beta values
- intercept

Multiply the R squared value by 100 to get the percentage proportion of variance accounted for by the model.

F(3,31) = 81.07, p < .001

f = 81.07
3 = degrees of freedom
31 = residual