Week 4 (Multiple Regression) Flashcards
(39 cards)
What is a regression?
Extends a correlation (the relationship between two variables) by using one variable to predict scores on the other
Multiple Regression
-Extension of simple linear regression
-Explores impact of multiple predictor variables
-Tests these relationships in parallel
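In general form, a multiple regression with k predictors fits the equation Y = b0 + b1X1 + b2X2 + ... + bkXk + error, where each b (beta) weights one predictor's contribution to the outcome Y.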
Regression VS ANOVA
Regression
-Focuses on relationships between predictor variables and one outcome variable
Factorial ANOVA
-Focuses on differences in scores on the dependent variable, according to two or more independent variables.
Regression VS ANOVA (Requirements)
Regression
-Predictor variables can be continuous, ordinal, or binary; the outcome variable must be continuous.
-One hypothesis per predictor
Factorial ANOVA
-IVs must be categorical, each with 2+ conditions; the dependent variable must be continuous.
-One hypothesis per IV (e.g. 2 for two IVs) and one hypothesis for the interaction (so 3 in total)
Types of multiple regression
-Forced Entry
-Hierarchical Multiple Regression
-Stepwise Multiple Regression
Forced Entry Multiple Regression
-Predictors based on previous research and theory
-No particular order is specified for entering the variables
-All variables are forced into the model at the same time
-Known as Enter method in SPSS
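The deck refers to SPSS's Enter method; purely as an illustration outside SPSS, here is a minimal Python sketch (statsmodels) with made-up variable names and synthetic data, entering all predictors at once:

    import numpy as np
    import pandas as pd
    import statsmodels.api as sm

    # Hypothetical data: three made-up predictors and one continuous outcome
    rng = np.random.default_rng(0)
    df = pd.DataFrame(rng.normal(size=(100, 3)), columns=['stress', 'sleep', 'support'])
    df['wellbeing'] = 0.5 * df['sleep'] - 0.4 * df['stress'] + rng.normal(size=100)

    # Forced entry: every predictor goes into the model at the same time
    X = sm.add_constant(df[['stress', 'sleep', 'support']])
    model = sm.OLS(df['wellbeing'], X).fit()
    print(model.summary())   # b coefficients, R-squared, and a p-value per predictor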
Hierarchical Regression
-Predictors based on previous research
-Researcher decides the order in which predictors are entered into the model
-Known predictors are entered first, then new predictors
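A minimal sketch of the same idea, assuming synthetic data: the known predictors go in as a first block, the new predictor as a second block, and the change in R-squared shows what the new block adds:

    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(1)
    n = 150
    known = rng.normal(size=(n, 2))   # block 1: predictors known from past research
    new = rng.normal(size=(n, 1))     # block 2: the new predictor of interest
    y = known @ [0.6, 0.4] + 0.3 * new[:, 0] + rng.normal(size=n)

    step1 = sm.OLS(y, sm.add_constant(known)).fit()
    step2 = sm.OLS(y, sm.add_constant(np.column_stack([known, new]))).fit()

    print(step2.rsquared - step1.rsquared)   # R-squared change due to the new block
    print(step2.compare_f_test(step1))       # F test of that change: (F, p, df)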
Stepwise regression
-Based on mathematical criteria rather than previous research/theory
-Has both forward and backward methods
-The computer programme selects the predictor that best predicts the outcome and enters it into the model first
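A rough sketch of the forward method, using adjusted R-squared as the mathematical criterion (statistical packages may use other criteria such as F-to-enter); the helper function, data, and column names here are all made up:

    import numpy as np
    import pandas as pd
    import statsmodels.api as sm

    def forward_select(df, outcome, candidates):
        """Greedy forward selection: keep adding the candidate predictor that most
        improves adjusted R-squared; stop once no candidate improves it."""
        chosen, remaining, best = [], list(candidates), -np.inf
        while remaining:
            trials = []
            for var in remaining:
                X = sm.add_constant(df[chosen + [var]])
                trials.append((sm.OLS(df[outcome], X).fit().rsquared_adj, var))
            adj_r2, var = max(trials)   # best single addition this round
            if adj_r2 <= best:
                break
            best = adj_r2
            chosen.append(var)
            remaining.remove(var)
        return chosen

    rng = np.random.default_rng(2)
    df = pd.DataFrame(rng.normal(size=(200, 3)), columns=['a', 'b', 'c'])
    df['y'] = 0.7 * df['a'] + 0.2 * df['b'] + rng.normal(size=200)
    print(forward_select(df, 'y', ['a', 'b', 'c']))   # typically picks 'a' first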
Assumptions of multiple regression
- Sample Size
- Variable Types
- Non-zero variance
- Independence
- Linearity
- (Lack of) Multicollinearity
- Homoscedasticity
- Independent Errors
- Normally Distributed Errors
Variable types (Regression Assumption)
-All predictor variables should be quantitative
*Can be continuous, categorical, or ordinal
-Outcome variable must be quantitative and continuous
Non-zero variance (Regression Assumption)
-Predictor variables should have a variance
-In other words, they should not have a variance of zero
Independence (Regression Assumption)
-All values of the outcome variable should be independent
-Each value of the outcome variable should be a separate entity
Linearity (Regression Assumption)
-Assumes that the relationship between the predictors and the outcome variable is linear
-If the analysis is run on a non-linear relationship, the model can be unreliable
Sample Size of Regression (Regression Assumption)
More is better
-Field (2010) suggests the following equations to identify an appropriate sample size
50 + 8k (to test the overall model), where k = number of predictor variables
104 + k (to test individual predictors)
-Or a power analysis can be used
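As a worked example of those rules of thumb (50 + 8k for the overall model, 104 + k for individual predictors), a tiny sketch with a made-up helper name:

    def minimum_n(k):
        """Larger of the two rule-of-thumb sample sizes for k predictors."""
        return max(50 + 8 * k, 104 + k)

    print(minimum_n(3))   # max(50 + 24, 104 + 3) = max(74, 107) = 107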
Multicollinearity (Regression Assumption)
-Strong correlation between predictor variables
*Perfect collinearity occurs when there is a correlation of 1 between predictors
-Makes results difficult to interpret
-Untrustworthy beta values
-Can't identify the individual importance of each predictor
-Limits the size of R-squared
-Threatens the validity of the model produced
Identifying multicollinearity (Regression Assumption)
-VIF (Variance Inflation Factor)
*If the average VIF is substantially greater than 1, the regression might be biased
*If the largest VIF is greater than 10, there is definitely a problem
-Tolerance
*Tolerance below 0.1 indicates a serious problem
*Tolerance below 0.2 indicates a potential problem
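A minimal Python sketch (statsmodels) showing how VIF and tolerance flag the problem, using deliberately correlated synthetic predictors:

    import numpy as np
    import statsmodels.api as sm
    from statsmodels.stats.outliers_influence import variance_inflation_factor

    rng = np.random.default_rng(3)
    x1 = rng.normal(size=200)
    x2 = 0.9 * x1 + 0.1 * rng.normal(size=200)   # strongly correlated with x1
    x3 = rng.normal(size=200)
    X = sm.add_constant(np.column_stack([x1, x2, x3]))

    # One VIF per predictor (column 0 is the constant); tolerance = 1 / VIF
    for i in range(1, X.shape[1]):
        vif = variance_inflation_factor(X, i)
        print(f'predictor {i}: VIF = {vif:.2f}, tolerance = {1 / vif:.2f}')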
What are residuals (Regression Assumption)
Distances between regression line and individual data points
Homoscedasticity (Regression Assumption)
-At each level of the predictor, the variance of the residuals should be constant
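A common way to eyeball this is a residuals-versus-fitted plot; a minimal sketch with synthetic data (an even, random spread around zero is what you want to see, while a funnel shape suggests heteroscedasticity):

    import numpy as np
    import statsmodels.api as sm
    import matplotlib.pyplot as plt

    rng = np.random.default_rng(4)
    X = sm.add_constant(rng.normal(size=(100, 2)))   # toy predictors
    y = X @ [1.0, 0.5, -0.3] + rng.normal(size=100)  # toy outcome
    model = sm.OLS(y, X).fit()

    plt.scatter(model.fittedvalues, model.resid, s=10)
    plt.axhline(0, linestyle='--')
    plt.xlabel('Fitted values')
    plt.ylabel('Residuals')
    plt.show()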
Independent Errors (Regression Assumption)
-For any two observations (data points), the residuals should not be correlated; they should be independent
-Whether this is an issue can be identified with the Durbin-Watson test
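A minimal sketch of the Durbin-Watson check in Python (statsmodels), on a toy model: the statistic runs from 0 to 4, values near 2 suggest independent errors, and values near 0 or 4 suggest correlated errors:

    import numpy as np
    import statsmodels.api as sm
    from statsmodels.stats.stattools import durbin_watson

    rng = np.random.default_rng(5)
    X = sm.add_constant(rng.normal(size=(100, 2)))   # toy predictors
    y = X @ [1.0, 0.5, -0.3] + rng.normal(size=100)  # toy outcome
    model = sm.OLS(y, X).fit()

    print(durbin_watson(model.resid))   # near 2 here, since the toy errors are independent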
Normally Distributed errors (Regression Assumption)
-The residual values in the regression model are random and normally distributed, with a mean of 0. I.e., there is an even chance of points lying above and below the best-fit line
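One way to check this (not named in the deck, so treat it as an illustration) is to inspect the residuals directly, e.g. their mean and a normality test; a minimal sketch with synthetic data:

    import numpy as np
    import statsmodels.api as sm
    from scipy import stats

    rng = np.random.default_rng(6)
    X = sm.add_constant(rng.normal(size=(100, 2)))   # toy predictors
    y = X @ [1.0, 0.5, -0.3] + rng.normal(size=100)  # toy outcome
    model = sm.OLS(y, X).fit()

    print(model.resid.mean())           # essentially 0
    print(stats.shapiro(model.resid))   # Shapiro-Wilk test of normality (W, p)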
How to check sample size (up-front assumption)
-Calculate the desired sample size in advance
How to check variable types (up-front assumption)
-Make sure your measures provide data appropriate for multiple regression
How to check non-zero variance (up-front assumption)
-Calculate the standard deviation of your variables and check that they have variance > 0
How to check independence (up-front assumption)
-A measurement issue (make sure your outcome scores are all from different people)
-Should not have two or more values on your outcome variable from the same person