Multiple regression Flashcards
(63 cards)
What is the purpose of multiple regression?
To estimate the value of an outcome variable (Y) based on multiple predictor variables (X)
It extends upon the principles of simple linear regression.
What are predictor variables in multiple regression?
Variables that are used to predict the outcome variable
They can include continuous, ordinal, or binary data.
What is the main difference between regression and ANOVA?
Regression focuses on relationships between predictor variables and one outcome variable, while ANOVA focuses on differences in scores on the dependent variable according to two or more independent variables.
What is the Forced Entry method in multiple regression?
All predictor variables are entered into the model at the same time without a specified order
Known as the Enter method in SPSS.
What is Hierarchical Regression?
Predictors are entered into the model in a specified order based on previous research
New predictors can be entered all at once, hierarchically, or stepwise.
What is the Stepwise method in multiple regression?
A controversial method where the order of variable entry is based on statistical criteria rather than prior research.
What does the R² value represent in multiple regression?
The amount of variance accounted for by the model.
What are the assumptions of multiple regression?
Sample Size, Variable Types, Non-Zero Variance, Independence, Linearity, Multicollinearity, Homoscedasticity, Independent Errors, Normally Distributed Errors.
What is the rule of thumb for sample size in multiple regression?
10 participants for every one predictor variable.
What is Multicollinearity?
Strong correlation between predictor variables that can make interpreting results difficult.
What is the Durbin-Watson Test used for?
To test for correlations across error terms in the residuals.
What is homoscedasticity?
The variance of the residuals should be constant at each level of the predictor variable.
What does the term ‘independent errors’ refer to?
Residuals for any two observations should not correlate.
How should variables be coded in SPSS for binary predictors?
Categories must be coded as 0 and 1.
What is a common way to check for the assumption of linearity?
Analyzing residuals in SPSS.
What type of data should predictor variables be in multiple regression?
Quantitative, which can be continuous, categorical, or ordinal.
What happens if the assumptions of multiple regression are violated?
It can impact the validity of the results and confidence in the findings.
What are residuals?
The distances between the data points and the regression line.
What does a VIF value greater than 10 indicate?
There is likely a multicollinearity problem.
What is the significance of checking assumptions in regression analysis?
To ensure the model produced is reliable and valid.
What is the purpose of using power analysis in regression?
To determine an appropriate sample size based on the expected effect size.
What does the term ‘heteroscedasticity’ refer to?
Unequal variability of a variable across the range of values of a second variable.
True or False: The outcome variable in multiple regression must be continuous.
True.
Fill in the blank: In multiple regression, the regression line is also known as the _______.
line of best fit.