L07 Regressions Flashcards
(8 cards)
Regression
Used to predict the value of a dependent variable based on the value of one/more explanatory variables
Describe the characteristics of the line of best fit
Slope of the line and y intercept
Types of variables in regression
Outcome/Dependent/Criterion variable
Predictor/Independent/Exploratory variable
Multiple regression
If there are more than one predictor variable/IVs: how they contribute to DV
Plane of best fit
z = ax + by + c
Equation for regression
y = ax + b a = slope of the line b = y intercept
Slope of regression line
a in y = ax + b
The number of units that the regression line moves up the vertical axis for each unit of movement along the horizontal axis
Assumptions for regression
- Continuous level of measurement for both variables - ratio/interval
- Linear relationship between the two variables
- No significant outliers (checked with scatterplot)
- Lack of autocorrelation of residuals: Durbin Watson test ideally between 1.5-2.5
- Outcome variable comes from a normally distributed population
Residual
Difference between observed value of dependent variable (y) and predicted value (y hat)
Residual is denoted as e
Sum and mean of residuals is 0
A good line of best fit gives very small residuals for each data point
Autocorrelation
Correlation of errors/residuals on each point - residuals are not independent of each other No autocorrelation for a regression test Checked with Durbin-Watson Test Value should be between 1-3 ideally between 1.5 to 2.5