Regression Flashcards
Regression is…
used to understand relationship between variables
Independent variable (X)
Predictor or regressor
Dependent variable (y)
Outcome or reponse
Goal of regression
Predict changes in Y based on X
Correlation vs regression
Correlation - measures the strength and direction of a linear relationship
Regression Predicts Y based on X
Shared variance (r^2); Proportion of Ys variance explained by X
Simple linear Regression Equation
Y = B0 + B1 X
B0
Intercept: value of y when x is 0
B1
Slope: Change in Y per unit X
e
Error: Difference between observed and predicted Y
Regression using a SAMPLE of the populations
Sample estimates intercept and slope and predicted values of Y
Predicted values of Y are…
points on the Regression line that corresponds to the given value of X
Residuals (e’hat’) are…
distances between observed and predicted values of Y for corresponding X
Equations for the Slope (B1)
Equation for the Intercept (B0)
Correlation equation
What do you do with the r to find the proportion of shared variance?
rxy^2
Square it
1-r^2xy is the…
Variance of Y independent of X
Suppose we observe a high correlation between a child’s weight and their reading ability. This correlation is likely due to age, how can we combat the confounds?
We can control for the hypothesized influence of age on reading ability by removing the shared variance between age and weight
Squared Multiple Correlation (R-squared) formula:
The SMC represents the proportion of variance in Y shared with (or “explained by”) the set of all X variables
Numerator: proportion of non-redundant variance in Y shared with X1 and X2
Shared variance in Prediction
In two predictor regression, we are interested in imposing statistical control over X2 to test the unique effects of X1
Goal of Multiple Regression
- Evaluate the unique effect of X predictors on Y outcomes (holding constant other X)
- Determine the incremental contribution of new X predictors to estimating variance in Y (in addition to X already in the model)
- Determine the amount of variance explained in Y from a set of X predictors
To determine Incremental contribution to the model we use…
squared semi partial correlation
To determine variance explained in Y we use…
Squared multiple correlation
Regression is a method of finding an equation to describe…
The line of best for a set of data