Multiple Regression Flashcards

1
Q

What does our model y hat equation look like when we add an additional predictor?

A

ŷ = b0 + b1(X1) + b2(X2)

We have our intercept (b0),
then our first regression coefficient (b1) going along with the first predictor,
then our second regression coefficient (b2) going along with the second predictor.

A residual term (+e) is added when modeling the observed y's.
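
In R, a model of this form could be fit with lm(); a minimal sketch, assuming a hypothetical data frame df with columns visits (Y), phyhealth (X1), and menhealth (X2):

  # fit a two-predictor regression and pull out b0, b1, b2
  fit <- lm(visits ~ phyhealth + menhealth, data = df)
  coef(fit)  # intercept (b0) and the two regression coefficients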

2
Q

Interpret the univariate information for each of the variables from a descriptives table: n, mean, sd, median, trimmed, mad, min, max, range, skew, kurtosis, and se.

A

n = sample sizes (per variable)

mean = should be relatively similar across variables; if not, a variable is measured on some other range (e.g., phyhealth mean is 3.6 while stress mean is 172).

median = used to check that data are symmetrically distributed - match it up with the mean.

trimmed mean = drops the extreme scores in the outer quartiles to take out skewness and outliers.

MEAN, MEDIAN, TRIMMED MEAN = should all be similar.

mad (median absolute deviation) = the typical absolute deviation from the median. It's another measure of dispersion; SD and MAD should be similar to one another.

min and max = just to check for coding errors.

skew = should be right around ZERO (between -3 and 3).

kurtosis = should be around ZERO (between -3 and 3).

se = standard error of the mean's sampling distribution (used as the standard error for a t-test).
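
These are the columns produced by the describe() function in the psych package; a minimal sketch, assuming the same hypothetical data frame df:

  # univariate descriptives: n, mean, sd, median, trimmed, mad,
  # min, max, range, skew, kurtosis, se
  library(psych)
  describe(df)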

3
Q

In the assumption of Multicollinearity, predictors cannot be correlated… so what does it mean if an rcorr function produces a correlation table with an .80 correlation between 2 predictors (mental health and physical health)?

A

Values above .80 are a concern for multicollinearity.

It makes it difficult to pinpoint the true predictor because of the shared variability between the two.
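
For reference, rcorr() comes from the Hmisc package and expects a numeric matrix; a minimal sketch with the hypothetical df:

  # correlation matrix plus sample sizes and p-values
  library(Hmisc)
  rcorr(as.matrix(df))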

4
Q

What is the first thing we do when looking at a regression model?

A

We look at the correlations between our predictors and the outcome variable - if a predictor is NOT correlated with the outcome, it's not a good predictor to add to the regression model.

5
Q

What are the steps for a regression write-up?

A

We first report our correlations in a table, then indicate whether the correlations are significant, their direction, and whether they match the hypothesized direction…

6
Q

How does our interpretation for a multiple regression change from SLR?

A

b0 = Our intercept is the point at which the regression plane intersects the Y-axis; i.e., the expected value of Y when both X1 and X2 = 0.
ex) The expected # of doctor visits when an individual reports NO mental AND NO physical health problems.

b1 = The change in the EXPECTED value of Y associated with a 1-unit increase in X1, OVER AND ABOVE THE EFFECT OF X2 (holding X2 constant).
ex) Holding mental health problems constant…

b2 = The change in the EXPECTED value of Y associated with a 1-unit increase in X2, OVER AND ABOVE THE EFFECT OF X1 (holding X1 constant).
ex) Holding physical health problems constant…

7
Q

What does the introduction to statistical control mean for our interpretation?

What is the technical term of this statistical control?

A

“Over and above the effect of…”

We are holding one predictor constant while watching how Y changes with the other.

It is officially called a ‘partial regression coefficient.’

8
Q

Conceptually, with the b1 coefficient equation, what are we trying to end up with?

And what are the steps towards getting rid of the variable?

What are we left with?

How is this process similar to Factorial ANOVA?

A

We just want the variability shared between X1 and Y, over and above the effect of X2 - so we incorporate statistical control: we take the total variability of X2 and multiply it by the portion shared between X1 and Y, then subtract the product of what’s shared between X1 and X2 and what’s shared between X2 and Y.

What’s left is what’s shared between X1 and Y without the contribution of X2.

This is similar to a main effect in Factorial ANOVA: the effect of X1 on Y, ignoring the levels of X2.
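
The verbal recipe above matches the standard two-predictor formula for b1 written with sums of squares and cross-products; a minimal sketch, with hypothetical vectors x1, x2, and y:

  # sp(a, b) = sum of cross-products of centered a and b; sp(a, a) = SS of a
  sp <- function(a, b) sum((a - mean(a)) * (b - mean(b)))
  b1 <- (sp(x2, x2) * sp(x1, y) - sp(x1, x2) * sp(x2, y)) /
        (sp(x1, x1) * sp(x2, x2) - sp(x1, x2)^2)
  b1  # matches coef(lm(y ~ x1 + x2))["x1"]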

9
Q

Conceptually, with the b2 coefficient equation, what are we trying to end up with?

A

It’s exactly the same as the b1 coefficient equation, but with X1 and X2 swapped.

We just want the effect of X2 on Y, over and above the effect of X1.

10
Q

What would a negative b2 coefficient indicate if X1 is physical health problems and X2 is mental health problems?

A

It reflects the shared variability between X1 and X2 - with so much overlap, the sign of a coefficient can flip once the other predictor is held constant.

11
Q

What is the b0 equation for MR?

A

b0 = ȳ - b1(X̄1) - b2(X̄2)

*THE SIGN MATTERS. IF THE b1 OR b2 COEFFICIENT IS NEGATIVE, THE MINUS SIGNS IN THIS EQUATION FLIP TO PLUS.

12
Q

What is the equation for Full Model for MR?

A

ŷ = b0 + b1(X1) + b2(X2)

*THE SIGN MATTERS. IF THE b1 OR b2 COEFFICIENT IS NEGATIVE, THE CORRESPONDING PLUS IN THIS EQUATION CHANGES TO MINUS.

13
Q

We ran a regression in R… last week in SLR, physical health problems was a GREAT predictor. Now that we have added mental health problems to the model, both predictors are non-significant. What does this mean?

What could you do if this happens?

A

It isn’t good to keep both predictors in the model - the addition of the mental health predictor did not improve our model fit.

We could either choose one of the variables OR, because they’re redundant, combine them into one predictor such as “general health problems” (see the sketch below).
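
One way to build the composite; a minimal sketch, with hypothetical column names (standardizing first so neither variable dominates):

  # average the z-scored predictors into a single "general health problems" score
  df$genhealth <- rowMeans(cbind(scale(df$phyhealth), scale(df$menhealth)))
  fit2 <- lm(visits ~ genhealth, data = df)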

14
Q

What is the point of adding multiple predictors in a MR?

A

We are trying to see if adding predictors improves our model fit with regard to predicting our Y (here, doctor visits).

15
Q

Conceptually, what are we comparing in a t-test?

A

The t statistic is the estimate (b) divided by the standard error of the b - we are comparing the size of the estimate to the variability we’d expect by chance.

16
Q

What is the residual standard error in MR?

What does it tell us?

A

Residual standard error = standard error of the estimate.

The standard error of the estimate is the standard deviation of the points around the regression plane.

Conceptually, it is a measure of MISFIT - the variability left unpredicted by our model.

17
Q

What is the df for multiple regression for 2 predictors and 10 individuals?

A

df = n - k - 1

= 10 - 2 - 1 = 7

18
Q

What does the Multiple R² tell us?

How is it computed?

Explain conceptually, each component of what’s used to compute the Multiple R².

A

It’s an effect size that tells us that, taken together, X1 and X2 account for some % of the variability in Y.

ex) Taken together, physical health problems and mental health problems account for 43% of the variability in doctor visits.

Multiple R-squared is computed as SSreg/SStot.

*SSregression = ∑(ŷ - ȳ)²
>This tells me the IMPROVEMENT we make in predicting Y by adding the predictors.

*SStotal = ∑(y - ȳ)²
>This is the total variability in Y around its mean - what’s left when we predict every case with just ȳ.

So essentially, we are seeing how much of the total variability is accounted for by the improvement the predictors give us over JUST the mean…
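
A minimal sketch of the computation, continuing the hypothetical df and fit from earlier cards:

  # multiple R-squared from the sums of squares
  fit   <- lm(visits ~ phyhealth + menhealth, data = df)
  ssreg <- sum((fitted(fit) - mean(df$visits))^2)
  sstot <- sum((df$visits - mean(df$visits))^2)
  ssreg / sstot  # equals summary(fit)$r.squared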

19
Q

What’s the degrees of freedom for the regression model?

*think about the model fit anova table

A

k, or the number of predictors

20
Q

What type of problem do we face with the Multiple r-squared when we add predictors?

A

k goes up - by adding more predictors, we essentially inflate the multiple r-squared, so the adjusted r-squared uses the df to penalize us for weak predictors, thereby lowering the effect size.

21
Q

When would the adjusted r-square go up?

A

The adjusted r-squared will only go up if the predictors we include are worth their weight in df.

22
Q

How should the multiple and adjusted r-square compare to each other?

A

The multiple and adjusted r-squared should be similar - that signifies our predictors are worth their df. If the predictors genuinely contribute to the model, we expect the adjusted r-squared to stay close to the multiple r-squared.

23
Q

What happens if there is a large gap between the multiple r-squared and adjusted r-squared?

A

We would ethically report the adjusted r-squared.

24
Q

What is the statistical sentence and interpretation for MR?

A

F (k, n-k-1) = F-value, p ≤ .05 (or p > .05 if non-significant).

ex) Taken together, physical and mental health problems are not good predictors of doctor visits.

25
Q

What is the ideal regression model?

A

Orthogonality between the predictors - that there is NO correlation or overlap between predictors.

26
Q

What are the quantities in the MR ANOVA table?
SSy
SSreg
SSres

A

It’s the same as SLR:

SSy(total) = ∑(y - ȳ)²
SSreg = ∑(ŷ - ȳ)²
SSres = ∑(y - ŷ)²
27
Q

Conceptual: If EVERYTHING in the SSreg stays the same from the SLR and MR ANOVA table, but the F-value is a lot smaller, why is the p-value all of a sudden non-significant?

A

The MSresidual got bigger due to the addition of the predictor: we lose a residual df, and the same SSreg is now spread over 2 regression df, so F = MSreg/MSres shrinks and the p-value rises.

28
Q

What is the model interpretation for a non-sig model?

A

The regression model does not significantly fit the data, such that X1 and X2, taken together, do not significantly predict Y, F(2, 7) = 2.64, p > .05.

29
Q

What is the formula and interpretation of R² in the multiple regression?

A

It is the coefficient of MULTIPLE determination.

Interpretation: It is the Proportion of variance in outcome accounted for by the SET of predictor variables.

Formula: R² = SSreg/SStotal

30
Q

What is the R² formula for UNCORRELATED X1 and X2 predictors?

A

R²y.12 = r²yx1 + r²yx2

31
Q

What is the first thing we do before we look at the significance of coefficients?

A

We look at the model information first, evaluate the model fit, THEN look at which predictors (coefficients) are significant or not.

We do this to see if the model is worthwhile.

32
Q

What is the MSresidual also called?

What does it tell us?

A

MSresidual = variance of the estimate.

It tells us the variance of our points around the regression line; if this number is big, there’s a lot of variability around the line, meaning too much is left unexplained by the model.

33
Q

What is the conceptual calculation of the standard error of the b-coefficients?

A

Numerator = MSresidual, or the variance of the estimate.

Denominator = the predictor’s sum of squares multiplied by (1 - the squared correlation between the two predictors); this factor removes the overlap (a large squared correlation means the predictors are highly correlated with one another).

All of it is square rooted.
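
A minimal sketch of that calculation for b1, continuing the hypothetical fit and variable names:

  # SE(b1) = sqrt( MSres / (SS_x1 * (1 - r12^2)) )
  msres <- sum(residuals(fit)^2) / df.residual(fit)  # variance of the estimate
  r12   <- cor(df$phyhealth, df$menhealth)           # overlap between predictors
  ssx1  <- sum((df$phyhealth - mean(df$phyhealth))^2)
  se_b1 <- sqrt(msres / (ssx1 * (1 - r12^2)))
  se_b1  # matches summary(fit)$coefficients["phyhealth", "Std. Error"]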

34
Q

Why is it a problem that the standard error is large due to large squared correlations?

A

Large squared correlations indicate that the 2 predictors are highly correlated with one another.

Since the SE is used to test the significance of the coefficients

(t = (b - 0) / SE)

a large SE makes the t-value small, so we’re less likely to get significance.

35
Q

What is the ß-coefficient?

A

2 things: Symbolically, it is the population coefficient, which is set to zero (under the null hypothesis) in the t-statistic.

It is ALSO the standardized regression coefficient.

36
Q

What’s the difference between unstandardized and standardized regression coefficients?

Why would we standardize regression coefficients?

A

The unstandardized b coefficients are expressed in the units of measure of the variables.

ß-coefficients, or standardized coefficients, allow us to estimate unit-free relationships among the standardized variables - each variable is divided by the estimate of its SD.

The meaning of b is contingent on the unit of measure (SD) of the x-variable.
ex) Is 4 a big number? It depends on what it’s being measured against. The ß-coefficients let us interpret BEYOND the constraints of the X-variable’s units.

> We also standardize because, on the raw scale, we can’t define a cut-off point for what constitutes a large residual.

37
Q

How do we standardize a b-coefficient into ß-coefficient?

What’s the point of standardizing?

A
  1. We convert all of the predictor variables into z-scores.
  2. We convert our outcome variable into z-scores.

When we standardize the variables to z-scores, we can compare the results to what we already know about the properties of z-scores.
ex) 99.9% of z-scores should lie between -3.29 and +3.29.

38
Q

What is the function in r to standardize a coefficient?

A

scale()
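
A minimal sketch of its use, with the hypothetical variable names from earlier cards:

  # z-score everything inside the model formula; the slopes become ß weights
  fit_std <- lm(scale(visits) ~ scale(phyhealth) + scale(menhealth), data = df)
  coef(fit_std)  # intercept ~ 0, slopes are standardized coefficients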

39
Q

What is the interpretation of ß coefficients?

Explain it conceptually.

A
ß1 = the standardized change in the expected value of Y associated with a 1 STANDARD DEVIATION increase in X1, OVER AND ABOVE the effect of X2.

ex) Although not significant, doctor visits are EXPECTED to increase by .72 STANDARD DEVIATIONS for every 1 STANDARD DEVIATION increase in physical health problems, OVER AND ABOVE the effects of mental health problems, t(7) = 1.51, p > .05.
40
Q

What is the formula for converting b-coefficients into ß-coefficients if you have SD information?

How about converting it backwards?

A

ß = b (Sx/Sy) ; b times standard deviation of x divided by the standard deviation of y.

b = ß (Sy/Sx) ; ß times standard deviation of y divided by the standard deviation of x.
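
A quick check of the conversion in R; a minimal sketch with the hypothetical fit and variable names:

  # convert the unstandardized slope for phyhealth into a ß, then back again
  b      <- coef(fit)["phyhealth"]
  beta   <- b * sd(df$phyhealth) / sd(df$visits)    # ß = b(Sx/Sy)
  b_back <- beta * sd(df$visits) / sd(df$phyhealth) # recovers b = ß(Sy/Sx)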

41
Q

What happens to the intercept term?

A

The intercept becomes zero after we standardize our coefficients because the z-score conversion sets the means of all the variables to zero.

42
Q

Interpret the nested model (the estimated full model), including the statistical sentence (statistically significant)…

F = 43.03
df = 3, 461
22% variance

A

TAKEN TOGETHER, reported physical health problems, mental health problems, and stress significantly predict # of doctor visits, F(3, 461) = 43.03, p < .05, accounting for 22% of the variance.

43
Q

Interpret the individual coefficients of the nested models.

A

Intercept: -3.7 doctor visits are expected when all other variables are zero, which is significantly different from zero, t(461) = 3.29, p < .05.

44
Q

What’s the point of reducing a full nested model into a nested submodel?

A

By setting certain coefficients to zero, it allows us to test a combination of predictors as a SET (ex: mental health and stress only, as a SET of predictors).

We then compute an R²∆ test to see if the change in model R² is significant. It tells us whether the SET of predictors contributes to the Full model in a significant way.

45
Q

Why do we compute an F test for R²∆?

What is the formula?

How do we interpret the change?

A

We compute the R²∆ test to see if the set of predictors dropped in our nested submodel contributes to our FULL model in a significant way.

Formula:
F = [(R²Full - R²Reduced) / (dfRegFull - dfRegReduced)] ÷ [(1 - R²Full) / dfResFull]

Interpretation: Taken together, X2 and X3 significantly contribute to our model.
ex) Taken together, mental health and stress significantly contribute to our model.

46
Q

How do we compute R²∆ in r?

A

anova(fitsub, fitfull)
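
A fuller sketch of the comparison, with hypothetical model and variable names:

  # full model vs. submodel with menhealth and stress set to zero
  fitfull <- lm(visits ~ phyhealth + menhealth + stress, data = df)
  fitsub  <- lm(visits ~ phyhealth, data = df)
  anova(fitsub, fitfull)  # F test of the R-squared change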

47
Q

What are we left with when testing an empty or null nested model?

What is the empty nested model?

A

The intercept - the empty (null) nested model is the intercept-only model, ŷ = ȳ. This is what we test our model against: we test the change in model fit over this empty model.

48
Q

What are the 3 different types of correlations?

A
  1. Zero-order correlations (Pearson r - the generic correlation showing shared variability, ignoring everything else)
  2. Partial correlations
  3. Semi-partial correlations
49
Q

Define Partial Correlations:

What does residualizing have to do with this?

A

A partial correlation is the amount of unique overlap between X and Y where Z has been removed from BOTH X and Y.

We remove Z from both by residualizing.

50
Q

Define Semi-Partial Correlations:

A

A semi-partial correlation looks at the unique amount of overlap between X and Y where Z has been removed from X but NOT from Y.
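
Both correlations can be computed by residualizing; a minimal sketch, with hypothetical variable names (Z = stress):

  # remove Z from Y and from X by regressing each on Z and keeping the residuals
  y_res <- residuals(lm(visits ~ stress, data = df))
  x_res <- residuals(lm(phyhealth ~ stress, data = df))
  cor(x_res, y_res)      # partial correlation: Z removed from both X and Y
  cor(x_res, df$visits)  # semi-partial: Z removed from X only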

51
Q

Which correlation between X and Y is bigger? Why?

A

The partial correlation - because Z’s variability has been removed from Y, the total variability in Y is smaller, so the shared portion takes up a larger proportion of what remains of Y.

52
Q

What kind of correlation are we running when answering the following question:

What would the relationship between physical health problems (X) and doctor visits (Y) be if I could literally hold stress (Z) constant?

A

Partial correlation

53
Q

What does Regress Y on Z mean?

A

We use Z as a predictor of Y - Z predicts Y in the model, and what’s left over is the residual of Y after controlling for Z.

ex) R²y.z = .59

54
Q

If a model is perfect to the sample data, where would the data points fall on?

A

All data points would fall on the regression line, meaning the residual would be zero.

55
Q

What is the biggest difference in the predictions of the b-coefficients between a SLR and MR?

A

For the b1 and b2 coefficients, we add the crucial phrase:

b1 = the change in the EXPECTED value of Y associated with a 1-unit increase in X1, OVER AND ABOVE the effect of X2.

Same for b2, except it would be OVER AND ABOVE the effect of X1.

56
Q

How is R-squared interpreted in MR?

A

TAKEN TOGETHER, X1 and X2 account for some % of the variability in Y.

57
Q

Know (in general terms) how the formulae for b’s, R2, and standard errors of b’s in MR with two predictors differ from those in SLR.

A

b0 = ȳ - b1(X̄1) - b2(X̄2)

b1 = SSCP(X1,Y) ÷ SS(X1) - but in MR this is adjusted for the overlap between X1 and X2 (the statistical control described earlier)

b2 = SSCP(X2,Y) ÷ SS(X2) - likewise adjusted for the overlap

58
Q

What are standardized coefficients, how are they computed (general statement, not formulae), and how are they interpreted?

A

Standardized coefficients (ß) allow us to estimate relationships without the constraint of the x-variables’ units of measure (the SDs of the x-variables otherwise limit us).

ß is computed by converting all predictor variables and the outcome variable into z-scores.

Interpretation of ß-coefficients: ß1 = the standardized change in the expected value of Y associated with a 1 STANDARD DEVIATION increase in X1, OVER AND ABOVE the effect of X2.

ex) Although not significant, doctor visits are EXPECTED to increase by .72 STANDARD DEVIATIONS for every 1 STANDARD DEVIATION increase in physical health problems, OVER AND ABOVE the effects of mental health, t(7) = 1.51, p > .05.

59
Q

What is the general form of the regression model using standardized coefficients?

A

Zy = ß1(Zx1) + ß2(Zx2) + e

60
Q

What happens to the intercept term after we standardize the regression coefficients?

A

The intercept practically becomes zero after we standardize our coefficients, because the z-score conversion sets the means of all the variables to zero.

61
Q

How can you convert an unstandardized coefficient to a standardized coefficient (and vice versa)?

A

ß = b (Sx/Sy) ; b times the standard deviation of x divided by the standard deviation of y.
(mnemonic: “ßeta b SeXy”)

b = ß (Sy/Sx) ; ß times the standard deviation of y divided by the standard deviation of x.

62
Q

Under what conditions would you want to examine standardized coefficients?

A

To create a consistent, standardized metric of measurement - we use standardized coefficients to compare predictors measured in different units.

63
Q

Know how to conduct the F test of R2 change. How do you interpret the R2 change value and the result of the test?

A

F = [(R²Full - R²Reduced) / (dfRegFull - dfRegReduced)] ÷ [(1 - R²Full) / dfResFull]

Interpretation: Taken together, X2 and X3 significantly contribute to our model.
ex) Taken together, mental health and stress significantly contribute to our model.