3. MLR Flashcards

1
Q

Purpose of Multiple Linear Regression

A

Explain the variation of Y.

( Each coefficient is interpreted as the estimated change in y corresponding to a unit change of its related variable, with all other variables held constant. )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Assumptions for error terms

A
  1. Normally distributed with mean = 0
    ( else not efficient estimators )
  2. Constant Variance
    ( fix heteroscedacity with weighted least squares )
  3. Error terms are independent of each other
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Common issues of MLR

A
  1. Overfitting ( non-contributing variables )
  2. Multicollinearity ( redundant variables )
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Testing for Overfitting

A

Single linear regression for each variable. Weak correlations signal non-contributing variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Testing for Multicollinearity

A

( After checking significance of each variable )

  • High correlation between explanatory variables
  • Sensitivities when including explanatory variables
  • Increase in Standard error, sharp decrease of adjusted R^2
  • Variance Inflation Factor (VIF) > 10
    ( VIF = 1/(1-R^2) )
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Testing significance

A

ANOVA - Analysis of Variance / F-Test ( MODEL )

p < alpha means variable explains some variation of y with stat. significance

  • t-Test ( INDIVIDUAL VARIABLES )
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Goodness of fit test measures

A
  • R^2
  • Adjusted R^2
  • MSE
  • F-TEST (VANOVA)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Formula for F-Test

A

MSR / MSE or (SSR/k) / (SSE/n-k-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Formula for R^2

A

R^2 = 1 - (SSE/SST)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly