Multiple regression model assumptions
MSR
MSR = RSS/k
MSE
MSE = SSE/(n−k−1)
SST
RSS+SSE
R2
RSS/SST
oppure
(SST-SSE)/SST
oppure
(total variation – unexplained variation )/total variation
indica quanto l’indipendent variable puo spiegare
Breusch pagan
n*R^2
Adjusted R2
1-((n-1)/(n-k-1))*(1-R^2)
o measure of goodness of fit that adjusts for the number of independent variables
o adj R2<R2
o decreases when the added independent variable adds little value to regression model
Cook’s D
If observation > √(k/n)–> influential point
Odds
Prob given odds
Odds= e^coefficient
Prob with odds = odds/(1+odds)
F statistic
((SSEr-SSEu)/q) / (SSEu/(n-k-1))
=MSR/MSE with K and N-K-1 df
H0 all coefficients are zero
reject H0 if F (test-statistic) > Fc (critical value)
to explain whether at least one coefficient is significant
Conditional Heteroskedasticity
Residual variance is related to level of independent variables
DETECTION
* Breusch–Pagan chi-square test
* >5% hetero
* <5% no hetero
CORRECTIOn
robust or White-corrected standard errors
Serial Correlation
Residuals are correlated with each other
DETECTION
* Breusch–Godfrey (BG) F-test
* Durbin Watson (DW)
* DW<2–> pos. serial corre.
CORRECtION
Use robust or Newey–West corrected standard errors
Multicollinearity
Two or more independent variables are highly correlated
DETECTION
* Conflicting t and F-statistics
* variance inflation factors (VIF)
* VIF >5 o 10 problema
CORRECTION
* Drop 1 of the correl. variables
* use a different proxy for an included independent variable
MISSPECIFICATIONS
Omission of important independent variable(s)–>May lead to serial correlation or heteroskedasticity in the residuals
Inappropriate transformation / variable form–> May lead to heteroskedasticity in the residuals
Inappropriate scaling–>May lead to heteroskedasticity in the residuals or multicollinearity
Data improperly pooled
Solve it by running regression for multiple periods–May lead to heteroskedasticity or serial correlation in the residuals
prob with odds
P=(odds)/(1+odds)
Autoregressive (AR) Model
Covariance Stationary
Mean Reversion
A time series is mean reverting if it tends towards its mean over tim
=b0/(1-b1)
Se b1 =1–> mean reverting è undefined perchè b0/0
Unit Root = Random walk
Random Walk
Seasonality
Root Mean Squared Error (RMSE)
to assess accuracy of autoregressive models.
* lower RMSE = better
* Out-of-sample forecasts
significant shift in the plotted data at a point in time that seems to divide the data into two distinct patterns
two time series are economically linked (same macro variables) or follow the same trend and that relationship is not expected to change