What does correlation measure?
Strength of the linear relationship between two variables
= (Covx,y)/(σx*σy)
What does covariance measure?
The direction in which variables move
= [Σ(x-X)(y-Y)]/(n-1), where X and Y are the mean values
Slope of the regression line
= covariance/variance
Test for statistical significance
Standard error of estimate
= SQRT[SSE/(n-2)]
Total sum of squares (SST)
Regression sum of squares (RSS)
Sum of squared errors (SSE)
R-squared (coefficient of determination)
= (SST - SSE)/SST or RSS/SST
What does F test assess?
If at least one independent variable within a set explains variation in values of Y
F-statistic
= MSR/MSE
= (RSS/k)/[SSE/(n-k-1)]
One-tailed test
What is ρ value? How is it used?
Setting up a t-test for regression coefficients
= (estimated coefficient - hypothesized value)/standard error
Why is it necessary to adjust R squared? How?
= 1 - (1-Rsq)(n-1)/(n-k-1)
Heteroskedasticity
How to detect heteroskedasticity
- Breusch-Pagan chi-square test (n*Rsq)
Serial correlation (autocorrelation)
- Correct using the Hansen method by adjusting the coefficient standard errors
How to detect serial correlation
Durbin-Watson test
If autoregressive, use t-test on the residual autocorrelations over several lags
What is multicollinearity
Two or more independent variables are correlated with one another
How to detect multicollinearity
F test and R-sq show significance but p-values don’t
Autoregressive model
When a dependent variable is regressed against one or more lagged values of itself
Mean-reverting level for a time series
b0/(1-b1)
ARCH (autoregressive conditional heteroskedasticity)
Variance of the residuals in one period is dependent of the variance of the residuals in a previous period
What does the Dickey Fuller test check?
Covariance stationary and cointegration (two time series are economically linked)
How to test for ARCH
- If coefficient is statistically different from zero, then time series exhibits ARCH