Econometrics - Panel/TS & hetero/autocorrelation Flashcards
(57 cards)
What is heteroskedasticity and why is it a problem?
Non-constant variance in errors - violates a classical assumption
What are 6 reasons heteroskedasticity occurs?
- Error-learning models
- Real income grows through time
- Improved data collection over time
- Outliers in a sample of data
- An incorrectly specified model
- Skews in distribution in an X variable
What are the consequences of heteroskedasticity?
No longer minimum variance so inefficient estimator, not BEST anymore so there can be another estimator that can produce smaller variance, there will be a breakdown in inference (std errors no longer unbiased so issues w t-tests)
What are the 3 main tests for heteroskedasticity?
- Goldfeld-Quandt Test (GQ test)
- Breusch-Pagan Test (BP test)
- White’s Test
How does the Breusch-Pagan test work?
Fit regression, then calculate the squared residuals, and fir a new model using the squared residuals, then calculate the chi-square test stat and p-value, and compare to sig level. Null hypothesis is homoskedasticity.
How can you correct for heteroskedasticity?
- Transform model e.g. logs/squares/inverse
- Robust standard errors
- Generalised Least Squares/Weighted Least Squares (GLS/WLS)
What is autocorrelation/serial correlation and why is it a problem?
Errors correlated with their previous value - violates classical assumption
When does serial correlation occur?
- Time-series data
- Spatially organised data
- Can be in cross section but less common
What causes serial correlation?
- Omitted lagged variables
- Economic shocks that have persistent effects
- Transformations applied to data
- Model misspecification
- Error term being truly dynamic
What is first-order autocorrelation?
Assume that the errors is correlated linearly only with its value in the previous period
What are the consequences of autocorrelation?
Residuals don’t have minimum variances so OLS isn’t BLUE.
R-squared may seem high
Standard errors may be baised downwards - OLS is inefficient and incorrect inferences may be made
What does the Durbin Watson Test test for?
First-order correlation
What values of the Durbin Watson test statistic indicate first-order autocorrelation?
DW -> 0 = positive autocorrelation
DW -> 2 = no autocorrelation
DW -> 4 = negative autocorrelation
What are 3 limitations of the Durbin Watosn test?
- Not valid in dynamic models as test stat biased to 2
- Only applies to first-order autocorrelation
- Bounds test doesn’t offer exact critical values so an element of doubt
What is a better test for serial correlation?
Bresuch-Godfrey Lagrange Multiplier Test
How is a Bresuch-Godrey LM test conducted?
Estimate OLS and obtain residuals, estimate auxiliary regression and then either compute LM test stat and compare to ch-squared dist OR use an F-test
What can be done to correct for serial correlation?
Employ robust standard errors (HAC standard errors)
What do hetertoskedasticity autocorrelated consistent (HAC) standard errors do?
Larger standard errors so less statistical significance
What is the difference between a True and Natural experiment
True = observations randomly assigned to different groups
Natural = not randomly assigned
What is the equations for a Difference-in-Difference model?
Y = b0 + b1Gi + b2Ri + b3(G.R) + error, where G=1 in treatment group, otherwise 0, and R=1 if observation is observed in period 2, otherwise 0
What is the interpretation of b3 in a general D-in-D?
Average treatment effect (ATE), captures the policy effect
What are the advantages of using panel data? (5)
- More information, more variability, less collinearity, greater degrees of freedom - hence more efficient
- Consider dynamic changes
- Detect/measure effects that can’t be observed in other data types
- Better model specific types of economic behaviour
- Large panels less likely to produce biased estimates
What are the 4 main types of panel models?
- Pooled OLS
- Fixed Effects Least Squares Dummy Variable Model (LSDV)
- Fixed Effects Within-Group Model
- Random Effects Model
Explain pooled OLS?
Pools data and estimates simple OLS - disregards time and entity dimensions