Tutorial 5 - Panel data, Pooled OLS, Random-Effects-Estimator, Fixed-Effects-Estimator Flashcards

1
Q

What is panel data?

A

panel data or longitudinal data are multi-dimensional data involving measurements over time. Panel data contain observations of multiple phenomena obtained over multiple time periods for the same firms or individuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How does a Linear Panel Data Model look like?

A
  • i = 1, .., N cross-sectional dimension (e.g. persons, firms, countries)
  • t = 1, …, T time-series dimension (e.g. years)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does a stacked model for all NT observations look like?

A
  • i = 1, .., N cross-sectional dimension (e.g. persons, firms, countries)
  • t = 1, …, T time-series dimension (e.g. years)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does Pooled OLS estimator mean and how is it defined?

A

pooling all NT observations and applying OLS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which condition needs to be met for the POLS to be consistent?

A

POLS is only consistent under the assumption of weak exogeneity of the regressors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the variance of the POLS error term?

A

NT × NT matrix:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What may the error variance matrix Ω of POLS contain and what not?

A

it may contain:

  • heteroscedasticity, i.e. different variances of error terms across individuals
  • autocorrelation of error terms within individuals over time

BUT:

  • Assumption: no correlation of error terms across individuals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the Breusch-Godfrey-Test test?

A

Test for serial correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does a model with serial correlation look like?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the steps for applying the Breusch-Godfrey test?

A
  1. Estimate an (P)OLS regression to estimate the model
  2. Using the estimated residuals ûᵢ as an estimate for û, estimate following model (below)
  3. Test (n − p)R² against critical value from X²ₚ (p needs to be justified, e.g. how many observations per person (cluster))
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is μᵢ in a model with random effects?

A

individual-specific, time-constant unobserved effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the assumptions for Random Effects?

A

see (1), (2), (3) below

with (3): the regressors are strictly exogenous, i.e. mean independence of composite error term and regressors of all (!) time periods. Assumption is violated if the individual-specific effect μᵢ is correlated with xᵢₜ!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What effect does μᵢ have on the composite error (in random effects models)?

A

Cov (ϵᵢₛ, ϵᵢₜ) ≠ 0

Note: RE explicitly models the serial correlation in a GLS framework

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does the variance between two individual observations look like in a model with random effects?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Show that there is serial correlation of error terms within individuals over time in a model with random effects.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Is there autocorrelation across individuals in a model with random effects?

A

No autocorrelation across individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How does the Var(ϵ) look like in a model with random effects?

A
  • Var(ϵ) = Ω, a NT × NT matrix.
  • However, under the RE assumptions, it only includes two distinct parameters: σμ² and σₑ².
  • In practice, Ω has to be consistently estimated -> ^Ω
18
Q

What is the random effects estimator?

19
Q

What are problems both with POLS and RE that Fixed Effects and First Differences can fix?

A

POLS and RE are inconsistent if the individual-specific effect μᵢ is correlated with xᵢₜ (example: individual-specific ability could be correlated with education, or with joining a union)

20
Q

How do Fixed Effects and First Differences handle the case that the individual-specific effect μi is correlated with xit ?

A

Fixed Effects and First Differences allow for this correlation by using only variation in xᵢₜ within individuals over time -> μᵢ cancels out -> controls for time-constant selection bias

21
Q

How can you find an expression for fixed effects?

A

Fixed Effects = POLS on time-demeaned data:

22
Q

How can you find an expression for first differences?

A

First Differences = POLS on first-differenced data:

23
Q

How would a fixed effects model be different if estimated on the data of the model shown below?

A
  • variables educ and black drop out because they don’t change over time for a given individual
  • FE is the same as a “Least Squares Dummy Variable Estimator” - here: POLS with a dummy for each individual
  • Time-constant variables like educ drop out, but we can estimate the interaction of educ with time dummies -> i.e., we can’t estimate the level of the return to education, but we can estimate whether the return changed over time
24
Q

What do the different terms in the model mean?

A

index ᵢₜ: not only identify the person, but also the time of the observation
λₜ stands for control variables (one for each year, except the base year) = like a dummy for each year
μᵢ = 545 values, one for each observation

25
Which four characteristics/assumptions describe the multiple linear regression mode?
* **Linearity:** * **Independence:** * **Strict Exogeneity**: * **Error Variance**:
26
What is linearity in this model?
**Linearity**: The model is linear in parameters α,β,γ, effect ci and error uit
27
What is independence in this model?
**Independence:** Xi, zi, yi are i.i.d. = independent and identically distributed. The observations are independent across individuals but not necessarily across time.
28
What is exogeneity in this model?
**Strict Exogeneity**: The idiosyncratic error term uit is assumed uncorrelated with the explanatory variables of all past, current and future time periods of thesame individual. This is a strong assumption which e.g. rules out lagged dependent variables.
29
What is assumed about error variance in this model?
30
What is ci in the random effects model?
In the random effects model, the individual-specific effect is a random variable that is uncorrelated with the explanatory variables
31
Which three additional assumptions make this a random effects model?
* Unrelated effects * Effect variance * Identifiability
32
What is the assumption of related effects in the fixed effects model?
That the assumption from the random effects does not hold: E [ci | Xi, zi] ≠ 0 assumes that the individual-specific effect is a random variable that is correlated with the explanatory variables of all past, current and future time periods of the same individual
33
What is the assumption of unrelated effects in the random effects model?
E [ci | Xi, zi] = 0 assumes that the individual-specific effect is a random variable that is uncorrelated with the explanatory variables of all past, current and future time periods of the same individual
34
What is the assumption of effect variance in the fixed effects model?
absence of assumption of constant variance of the individual specific effect
35
What is the assumption of effect variance in the random effects model?
assumes constant variance of the individual specific effect
36
What is ci in the fixed effects model?
In the fixed effects model, the individual-specific effect is a random variable that is allowed to be correlated with the explanatory variables
37
Which three additional assumptions make this a fixed effects model?
* Related effects * Effect variance * Identifiability
38
What happens if this random effects model is estimated by Pooled OLS?
* The pooled OLS estimator of α, β and γ is unbiased * It is consistent and approximately normally distributed * However, the pooled OLS estimator is not efficient * The usual standard errors of the pooled OLS estimator are incorrect -\> Correct standard errors can be estimated with the cluster-robust covariance estimator treating each individual as a cluster
39
What happens if this fixed effects model is estimated by Pooled OLS?
The pooled OLS estimators of α, β and γ are biased and inconsistent, because the variable ci is omitted and potentially correlated with the other regressors
40
What is the first difference etimator?
* The first-difference (FD) estimator is an approach used to address the problem of omitted variables with panel data. * The estimator is obtained by running a pooled OLS estimation for a regression of Δ yit on Δxit