Panel data-FE Flashcards

1
Q

The model for fixed effects is
yi = β0 + β1xi + ci + ui
What are some examples of what ci could be?

A

ci could represent the effects of ability, health, motivation, intelligence, parental resources, managerial quality, organizational culture, state/local policies or regulations, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the regression of the de-meaned y on de-meaned x look like, mechanically?

A

yit −y ̄i =β1(xit −x ̄i)+(uit −u ̄i)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does it mean to de-mean in the context of fixed effects?

A

within each panel unit i, take the average over t on both sides and subtract the average from each it observation:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are some examples of time-invariant explanatory variables that fall out of the fe model?

A

gender, race

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why do time-invariant explanatory variables fall out of the fe model?

A

They all equal their within-group mean, so the within-transformation equals zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

write out first difference model

A

∆yi =β1∆xi +∆ui

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do OLS assumptions apply to first difference model?

A

The new error term ∆ui is uncorrelated with the new explanatory variable, ∆xi .

This requires that we have no cross-period correlations between u and x: called strict exogeneity

The xi must vary over time for at least some i, else they difference out (same as the within transformation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does strict exogeneity require?

A

no cross-period correlations between u and x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

In theory, what happens to constant when you estimate first difference model?

A

Differences out–if you want, you can include it to allow for year-to-year trend

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What happens when you apply first difference model to multiple years?

A

each year of data is differenced with previous year, so you lose the first year in your dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

True or false: In the one-way fixed effects model, we treat ci as a parameter to be estimated

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mechanically, what are we doing when we estimate ci?

A

Effectively we are allowing for a unique intercept for every cross-sectional
unit i. This is feasible to estimate since each i is observed multiple times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

model for fixed effects?

A

yit = β0 + β1xit + ci + uit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What paramaters are we estimating when using fe?

A

intercept (B0), slope(B1), and fixed effects (which are n-1 intercepts)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does the LSDV (least squares dummy variable) model do?

A

includes (n-1) dummy variables in the regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

drawbacks of LSDV approach?

A
  • time-consuming
  • soaks up degrees of freedom
  • often not interested in the fixed effects themselves–(exception is the teacher effects work)
17
Q

When is FE more efficient than first difference?

A

FE is more efficient (smaller standard errors) than first differencing if the error terms are serially uncorrelated and T > 2

18
Q

True or false: FE Assumes no correlation in u across units of panel i

A

True

19
Q

Consistency and unbiasedness of fixed effects themselves in large samples?

A

The estimates of the fixed effects themselves (ci ) are unbiased but inconsistent in large samples. (Why? As the number of panel units grows (N → ∞) the number of parameters to estimate grows).

20
Q

What model does stata fit when you run xtreg?

A

(yit −y ̄i +y ̄)=β0 +β1(xit −x ̄i +x ̄)+(uit −u ̄i +u ̄)

21
Q

Things to ask yourself when you run fe

A

Where is the identification coming from?

How much variation is there within panel units?

22
Q

What happens when there is little variation within panel units?

A

You risk imprecise estimates

23
Q

xtreg ouput: What does the f test tell you? What is the null?

A

F-test for joint significance of fixed effects (null hypothesis H0 is that all fixed effects are zero). If rejected, fixed effects model is a reasonable assumption and regular OLS would provide inconsistent estimates. In practice, rarely rejected.

24
Q

xtreg output: what does R-squared within tell you?

A

variance “explained” by within-group deviations from mean

25
Q

xtreg output; what does R-squared between tell you?

A

variance in group means y ̄i “explained” by the group mean x’s: x ̄i

26
Q

xtreg output; what does sigma_u tell you?

A

estimate of the standard deviation in fixed effects (ci )

27
Q

Assumptions for FE?

A

FE.1: linear model yit = β1xit1 + … + βkxitk + ci + uit
FE.2: cross-sectional units are a random sample
FE.3: xit varies over time for some i, no perfect collinearity
FE.4: ∀t, E(uit|Xi,ci) = 0 or the expected value of u given x in all time periods is zero (strict exogeneity)
FE.5: Var (uit |Xi , ci ) = Var (uit ) = σu2 - homoskedasticity
FE.6: for t ̸= s errors are uncorrelated: Cov (uit , uis |xi , ci ) = 0. No serial correlation.

28
Q

What assumptions do you need for unbiasedness for FE and first difference?

A

FE.1: linear model yit = β1xit1 + … + βkxitk + ci + uit
FE.2: cross-sectional units are a random sample
FE.3: xit varies over time for some i, no perfect collinearity
FE.4: ∀t, E(uit|Xi,ci) = 0 or the expected value of u given x in all time periods is zero (strict exogeneity)

29
Q

What assumptions do you need for FE model to be BLUE?

A

FE.1: linear model yit = β1xit1 + … + βkxitk + ci + uit
FE.2: cross-sectional units are a random sample
FE.3: xit varies over time for some i, no perfect collinearity
FE.4: ∀t, E(uit|Xi,ci) = 0 or the expected value of u given x in all time periods is zero (strict exogeneity)
FE.5: Var (uit |Xi , ci ) = Var (uit ) = σu2 - homoskedasticity
FE.6: for t ̸= s errors are uncorrelated: Cov (uit , uis |xi , ci ) = 0. No serial correlation.

30
Q

When is fixed effects more efficient than the first difference model?

A

FE.6: for t ̸= s errors are uncorrelated: Cov (uit , uis |xi , ci ) = 0. No serial correlation.

31
Q

Where is variation in FE (within) model coming from?

A

uses deviations from unit means, e.g., mean “pre” vs. mean “post”

32
Q

Where is variation in first difference model coming from?

A

uses variation in successive time periods, e.g., just prior to and just after a “treatment” (a change in x)

33
Q

Is the assumpion that errors ui are iid typically satisfied in panels?

A

No–With repeat observations on the same cross-sectional unit, it is likely that errors are correlated across observations for the same i.

34
Q

How do you cluster standard errors in fe?

A

The “cluster” is typically the cross-sectional unit, although when the regressor of interest is aggregated at a higher level (e.g., state), can cluster at that level. Theory requires large N and that higher levels nest the cross-sectional units.

35
Q

Two advantages of fixed effects models?

A
  • Unobserved ui can be correlated with the explanatory variables
  • β1 is estimated using within-group (i) variation in x,y
36
Q

5 disadvantages of fixed effects models?

A

Cannot estimate slope coefficients for time-invariant x
Fixed effects “remove” a lot of the variation in y
The “within” model is less efficient (higher standard errors)
There may be more measurement error (and attenuation bias) when relying on within-group changes vs. levels
Group intercepts use up a lot of degrees of freedom