Econometrics 7: Dummy Variables Flashcards

(11 cards)

1
Q

What are dummy variables and why do we use them?

A

Dummy variables (also called binary, indicator, or zero-one variables) take values of 0 or 1.
Used to encode qualitative information in regression models.
Examples:
Femaleᵢ = 1 if individual i is female, 0 otherwise.
Marriedᵢ = 1 if individual i is married.
Recessionₜ = 1 if the economy is in recession in quarter t.
Econometrically, choosing Femaleᵢ vs. Maleᵢ makes no difference, but interpretation depends on context.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explain how to interpret the coefficient on a dummy variable in a regression.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why does the choice of reference group matter in dummy variable models?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do we handle multiple categories in dummy variables?

A

Note: the m − 1 dummy variables must be
▶ Mutually exclusive
▶ Exhaustive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are multiplicative dummy variables and how do we interpret them?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe the four regression outcomes based on interaction terms between dummy and continuous variables.

A

δ₀ = 0, δ₁ = 0 → Coincident regressions (same line).
δ₀ ≠ 0, δ₁ = 0 → Parallel regressions (same slope, different intercepts).
δ₀ = 0, δ₁ ≠ 0 → Concurrent regressions (same intercept, different slopes).
δ₀ ≠ 0, δ₁ ≠ 0 → Dissimilar regressions (different intercepts and slopes).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do we interpret interactions between two dummy variables?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does OLS estimate dummy variable coefficients?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define the Linear Probability Model and explain how it is used in econometrics.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

List and explain the limitations of using the Linear Probability Model.

A

Non-normality of errors:
Errors take only two values → not normally distributed.
But CLT helps with large samples.
Heteroscedasticity:
Variance of errors depends on Xᵢ.
Use robust standard errors.
Predicted probabilities outside [0,1]:
LPM doesn’t constrain predictions to valid probability range.
Constant marginal effects:
Assumes effect of Xᵢ on P(Dᵢ = 1) is the same across all Xᵢ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are alternatives to the Linear Probability Model?

A

Logit and Probit models:
Designed for binary dependent variables.
Address LPM’s limitations (e.g., predicted probabilities, heteroscedastic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly