kk Flashcards by Axel Lilja

What is the dependent variable in the difference-in-differences (DID) model?

𝑝𝑜𝑙𝑙𝑢𝑡𝑖𝑜𝑛𝑖𝑡

It represents street level pollution for area 𝑖 of the city in year 𝑡.

How well did you know this?

Not at all

Perfectly

What does the treatment dummy (𝐷𝑖) represent in the DID model?

Equal to 1 if area 𝑖 is treated with the congestion charge policy, and 0 otherwise.

How well did you know this?

Not at all

Perfectly

What is the role of fixed effects for each area (𝛼𝑖) in the regression?

To control for unobserved characteristics of each area that may affect pollution levels

For example, different areas may have different baseline pollution levels.

How well did you know this?

Not at all

Perfectly

What do year fixed effects (𝛿𝑡) do in the regression?

Control for time-specific factors that affect pollution across all areas

For example, a nationwide policy change in 2018 affecting all areas.

How well did you know this?

Not at all

Perfectly

Why is the term 𝐷𝑖×𝑦2017𝑡 omitted in the regression specification?

It is redundant since it is the base year for the DID analysis.

How well did you know this?

Not at all

Perfectly

What is the key assumption required for the validity of the DID method?

The parallel trends assumption

This means that in the absence of treatment, the treated and control groups would have followed the same trend over time.

How well did you know this?

Not at all

Perfectly

Can the parallel trends assumption be tested in this case?

Yes, by examining pre-treatment trends in pollution between treated and control areas.

How well did you know this?

Not at all

Perfectly

Which coefficient estimates the causal effect of the congestion charge on pollution in 2018?

The coefficient for 𝐷𝑖×𝑦2018𝑡.

How well did you know this?

Not at all

Perfectly

Rewrite the regression model to estimate the overall difference in pollution among treated and control areas before and after treatment.

𝑝𝑜𝑙𝑙𝑢𝑡𝑖𝑜𝑛𝑖𝑡=𝛼𝑖+𝛿𝑡+𝛽𝐷𝑖+𝜀𝑖𝑡

How well did you know this?

Not at all

Perfectly

What does including a quadratic term for sleeping hours suggest about the relationship with grades?

A suspected non-linear relationship.

How well did you know this?

Not at all

Perfectly

What is the marginal effect of sleeping hours on the outcome?

The derivative of the outcome with respect to sleep_hours.

How well did you know this?

Not at all

Perfectly

What is the marginal effect when sleeping hours = 8?

Specific numerical value based on the model.

How well did you know this?

Not at all

Perfectly

What is the marginal effect when sleeping hours = 9?

Specific numerical value based on the model.

How well did you know this?

Not at all

Perfectly

What can be concluded about the optimal amount of sleep based on marginal effects?

It must lie somewhere between 8 and 9 hours.

How well did you know this?

Not at all

Perfectly

Are the controls for students’ age and sex considered ‘good’ or ‘bad’?

This depends on their relevance to the causal relationship being studied.

How well did you know this?

Not at all

Perfectly

Interpret the coefficient on the female dummy variable.

Study These Flashcards

Suggests a gender gap in academic performance of 6.875 points.

Under what conditions is it appropriate to control for cognitive ability?

Study These Flashcards

When it does not mediate the causal relationship between sleep hours and grades.

What does the lack of statistical significance of the interaction between ability and female suggest?

Study These Flashcards

That the interaction may not be relevant in explaining academic performance.

Why is adjusted R² often preferred over regular R² when comparing model specifications?

Study These Flashcards

It accounts for the number of predictors in the model.

Under what circumstances may missing students lead to sample selection bias?

Study These Flashcards

If the missingness is related to the outcome variable.

List all variables and parameters in the OLS model.

Study These Flashcards

Dependent variable, independent variables, error term, intercept, and slope coefficient.

What is the minimization problem used to derive the OLS estimator?

Study These Flashcards

Minimize the sum of squared residuals.

What do the first-order conditions (FOCs) imply about the sum of OLS residuals?

Study These Flashcards

The sum of residuals equals zero.

What does it mean for ̂𝛽1 to be an unbiased estimator of the true 𝛽1?

Study These Flashcards

On average, it equals the true parameter value across repeated samples.

What is the key assumption required for ̂𝛽1 to be unbiased?

The assumption of no omitted variable bias.

What does the 95% confidence interval of ̂𝛽1 mean?

It indicates the range in which the true parameter is likely to fall with 95% certainty.

What is meant by heteroskedasticity of the error term 𝑢𝑖?

The variance of the error term varies across observations.

kk Flashcards

(27 cards)