Final Exam Flashcards

(82 cards)

1
Q

The classical assumptions must be

A

met in order for OLS estimators to be the best available

2
Q

Classical Assumption #1

A

The regression model is linear, is correctly specified, and has an additive error term

3
Q

Classical Assumption #2

A

The error term has a zero population mean

4
Q

Classical Assumption #3

A

All explanatory variables are uncorrelated with the error term

5
Q

Classical Assumption #4

A

Observations of the error term are uncorrelated with each other (no serial correlation)

6
Q

Classical Assumption #5

A

The error term has a constant variance (no heteroskedasticity)

7
Q

Classical Assumption #6

A

No explanatory variable is a perfect linear function of any other explanatory variable(s) (no perfect multicollinearity)

8
Q

Classical Assumption #7

A

The error term is normally distributed

9
Q

Omitted Variable Bias (Conditions)

A

Bias arises when (1) the omitted variable is relevant (β2 ≠ 0) and (2) the included variable X1 and the omitted variable X2 are correlated

10
Q

Expected bias

A

Expected bias in β̂1 has two components: the sign of β2 and the sign of Corr(X1, X2)

11
Q

Limited Dependent Variables

A

We have discussed dummy variables (indicator variables, binary variables) as a tool for measuring qualitative/categorical independent variables (gender, race, etc.)

12
Q

linear probability model

A

simply running OLS for a regression where the dependent variable is a dummy (i.e., binary) variable:
D_i = β0 + β1X1i + β2X2i + ε_i,
where D_i is a dummy variable, and the Xs, βs, and ε are typical independent variables, regression coefficients, and an error term, respectively
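For concreteness, here is a minimal pure-Python sketch of fitting an LPM on invented data; it is nothing more than OLS with a 0/1 dependent variable:

```python
# Linear probability model sketch: ordinary OLS where the dependent
# variable is binary. Data below are invented for illustration.

def ols_simple(x, y):
    """Closed-form simple OLS: slope = cov(x, y) / var(x)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    b1 = sxy / sxx
    b0 = my - b1 * mx
    return b0, b1

x = [1, 2, 3, 4, 5, 6]      # an independent variable
d = [0, 0, 0, 1, 1, 1]      # dummy dependent variable D_i
b0, b1 = ols_simple(x, d)
# b1 is read as the change in P(D = 1) per 1-unit increase in X
```

Fitted values b0 + b1·x are read as probabilities, which is exactly where the unboundedness problem discussed later comes from.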

13
Q

the term, linear probability model

A

comes from the fact that the right side of the equation is linear while the expected value of the left side measures the probability that Di = 1

14
Q

Some issues with LPM

A

Ŷ_i can fall below 0 or above 1. A more fundamental problem with the linear probability model: nothing in the model requires Ŷ to be between 0 and 1! If Ŷ is not between 0 and 1, how do we interpret it as a probability? A related limitation is that the marginal effect of a 1-unit increase in any X is forced to be constant, which cannot possibly be true for all values of X. E.g., if increasing X by 1 always increases Ŷ by a particular amount, Ŷ must exceed 1 when X is sufficiently large.

15
Q

The Binomial Logit Model

A

The binomial logit is an estimation technique for equations with dummy dependent variables that avoids the unboundedness problem of the linear probability model

16
Q

Logits cannot be estimated using OLS

A

but are instead estimated by maximum likelihood (ML), an iterative estimation technique that is especially useful for equations that are nonlinear in the coefficients. Unlike the LPM, the logit model's predicted probability is bounded between 0 and 1.
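A bare-bones illustration (pure Python, invented data) of what ML estimation of a logit iterates toward; this sketch uses simple gradient ascent on the log-likelihood rather than the Newton-type steps real packages use:

```python
import math

def sigmoid(z):
    # The logistic function keeps every fitted probability strictly in (0, 1)
    return 1.0 / (1.0 + math.exp(-z))

def logit_ml(x, d, steps=5000, lr=0.1):
    """One-regressor logit estimated by gradient ascent on the log-likelihood."""
    b0, b1 = 0.0, 0.0
    n = len(x)
    for _ in range(steps):
        g0 = sum(di - sigmoid(b0 + b1 * xi) for xi, di in zip(x, d)) / n
        g1 = sum((di - sigmoid(b0 + b1 * xi)) * xi for xi, di in zip(x, d)) / n
        b0 += lr * g0
        b1 += lr * g1
    return b0, b1

x = [1, 2, 3, 4, 5, 6]
d = [0, 0, 1, 0, 1, 1]          # invented binary outcomes
b0, b1 = logit_ml(x, d)
p = [sigmoid(b0 + b1 * xi) for xi in x]   # all strictly between 0 and 1
```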

17
Q

Basic Procedure for Random Assignment Experiments

A

1) Recruit a sample of subjects. 2) Randomly assign some to a treatment group and some to a control group; random assignment makes treatment uncorrelated with individual characteristics. 3) Measure the average difference in outcomes between the treatment and control groups.

18
Q

natural experiments (or quasi-experiments)

A

attempt to utilize the “treatment-control” framework in the absence of actual random assignment to treatment and control groups

19
Q

Difference-in-difference estimator:

A

Policy impact = (Tpost – Tpre) – (Cpost – Cpre), where T is the treatment group outcome and C is the control group outcome. The DD estimate is the amount by which the change for the treatment group exceeded the change for the control group.
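The arithmetic is simple enough to spell out; the numbers below are invented group means:

```python
# Difference-in-differences with made-up group means
t_pre, t_post = 10.0, 18.0   # treatment group outcome before/after
c_pre, c_post = 9.0, 12.0    # control group outcome before/after

dd = (t_post - t_pre) - (c_post - c_pre)
# (18 - 10) - (12 - 9) = 8 - 3 = 5: treatment's change exceeded control's by 5
```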

20
Q

Panel data:

A

repeated observations of multiple units over time (combination of cross-sectional and time-series)

21
Q

Main advantages of panel data

A

1) Increased sample size. 2) Ability to answer types of questions that cross-sectional and time-series data cannot accommodate. 3) Enables use of additional methods to eliminate omitted variables bias.

22
Q

Panel Data Notation

A

The i subscript indexes the cross-sectional unit (individual, county, state, etc.); the t subscript indexes the time period in which the unit is observed.

23
Q

First-differenced estimator

A

ΔY_i = α0 + β1ΔX_i + Δε_i
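A toy pure-Python sketch (invented two-period panel) showing that differencing removes the time-invariant unit effect before OLS is run on the changes:

```python
# First-differenced estimator on an invented two-period panel (4 units)
y1 = [3.0, 5.0, 4.0, 6.0]    # Y in period 1
y2 = [4.0, 7.0, 6.0, 9.0]    # Y in period 2
x1 = [1.0, 2.0, 1.5, 3.0]    # X in period 1
x2 = [2.0, 4.0, 3.5, 6.0]    # X in period 2

dy = [b - a for a, b in zip(y1, y2)]   # delta-Y per unit
dx = [b - a for a, b in zip(x1, x2)]   # delta-X per unit

# OLS slope on the differenced data: beta1 = cov(dx, dy) / var(dx)
n = len(dx)
mdx, mdy = sum(dx) / n, sum(dy) / n
beta1 = (sum((u - mdx) * (v - mdy) for u, v in zip(dx, dy))
         / sum((u - mdx) ** 2 for u in dx))
```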

24
Q

Advantages of random effects estimator (if assumption about 𝑎𝑖 is correct):

A

1) Allows time-invariant regressors to be included. 2) More degrees of freedom (only estimates the parameters of the distribution from which a_i is assumed to be drawn; the fixed effects estimator uses one degree of freedom per fixed effect).

25
Disadvantages of random effects estimator:
Biased if the assumption that a_i is uncorrelated with the regressors is incorrect (while the FE estimator allows arbitrary correlation between a_i and the regressors).
26
The fixed effects estimator is widely preferred when the regressors of interest are time-varying. It rarely seems likely that a_i is uncorrelated with the regressors; the fixed effects model is generally far more convincing.
27
Hausman test
Fixed and random effects estimators can be compared with a Hausman test (previously seen in the instrumental variables context as a test for endogeneity).
28
Fixed vs. random effects - Concept
Under the random effects hypothesis, both the RE and FE estimators are consistent (and should give similar results); under the alternative hypothesis, FE is consistent but RE is not. Therefore, if the estimates are significantly different, we can reject the null hypothesis of random effects.
29
Fixed vs. random effects - General Advice
Use the fixed effects estimator if it's feasible.
30
T-test
Divide the coefficient by the standard error to get the t-value
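In code form (invented numbers):

```python
coef, se = 0.75, 0.30        # estimated coefficient and its standard error
t_value = coef / se          # 2.5; compare against the critical t-value
```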
31
Omitted Variables – Bias Assessment
Sign(β2) × Sign(Corr(X1, X2)) = Sign of bias in β̂1
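A one-line check of the sign rule (values invented):

```python
def sign(v):
    # Returns +1, 0, or -1
    return (v > 0) - (v < 0)

beta2 = 0.8          # omitted variable's coefficient: positive
corr_x1_x2 = -0.4    # correlation between included X1 and omitted X2: negative
bias_sign = sign(beta2) * sign(corr_x1_x2)   # -1: beta1-hat biased downward
```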
32
Irrelevant Variables - Inclusion Criteria
1) Theory: is there sound justification for including the variable? 2) Bias: do the coefficients for other variables change noticeably when the variable is included? 3) t-test: is the variable's estimated coefficient statistically significant? 4) R-squared: has the R-squared (adjusted R-squared) improved?
33
Serial Correlation
First-order serial correlation occurs when the value of the error term in one period is a function of its value in the previous period; the current error term is correlated with the previous error term.
34
DW Test
compare the DW statistic d to the critical values (d_L, d_U)
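The statistic itself is d = Σ(e_t − e_{t−1})² / Σe_t², computed here on invented residuals:

```python
def durbin_watson(e):
    """d = sum of squared changes in residuals over sum of squared residuals.
    d near 2 suggests no first-order serial correlation; near 0, positive."""
    num = sum((e[t] - e[t - 1]) ** 2 for t in range(1, len(e)))
    den = sum(et ** 2 for et in e)
    return num / den

resid = [0.5, -0.3, 0.4, -0.6, 0.2, -0.1]   # made-up OLS residuals
d = durbin_watson(resid)   # compare against tabulated (d_L, d_U)
```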
35
Pure Heteroskedasticity
occurs in correctly specified equations
36
Impure Heteroskedasticity
arises due to model misspecification
37
Multicollinearity
Multicollinearity exists in every equation, and its severity can change from sample to sample. There are no generally accepted true statistical tests for multicollinearity; VIF > 5 is a common rule of thumb.
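With two regressors the VIF reduces to 1 / (1 − corr²), since the auxiliary R² is just the squared correlation; a sketch on invented, nearly collinear data:

```python
def corr(a, b):
    """Pearson correlation computed from scratch."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / (va * vb) ** 0.5

x1 = [1.0, 2.0, 3.0, 4.0, 5.0]
x2 = [1.1, 2.3, 2.9, 4.2, 4.8]          # nearly a linear function of x1
vif = 1.0 / (1.0 - corr(x1, x2) ** 2)   # far above the VIF > 5 rule of thumb
```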
38
Binary Dependent Variable Models
Linear Probability Model (LPM) & Logit / Probit Model
39
Linear Probability Model (LPM)
Similar to an OLS regression. R-squared is no longer an accurate goodness-of-fit measure. Interpretation: coefficients give the change in the probability that Y = 1, on a percentage point scale.
40
Logit / Probit Model
Restricted between 0 and 1. Automatically corrects for heteroskedasticity. The marginal effect of X is not constant. Not linear in the coefficients.
41
LPM Interpretation (example)
On average, a 1-unit increase in DISTANCE is associated with a 7.2 percentage point decrease in the probability of choosing Cedars Sinai, holding all else constant
42
LPM Limitation #1
Unboundedness | The linear probability model produces nonsensical forecasts (>1 and <0)
43
LPM Limitation #2
Adj-R^2 is no longer accurate measure of overall fit
44
LPM Limitation #3
Marginal Effect (slope) of a 1-unit increase in X is forced to be constant
45
LPM Limitation #4
Error term is neither homoskedastic nor normally distributed
46
Logit Model - Coefficient Interpretation
The sign and significance can be interpreted just as in linear models. β1 is the effect of a 1-unit increase in X1 on the log-odds ratio.
47
Calculating Marginal Effects
In Stata: margins, dydx(X) atmeans
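What that Stata command computes is, for a logit, β1 · P · (1 − P) evaluated at the regressor means; a hand calculation with invented coefficients:

```python
import math

b0, b1 = -2.0, 0.5     # invented logit coefficients
x_mean = 3.0           # sample mean of X

p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x_mean)))   # P(Y = 1) at the mean
marginal_effect = b1 * p * (1.0 - p)              # dP/dX at the mean
```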
48
LPM Interpretations
β × 100: percentage point change in the probability that Y = 1
49
Logit Interpretations
β: change in the log-odds ratio of Y = 1
50
Marginal Effects Interpretations
(dy/dx) × 100: percentage point change in the probability that Y = 1
51
Experimental Methods - Selection Problems
Treatment is not necessarily randomly assigned because of other systematic differences in the error term (endogeneity) which would cause bias in the treatment’s effect. We need a valid counterfactual to truly understand the effect of the intervention / treatment.
52
Valid Counterfactual
a control group that is exactly the same as the treatment group except it does not receive the treatment
53
Solution to Selection Problems
Randomization: the researcher randomly assigns subjects to either a treatment or control group to estimate the treatment effect
54
Natural / Quasi-Experiments
Randomized experiments are hard to do in the social sciences, so researchers often rely upon natural experiments where an exogenous event mimics the treatment and control group framework in the absence of actual random assignment
55
Counterfactual Challenge
It is hard to find an untreated group that really is otherwise identical to the treated group
56
Panel Data - definitions expanded
Formed when cross-sectional and time-series data sets are combined to create a single data set. Main reason for working with panel data (beyond increasing sample size) is to provide insight into analytical questions that can’t be answered by using time-series or cross-sectional data alone
57
Panel Data Advantages
1) Increased sample size, so more degrees of freedom and sample variability. 2) Able to answer new research questions. 3) Can eliminate omitted variable bias with fixed effects (controlling for unobserved heterogeneity).
58
Panel Data Concerns
Heteroskedasticity & Serial Correlation
59
Panel Data - Fixed Effects Model
Does a good job of estimating panel data equations, and it also helps avoid omitted variable bias due to unobserved heterogeneity.
60
Fixed Effects Model Assumptions
Each cross-sectional unit has its own intercept. A fixed effects analysis allows arbitrary correlation between all time-varying explanatory variables and a_i.
61
Fixed Effects Model Drawback
measurement error, autocorrelation, heteroskedasticity
62
Fixed Effects Model
The omitted variable bias arising from unobserved heterogeneity can be mitigated with panel data and the fixed effects model.
63
How Fixed Effect Model address Omitted Variable Bias
How? It estimates panel data equations by including enough dummy variables to allow each cross-sectional unit i (and each time period t) to have a different intercept. These dummy variables absorb the time-invariant, individual-specific omitted factors otherwise left in the error term.
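The dummy-variable estimator is numerically identical to demeaning each variable within its unit (the "within" transformation); a toy pure-Python version with invented numbers:

```python
# Fixed effects via the within transformation on an invented 2-unit panel
panels = {
    "A": {"x": [1.0, 2.0, 3.0], "y": [5.0, 6.0, 7.0]},
    "B": {"x": [2.0, 4.0, 6.0], "y": [1.0, 3.0, 5.0]},
}

dx, dy = [], []
for unit in panels.values():
    mx = sum(unit["x"]) / len(unit["x"])
    my = sum(unit["y"]) / len(unit["y"])
    dx += [v - mx for v in unit["x"]]   # demeaning removes each unit's a_i
    dy += [v - my for v in unit["y"]]

# OLS through the origin on the demeaned data gives the FE slope
beta_fe = sum(a * b for a, b in zip(dx, dy)) / sum(a * a for a in dx)
```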
64
Panel Data - Random Effects Model | When to use
When the explanatory variable of interest is time-invariant
65
Panel Data - Random Effects Model | Assumption
a_i and the regressors (X_it) are uncorrelated
66
Panel Data - Random Effects Model | Advantages
Can handle time-invariant variables. Uses fewer degrees of freedom than FE because it does not require subject dummies.
67
Hausman Test
Compares the fixed and random effects estimators to see if their difference is statistically significant. If different → fixed effects model preferred (reject the null hypothesis of random effects). If not different → random effects model to conserve degrees of freedom (or report both the fixed effects and random effects estimates). If the two models give VERY different results, it suggests the RE model suffers from omitted variable bias and endogeneity, making FE more statistically accurate.
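In the one-coefficient case the Hausman statistic collapses to a simple ratio, sketched here with invented estimates:

```python
# Scalar Hausman statistic: H = (b_FE - b_RE)^2 / (Var_FE - Var_RE),
# compared against a chi-square(1) critical value. Numbers are invented.
b_fe, var_fe = 1.20, 0.050
b_re, var_re = 0.90, 0.030

h = (b_fe - b_re) ** 2 / (var_fe - var_re)
# h = 0.09 / 0.02 = 4.5 > 3.84 (5% chi-square critical value, 1 df):
# reject the random effects null; prefer fixed effects
```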
68
What is the purpose of determining the Cook’s D, what is it used to detect?
Cook’s D is used to detect influential outliers.
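For a simple one-regressor OLS fit, Cook's D can be computed from scratch as D_i = e_i² / (p·s²) · h_i / (1 − h_i)², where h_i is the leverage; a sketch on invented data with one influential point:

```python
def cooks_d(x, y):
    """Cook's distance for each observation of a simple (one-X) OLS fit."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    b0 = my - b1 * mx
    e = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]   # residuals
    p = 2                                    # parameters: intercept + slope
    s2 = sum(ei ** 2 for ei in e) / (n - p)  # residual variance (MSE)
    h = [1 / n + (xi - mx) ** 2 / sxx for xi in x]      # leverage
    return [ei ** 2 / (p * s2) * hi / (1 - hi) ** 2
            for ei, hi in zip(e, h)]

x = [1.0, 2.0, 3.0, 4.0, 10.0]   # last point has high leverage...
y = [2.0, 4.0, 6.0, 8.0, 9.0]    # ...and breaks the otherwise exact line
d = cooks_d(x, y)
# d[-1] dwarfs the others, flagging the last observation as influential
```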
69
Linear Probability Model Example
HS-hat_i = 0.42 + 0.028 meduc_i + 0.002 meduc_i^2 + 0.06 work_i
70
Problems with Linear Probability Model
1) HS-hat_i could be ≤ 0 or ≥ 1. The linear probability model is difficult to interpret as a probability because HS-hat_i is not bounded by 0 and 1; it produces nonsensical forecasts (greater than 1 and less than 0). 2) The marginal effect of a 1-unit increase in any X is forced to be constant, which cannot possibly be true for all values of X. 3) Adjusted R² is no longer an accurate goodness-of-fit measure. The predicted values of Y are forced to change linearly with X, so you could obtain a low adjusted R² for an accurate model.
71
What is the difference between a random and natural experiment?
Random experiments involve the researcher randomly assigning subjects to either a treatment or control group to estimate the treatment effect. Natural experiments, or quasi-experiments, attempt to utilize the "treatment-control" framework in the absence of actual random assignment to treatment and control groups: instead of the researcher randomly assigning treatment, they rely on some exogenous event to create treatment and control groups. When the event or policy is truly exogenous, treatment is as good as randomly assigned.
72
Problems in random experiments.
1) Random experiments are often very costly or cannot be carried out due to being unethical. 2) Non-random samples. They often lack generalizability since the sample may not be randomly drawn from the entire population of interest. 3) Attrition bias because treatment or control units non-randomly drop out of the experiment. 4) Hawthorne effects (people behave differently when observed, may respond to treatment/control status). 5) Randomization failure (can only control for observed treatment-control differences; bias may result if unobservable characteristics not perfectly balanced).
73
Explain briefly the difference-in-differences estimator
This method estimates the impact of a treatment by comparing the outcomes of a treatment group and a control group before and after the treatment is received.
74
Main underlying assumption of difference-in-differences estimator
The main underlying assumption: in the absence of the treatment, the difference between the outcomes of the two groups would not have changed (i.e., they would have followed a common trend). The change in outcomes of the control group is viewed as the counterfactual for the change in outcomes of the treatment group.
75
What is panel data?
Panel data are repeated observations of multiple units over time. It is a combination of cross-sectional and time-series.
76
Advantages of Panel Data
1) More degrees of freedom and more sample variability than cross-sectional or time-series data alone, allowing more accurate inference about the model parameters and hence more efficient estimates. 2) Can eliminate omitted variables bias. It is often argued that an apparent effect really reflects ignored variables that are correlated with the explanatory variables; panel data allow us to control for such missing or unobserved variables. 3) Ability to answer types of questions that cross-sectional and time-series data cannot accommodate, for example transitions from employment to unemployment, from employment to retirement, changes in health status, or any other variables that change through time.
77
differences between the fixed effects and random effects panel data models.
The fixed effects model allows a_i to be correlated with the regressors, while the random effects estimator assumes a_i is uncorrelated with the regressors.
78
Advantages of the random effects model
Advantages of the random effects model are that it allows time-invariant regressors to be included and that it preserves more degrees of freedom.
79
Disadvantages of the random effects model
The main disadvantage of the random effects estimator is that it is biased if the assumption that a_i is uncorrelated with the regressors is incorrect.
80
Advantage of the fixed effects model
An advantage of the fixed effects model is that it allows arbitrary correlation between a_i and any regressors.
81
Disadvantage of the fixed effects model
One of the main disadvantages is that it drops time-invariant regressors.
82
Preference between fixed effects model and random effects model
Unless we wish to estimate the effect of a time-invariant variable, fixed effects are generally preferred over random effects because they rest on less restrictive assumptions.