OLS Flashcards
What is the difference between causal effect and correlation?
A causal effect tells us that changes in one variable (say hot weather) lead to changes in another variable (say ice cream sales). Correlation is related but weaker: it shows two variables moving in a similar pattern (either positively or negatively), but it does not mean that one causes the other; there could be another factor influencing both (say a higher number of wasps seems to correlate with higher ice cream sales, but both are driven by hot weather).
What are Quasi-Experimental Methods?
Research designs that share characteristics with experimental designs but lack full randomization of participants into treatment and control groups. They often involve naturally occurring events that researchers leverage to study the effects of a treatment, and are used when true randomization is not feasible or ethical.
What is OLS?
Ordinary Least Squares: a method used to estimate the parameters of a linear model. The OLS method involves finding the values of the regression coefficients that minimize the sum of the squared residuals, where the residual is the difference between the observed and predicted values of the dependent variable:
$$\min_{\beta_0, \beta_1, \dots, \beta_k} \sum_{i=1}^{n} \left( Y_i - (\beta_0 + \beta_1 X_{i1} + \beta_2 X_{i2} + \dots + \beta_k X_{ik}) \right)^2$$
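A minimal sketch in Python (assuming numpy; the data, true coefficients, and variable names are invented for illustration) showing that the OLS coefficients are exactly the values that minimise the sum of squared residuals:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=200)
y = 1.0 + 2.0 * x + rng.normal(size=200)   # true intercept 1, true slope 2

def ssr(a, b):
    residuals = y - (a + b * x)             # observed minus predicted values
    return np.sum(residuals ** 2)           # sum of squared residuals

# Closed-form OLS solution for the simple regression case.
b_hat = np.cov(x, y)[0, 1] / np.var(x, ddof=1)
a_hat = y.mean() - b_hat * x.mean()

print(ssr(a_hat, b_hat))          # the minimum achievable SSR
print(ssr(a_hat + 0.1, b_hat))    # perturbing either coefficient raises the SSR
print(ssr(a_hat, b_hat + 0.1))
```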
What is the line of best fit?
The line which best describes the relationship between $y_i$ and $x_i$. The line of 'best fit' is the one that gives the best approximation to all the data points. It does this by minimising the (squared) distance between the line and all of the data points.
In OLS, how do we define the predicted outcome?
$\hat{y}_i = \hat{\alpha} + \hat{\beta} x_i$
What is the residual in OLS?
$\hat{u}_i = y_i - \hat{y}_i$
What is the best line in function notation?
$\hat{y} = \hat{\alpha} + \hat{\beta} x$
How do you visually represent OLS and line of best fit?
Draw a scatter plot of the data points with the fitted line running through the cloud of points; the vertical (squared) distances between the points and the line are what OLS minimises.
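A minimal plotting sketch (assuming numpy and matplotlib; the simulated data are invented for illustration), with one residual marked as a dashed vertical segment:

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(1)
x = rng.uniform(0, 10, size=50)
y = 2.0 + 0.8 * x + rng.normal(size=50)

b = np.cov(x, y)[0, 1] / np.var(x, ddof=1)   # slope: cov(x, y) / var(x)
a = y.mean() - b * x.mean()                  # intercept: mean(y) - b * mean(x)

plt.scatter(x, y, label="data")
xs = np.sort(x)
plt.plot(xs, a + b * xs, color="red", label="line of best fit")
fitted0 = a + b * x[0]                       # mark one residual as a vertical distance
plt.vlines(x[0], min(fitted0, y[0]), max(fitted0, y[0]),
           linestyles="dashed", label="a residual")
plt.legend()
plt.show()
```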
Formally, what is the OLS estimate?
The values of $\alpha, \beta$ that minimise $SSR(\alpha, \beta)$: $(\hat{\alpha}, \hat{\beta}) = \arg\min_{\alpha, \beta} SSR(\alpha, \beta)$. Solving gives $\hat{\alpha} = \bar{y} - \hat{\beta}\bar{x}$ and $\hat{\beta} = \mathrm{cov}(x, y)/\mathrm{var}(x)$.
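A minimal check (assuming numpy; data simulated for illustration) that these closed-form formulas agree with numpy's own least-squares fit:

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(size=1000)
y = 3.0 + 1.5 * x + rng.normal(size=1000)           # true alpha = 3.0, true beta = 1.5

beta_hat = np.cov(x, y)[0, 1] / np.var(x, ddof=1)   # cov(x, y) / var(x)
alpha_hat = y.mean() - beta_hat * x.mean()          # mean(y) - beta_hat * mean(x)

slope, intercept = np.polyfit(x, y, 1)              # numpy's least-squares line
print(alpha_hat, beta_hat)                          # ~ (3.0, 1.5)
print(intercept, slope)                             # identical up to floating-point error
```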
What does $\hat{\alpha}$ capture?
The intercept: the predicted value of $y$ when $x = 0$.
What does $\hat{\beta}$ capture?
The slope: how $y$ changes when $x$ changes by one unit.
Why is the simple OLS model not favoured by researchers?
Usually, as social scientists, we want more than just the best linear approximation of one variable given another variable (or variables). We want to say something about the causal effect, and for that we need to specify a model.
What is the classic linear model?
$y = \alpha + \beta_1 x_1 + \beta_2 x_2 + \dots + \beta_k x_k + u$, where $y, x_1, x_2, \dots, x_k$ and $u$ are random variables. $u$ is the residual or error term. $\alpha$ and the $\beta$s are referred to as parameters (or coefficients when we estimate the model) and are real numbers. The model writes the outcome of interest $y$ (e.g. wage in our earlier example) as a linear function of some explanatory variables $x$ (say age, gender, education, . . . ) plus a residual or error term $u$. This is the first assumption of the model: that the relationship is linear in the parameters.
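A minimal simulation of the classic linear model with two explanatory variables (assuming numpy; the true parameter values are invented for illustration), estimated by OLS via the design matrix:

```python
import numpy as np

rng = np.random.default_rng(3)
n = 1000
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)
u = rng.normal(size=n)                           # the unobserved error term
y = 0.5 + 1.2 * x1 - 0.7 * x2 + u                # alpha = 0.5, beta1 = 1.2, beta2 = -0.7

X = np.column_stack([np.ones(n), x1, x2])        # design matrix with intercept column
params, *_ = np.linalg.lstsq(X, y, rcond=None)   # OLS estimates
print(params)                                    # ~ [0.5, 1.2, -0.7]
```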
What is the residual u of the classic linear model?
The residual $u$ can be thought of as standing for 'unobserved': everything that we think may affect $y$ but do not observe.
How are we able to determine a causal effect?
In order to give the model the causal interpretation we want, we need to be able to interpret $\beta_k$ as the marginal effect of $x_k$ on $y$ whilst keeping all of the other variables ($x_m$ for $m \neq k$) and the error term $u$ constant, i.e. our good old ceteris paribus condition, where 'all' includes the unobservables. In practice we cannot do this: since $u$ is unobserved, we cannot hold it constant.
Why do we need assumptions for OLS/classic linear model?
In practice we cannot hold all other terms constant: since $u$ is unobserved, we cannot hold it fixed. The model therefore requires us to make some assumptions about the unobserved $u$ given what we do observe: the $x$s.
What are the OLS/Gauss-Markov Assumptions?
A1: Linearity in parameters.
A2: No endogeneity: $E(u_i \mid x_i) = 0$ (mean independence of the error term).
A3: Homoskedasticity: the variance of the error term is constant: $\mathrm{var}(u_i \mid x_1, x_2, \dots, x_k) = \sigma^2$.
A4: Zero covariance between the error terms (independently distributed errors): $\mathrm{Cov}(u_i, u_j) = 0, \ \forall i \neq j$.
A5: The error has a normal distribution: $u_i \sim N(0, \sigma^2)$ (needed for statistical inference).
A6: No perfect multicollinearity of the explanatory variables. If one variable has an exact linear relationship with another, we cannot distinguish between the effects of each individual variable and so cannot estimate the coefficients.
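A minimal sketch (assuming numpy; the confounder setup is invented for illustration) of what goes wrong when A2 fails: an omitted variable that is correlated with $x$ ends up in the error term, so $E(u \mid x) \neq 0$ and the OLS slope is biased:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 100_000
z = rng.normal(size=n)                             # a confounder
x = z + rng.normal(size=n)                         # x is correlated with z
y = 1.0 + 2.0 * x + 3.0 * z + rng.normal(size=n)   # true effect of x on y is 2

# Regressing y on x alone pushes z into the error term, violating E(u|x) = 0.
b_short = np.cov(x, y)[0, 1] / np.var(x, ddof=1)
print(b_short)    # ~ 3.5: biased upwards because cov(x, z) > 0

# Including z restores A2 and recovers the true coefficients.
X = np.column_stack([np.ones(n), x, z])
print(np.linalg.lstsq(X, y, rcond=None)[0])   # ~ [1.0, 2.0, 3.0]
```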
What is unbiasedness in an estimator?
Bias: the bias of $\hat{\mu}$ is given by $E(\hat{\mu}) - \mu$. The estimator is unbiased if $E(\hat{\mu}) - \mu = 0$, i.e. $E(\hat{\mu}) = \mu$. Unbiasedness means that if we compute $\hat{\mu}$ for many different random samples, then the average of the estimates over these samples will be the true population parameter $\mu$. For example, the sample mean $\bar{x}_n$ is an unbiased estimator for $\mu$ since $E(\bar{x}_n) = E(x) = \mu$.
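A minimal Monte Carlo sketch (assuming numpy; sample size and true slope invented for illustration): averaging the OLS slope estimate over many random samples recovers the true parameter, even when each sample is small:

```python
import numpy as np

rng = np.random.default_rng(5)
true_beta = 2.0
estimates = []
for _ in range(2000):                    # 2000 independent random samples
    x = rng.normal(size=50)              # a small sample each time
    y = 1.0 + true_beta * x + rng.normal(size=50)
    estimates.append(np.cov(x, y)[0, 1] / np.var(x, ddof=1))

print(np.mean(estimates))    # ~ 2.0: right on average (unbiasedness)
```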
What is consistency in an estimator?
Consistency: $\hat{\mu}$ is a consistent estimator of $\mu$ if $\mathrm{plim}\, \hat{\mu}_n = \mu$, i.e. $\lim_{n \to \infty} \Pr(|\hat{\mu}_n - \mu| < \epsilon) = 1, \ \forall \epsilon > 0$. In words: $\hat{\mu}$ is a consistent estimator for $\mu$ if, for any $\epsilon > 0$, the probability that the distance between $\hat{\mu}_n$ and $\mu$ is less than $\epsilon$ tends to 1 as the sample size $n$ tends to infinity. That is, as the sample size increases the estimate converges to the population value. A consistent estimator delivers estimates such that, as the sample size increases, the distribution of the estimates is concentrated ever closer to the single point $\mu$, i.e. the variance of the distribution of the estimates produced by the estimator tends to zero.
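A minimal sketch (assuming numpy; sample sizes invented for illustration): the spread of the OLS slope estimates around the true value shrinks as $n$ grows:

```python
import numpy as np

rng = np.random.default_rng(6)
true_beta = 2.0
for n in [10, 100, 1000, 10000]:
    estimates = []
    for _ in range(500):                 # 500 samples at each sample size
        x = rng.normal(size=n)
        y = 1.0 + true_beta * x + rng.normal(size=n)
        estimates.append(np.cov(x, y)[0, 1] / np.var(x, ddof=1))
    print(n, np.std(estimates))          # the spread falls as n increases
```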
What is efficiency in an estimator?
Efficiency: let $\tilde{\mu}$ be another estimator of $\mu$ and assume that both $\hat{\mu}$ and $\tilde{\mu}$ are unbiased. $\hat{\mu}$ is more efficient than $\tilde{\mu}$ if $\mathrm{var}(\hat{\mu}) < \mathrm{var}(\tilde{\mu})$. Efficiency of an estimator is relative to other potential estimators. For a more efficient estimator, the estimates computed for different samples tend to be more tightly centred around their average than for a less efficient estimator. Using an efficient estimator means that it is less likely that we obtain a random sample which yields an estimate far from the corresponding population value.
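A minimal sketch (assuming numpy; not from the flashcards) comparing two unbiased estimators of a normal mean $\mu$: the sample mean and the sample median both centre on $\mu$, but the mean has the smaller variance and is therefore more efficient:

```python
import numpy as np

rng = np.random.default_rng(7)
mu = 5.0
means, medians = [], []
for _ in range(5000):                    # 5000 random samples of size 100
    sample = rng.normal(loc=mu, size=100)
    means.append(sample.mean())
    medians.append(np.median(sample))

print(np.mean(means), np.mean(medians))  # both ~ 5.0: both unbiased
print(np.var(means), np.var(medians))    # var(mean) < var(median): mean is more efficient
```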
What are finite and small sample properties?
Unbiasedness and efficiency; they apply to samples of any size.
What is an asymptotic or large sample property?
Consistency. It describes the behaviour of an estimator as the sample size $n$ tends to infinity, so it is only a guarantee for large samples (a common rule of thumb is $n > 30$ before asymptotic approximations become reasonable).
What does the Gauss-Markov theorem tell us?
If assumptions A1 to A4 hold, then the OLS estimator is BLUE (Best Linear Unbiased Estimator). OLS is an estimator and, as usual, its estimates vary from (repeated) sample to sample. The theorem says that: the expected values of the parameter estimates across these samples are equal to the true (population) regression parameters, i.e. OLS gets it right on average (unbiasedness); and OLS has the lowest variance among all linear unbiased estimators, so the probability that in a random sample you get an estimate close to the true parameter is highest when using OLS (efficiency). OLS is also consistent.
What is the estimated model?
$\hat{y}_i = \hat{\alpha} + \hat{\beta} x_i$ is an estimate of $E(y_i \mid x_i) = \alpha + \beta x_i$, which is the conditional expectation function (CEF) if the CEF is linear, and the best linear approximation to it if the CEF is non-linear.
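A minimal sketch (assuming numpy; the quadratic CEF is invented for illustration) of the 'best linear approximation' case: when $E(y \mid x) = x^2$ and $x$ is standard normal, the fitted line is flat at the mean of $y$, the closest line (in mean squared error) to the non-linear CEF:

```python
import numpy as np

rng = np.random.default_rng(8)
x = rng.normal(size=100_000)
y = x ** 2 + rng.normal(size=100_000)    # non-linear CEF: E(y|x) = x^2

b_hat = np.cov(x, y)[0, 1] / np.var(x, ddof=1)
a_hat = y.mean() - b_hat * x.mean()
print(a_hat, b_hat)   # ~ (1.0, 0.0): cov(x, x^2) = 0 for symmetric x, so the best line is flat
```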