Selection on Observables Flashcards
(14 cards)
Stratum/strat
Groups in which both treated and untreated units share a common characteristic
Curse of Dimensionality
As the number of variables increases, it gets harder to find good matches
Assignment mechanism
Determines which units receive treatment and which ones don’t. It is also a way to prevent “back-door relationships” between treatment (D) and the outcome (Y)
Main estimators
Subclassification, matching, regression, and a combo of the three
Propensity score
The selection probability that is conditional on the confound variables. Represented mathematically as p(X) = P(D=1|X). Conditionality on the propensity score is enough to ensure that the treatment indicator and potential outcomes are independent from each other. Can also use this for matching by estimating the propensity with its formula and matching on the estimated propensity score.
Fundamental Problem of Causal Inference
Comparisons between the observed outcomes of the treated and untreated groups can be misleading and create bias. So to make casual inferences, you need an explanation for why some units received treatment and why some did not
Why use SOO and not an RCT?
SOO is useful for finding causation without clear randomized assignment. RCTs can also be too expensive, unethical or unfeasible to do all the time
Identification Assumptions
Treatment Assignment is randomly assigned within each straum of X, and that you can observe participants and nonparticiapnts with the same characteristics.
Identifying the ATE
Must know which groups (X) were more likely to receive the treatment, the assignment to treatment has to be random.
How to estimate the causal effects of OLS
There are 3 ways: using the subclassification estimator, a regression, or through matching
Under what conditions do you need to match?
First, check the balance on pre-treatment covariates, then perform matching, then check the balance in matched data, if the balance is good, then you estimate treatment effects, but if not do the matching again
Exact Matching
Match treatment observations to control observations witht he sam characteristics you are observing. The observations need to have the same values to be an exact match.
Nearest Neighbor Matching
If perfect matches are not available, then match units with the estimator of the ATT. You can get this estimator by adding the missing potential outcomes of each treat unit using the observed outcomes from the closest untreated unit into the formula for the ATT
Subclassification Estimator
Basically dividing the sample into bins based on the value of the key covaraites, then you estimate the difference in means between the treated and untreated units within each of the bins you made, and finally you then construct the weighted averages of the differences in means (by using the proportion of units in each bin to get the ATE, by using the proportion of treated units in each bin to get the ATT, and by using the proportion of untreated units in each bin to recover the ATC).