Exam 1 Review Flashcards
(15 cards)
What is a mediator? Explain and draw the DAG
X > M > Y
Mediation is the process by which a third vari-
able (i.e., the mediator) transmits the effect of an independent variable on a dependent variable
* π: Attending an elite high school (main independent variable/treatment/exposure)
* π : Going to an elite university (dependent variable/outcome)
* π : Better teacher in the elite high school (mediator)
Draw a fork DAG. Label the variables and explain the essence of a fork DAG with an example
Z
ββ
X >Y
What is a direct effect, and how does it relate to mediation?
A direct effect refers to the mediation effect produced every other mediator except π .
In the example from question 2, it would be the every other reason besides good teachers
why going to an elite high school helps with getting into an elite university.
In the language of DAGs, what does it mean to close all relevant backdoor
paths?
It means to only close the backdoor paths associated with confounders, not colliders or
mediators β i.e., assuming an interest in some form of an average treatment effect as
the estimand
What is a collider? Draw the DAG and explain.
C
β β
X Y
Colliders (πΆ) are variables that, if adjusted for, can introduce a spurious relationship
between π and π
What is selection bias and how do you indicate it in a DAG?
Selection bias is when you have of availability of data in your sample that does not rep-
resent the population for which you are making your inference. You indicated selection
bias with a boxed S
Draw a DAG with selection bias on the dependent variable. What can we learn in such instances? Explain with an example.
In our smoking example, sample selection bias on the dependent variable (π ) entails
having a sample of either people mostly with lung cancer or mostly without lung cancer.
In either case, it is diο¬icult to make any within-sample inference with respect to causality
or prediction, because there is not enough variation in people. For external validity, we
canβt make inferences to a larger population if we donβt have (1) data representative
of that population; or (2) variables in our sample to adjust for the sample-population
differences
You are presented with a regression in which the author uses female literacy as the independent variable (π) and overall literacy (π ) as the dependent
variable. Does this seem like a valid set up? Why or not?
No, it is not a valid set up. Clearly, female literacy is part and parcel of overall literacy.
Accordingly, there is no point in running a regression of something that already partly
explains something else by design.
What is positivity in the context of internal validity? Provide an example to show that you understand it. It may also be helpful to draw something.
Whether the different manifestations of the independent variable (π)
overlap across subgroups/strata of the treatment and control, taking into account selec-
tion bias ( π ) that can result in under- or over-coverage
What is positivity in the context of external validity? Provide an ex-
ample to show that you understand it. It may also be helpful to draw
something
Whether the different manifestations of the independent variable (π)
and effect modifiers (π ) overlap across the sample and population, taking into account
selection bias ( π ) that can result in under- or over-coverage.
What are generalizability and transportability?
Generalizability is when the sample is embedded within the population of interest, and
transportability is when the sample corresponds to another population of interest
What is an Intent to Treat (ITT) effect?
The effect of assigning the treatment, even if people did not comply with their treatment
assignment. The ITT is often a very conservative estimand.
What are the treatment (π), instrument (π), dependent variable (π ),
and confounder (π) in Column (2) of Table 4 of Acemoglu, Johnson, and
Robinson (2001)? Also, draw the DAG.
- π: average protection against expropriation risk (quality of institutions)
- π: settler mortality in colonial era
- π : log GDP per capita
- π: latitude
What is the exclusion restriction in Acemoglu, Johnson, and Robinson
(2001)?
Settler mortality at the time of colonization (π) must not be directly related to current
GDP per capita (π ). If settler mortality at the time of colonization were related to current GDP, it would violate the exclusion restriction assumption.