Epi Methods 753 Flashcards
(91 cards)
2 categories where 1 is reference group (typically “unexposed”)
Dichotomous Variable
Parameterization of variable into discrete categories
Categorical Variable
Categorical variable that assigns 0 or 1
Binary Variable
Categorical variable that doesn’t have ordering/order not of interest; collection of k-1 binary indicator variables
Nominal Variable
Categorical variable that has ordering/order of interest; collection of binary variables assigned score; step between categories constrained to be equal
Ordinal Variable
Test Ho that B=0, where B is coefficient for category score variable; if p<0.05 best estimate for step from one category to next is different from 0
Mantel Test for Trend
Variable can take any value between lower & upper limit
Continuous Variable
Divide continuous variable by factor; coefficient of variable affected
Rescaling
Subtract continuous variable by factor; intercept affected
Centering
Closely related to counterfactual; compare observed outcome to non-observed (counterfactual) outcome; estimate measures of causal effect by measures of association assuming exchangeability (differences due to confounding)
Potential Outcomes
Observed outcomes in unexposed are good stand-in for unobserved potential outcomes for exposed persons under no exposure & vice versa; not testable but met in expectation with randomization
Exchangeability Assumption
Comparison of pre-treatment covariates in exposed & unexposed groups; comparability doesn’t guarantee assumption met
Exchangeability Assessment
Relax exchangeability assumption to be conditional on covariates; assumes no unmeasured confounders
Conditional Exchangeability
Used to assess how average value of continuous outcome varies systematically with X’s; E[Y] = B0+B1X1+…; B1=average difference (cross-sectional) or change (longitudinal) in Y per 1-unit X1
Linear Regression
Used to assess how log odds binary outcome varies systematically with X’s; log(odds Y)=B0+B1X1+…; B1=difference in log(odds Y) per 1-unit X1; PrOR for cross-sectional or ROR for longitudinal
Logistic Regression
Risk or prevalence > 10%
OR Overestimates RR or PrR
Used to assess how log probability binary outcome varies systematically with X’s; log(Pr(Y=1))=B0+B1X1+…; B1=difference in log(prob Y) per 1-unit X1; PrR for cross-sectional or RR for longitudinal
Log-Binomial Regression
Path from E to O that starts with E & all arrows point in same direction
Causal Path
Any other path from E to O; unconditionally open backdoor paths are confounded vs. unconditionally closed backdoor paths are blocked at collider
Non-Causal Path
Covariate set that leaves all causal paths open & non-causal paths closed vs. does this without any extra variables
Sufficient vs. Minimally Sufficient
Variables only causally associated with exposure; decreases precision if put into model
Instrument
Not necessary for confounder control but may increase precision
Variables Associated with Outcome
Confounding is causal concept but collapsibility is statistical concept; depends on prevalence of outcome & type of measure of association
Problems with Collapsibility Definition
Stratify table by exposure, do not include outcome, & do not include p-values
Causal Inference Table 1