Midterm Flashcards

Question

Pros and cons of nested case-control studies

Answer 1

Pros:
- exposure is measured at baseline, before development of disease
- controls are selected by random sampling from a well-defined, primary source population
Cons (sources of selection bias):
- incomplete case ascertainment
- cohort losses to follow-up
- selection bias associated with participant selection in the entire cohort itself

Answer 2

When a variable is associated with both exposure and outcome and is not a mediator on the causal pathway. Can be caused by an imbalance between exposed and nonexposed groups in another, extraneous exposure (the confounder).
If there is confounding and the variable is identified and measured, you can adjust for it, as long as there was no bias in the selection of cases or controls within each stratum of the covariate.
Example: if SES is associated only with exposure, but high-SES controls are over-selected, there is an artifactual inverse association between SES and the outcome, which leads to confounding.
Can be addressed through stratification (Mantel-Haenszel method).

Answer 3

Controls and the source pop should be alike with respect to the exposure under study

Answer 4

A meaningful difference between the unadjusted RR and the adjusted RR.
Calculate and inspect the RR for each stratum of the potential confounder. If the stratum-specific RRs are similar to each other (but differ from the unadjusted RR), there is potential confounding. If they differ from each other, it may be effect modification.

Answer 5

Design phase - identify potential confounders by consulting the literature
Data collection - measure potential confounders accurately
Analysis - check theoretical confounders and other study variables; determine whether there is confounding

Answer 6

- methods based on stratification
- multivariable statistical models
- standardization (direct or indirect)

Answer 7

(RRunadj - RRadj) / RRadj x 100
This percentage should be more than 10% to indicate confounding. No need to look at the p-value here.
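The 10% rule above can be sketched in a few lines of Python. The function names and example RRs are hypothetical, and taking the absolute value (so that a change in either direction counts) is an assumption, not something the card states.

```python
def percent_change_rr(rr_unadj, rr_adj):
    """Percent difference between unadjusted and adjusted RR, relative to the adjusted RR."""
    return (rr_unadj - rr_adj) / rr_adj * 100


def suggests_confounding(rr_unadj, rr_adj, threshold=10.0):
    # Assumption: use the absolute value so a shift in either direction counts.
    return abs(percent_change_rr(rr_unadj, rr_adj)) > threshold


# Hypothetical numbers, not from any real study.
print(round(percent_change_rr(2.0, 1.6), 1))   # percent change
print(suggests_confounding(2.0, 1.6))          # exceeds the 10% threshold?
```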

Answer 8

To calculate:
1. Set up i 2x2 tables (where i is the # of strata or categories of a potential confounding variable)
2. Compute the weighted average of the stratum-specific RRs (weights based on the # of subjects or person-time experience in each stratum)
For cohort studies - risk ratio or rate ratio
For case-control studies - odds ratio:
ORmh = sum(ai*di/Ni) / sum(bi*ci/Ni)
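A minimal sketch of the ORmh formula, assuming the usual 2x2 cell layout (a = exposed cases, b = exposed controls, c = unexposed cases, d = unexposed controls). That layout and the counts are illustrative assumptions; the card does not define the cells.

```python
def mantel_haenszel_or(strata):
    """Mantel-Haenszel summary odds ratio.

    strata: list of (a, b, c, d) tuples, one per stratum.
    Assumed layout: a = exposed cases, b = exposed controls,
                    c = unexposed cases, d = unexposed controls.
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d          # Ni: total subjects in the stratum
        numerator += a * d / n     # sum of ai*di/Ni
        denominator += b * c / n   # sum of bi*ci/Ni
    return numerator / denominator


# Hypothetical counts for two strata of a confounder.
strata = [(10, 20, 5, 40), (30, 15, 20, 35)]
print(round(mantel_haenszel_or(strata), 2))
```

With a single stratum the function reduces to the ordinary odds ratio a*d / (b*c), which is a handy sanity check.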

Answer 9

- adjustment for a single confounder that is a categorical variable
- simultaneous adjustment for 2 or 3 confounders, as long as the number of strata for each confounder is relatively small
- M-H is to be used only for categorical variables
- cannot be used when there is a large number of strata - too cumbersome

Answer 10

Linear - no RR estimated
Y = b0 + b1*X1
To find the effect of a 10-year change (when X1 is in years), just use b1 * 10
Null value is 0

The following models are all log-transformed:

Logistic - odds ratio
- used in case-control studies
- other studies with binary dependent variables
- risk prediction
- ignores time
ln(odds of Y) = b0 + b1*X...

Poisson (log-linear) - IRR
- cohort studies with person-time data
- incidence rate studies that use aggregate-level data
ln(incidence rate of Y) = b0 + b1*X...

Cox Proportional Hazards - HRR
- studies with binary outcome and person-time data
- cohort studies
- RCTs
- survival analysis
ln[h(t)] = ln[h0(t)] + b1*X...

Answer 11

unconditional - used in unmatched case-control studies (can also use stratification, such as the traditional Mantel-Haenszel method, with stratification by the matching factors)
conditional - used in some matched case-control studies

Answer 12

For example, X could be any continuous variable.
Ex: N = 10
beta (per year) = 0.049
RR (per year) = e^0.049 = 1.05
beta (per year) * 10 = 0.49
then e^0.49 = 1.63
This is relative to a reference level: the comparison could be 10 vs. 0 or 20 vs. 10, but the RR for an increase of N = 10 will always be the same.
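The rescaling can be checked in Python. This sketch takes the per-year RR of 1.05 as given and derives beta from it; the function name is made up for illustration.

```python
import math


def rr_for_increase(beta_per_unit, n_units):
    """RR for an n-unit increase in a continuous exposure: exp(beta * n)."""
    return math.exp(beta_per_unit * n_units)


beta_per_year = math.log(1.05)                        # beta implied by RR = 1.05 per year
print(round(beta_per_year, 3))                        # 0.049
print(round(rr_for_increase(beta_per_year, 10), 2))   # 1.63
```

Because the model is linear on the log scale, the result does not depend on the reference level: going from 10 to 20 gives the same RR as going from 0 to 10.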

Answer 13

Can model the exposure as a single term; if the relationship is linear, it can be used as a continuous variable. A test for trend can be used to assess evidence of an exponential trend (linear on a log scale). It is only applied to exposures with a natural order.
To do this:
1) model the variable as a categorical variable to capture the shape of the dose-response relationship
2) model the variable as a single term in a separate regression model to test for trend
p-value for trend = p-value for b1 in the single-term model. If p < .05, the trend is significant.

Answer 14

Stratification-based method of comparing rates of an outcome between two populations that have different distributions of one or more confounders.
The goal is to make the comparison fair (i.e. to remove confounding) by forcing the two populations to have the same covariate distribution.

Answer 15

Used for retrospective (historical) cohorts with an external comparison group such as special exposed cohorts. Standardization covariates MUST be categorical

Answer 16

incidence ratio: total observed cases / total expected cases
mortality ratio: total observed deaths / total expected deaths
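A small sketch of how the observed/expected ratio is usually computed, assuming the expected count is obtained by applying reference-population rates to the cohort's person-time in each stratum. The stratum labels, rates, and counts below are hypothetical.

```python
def expected_count(person_time, reference_rates):
    """Expected events: sum over strata of cohort person-time x reference rate."""
    return sum(person_time[s] * reference_rates[s] for s in person_time)


def standardized_ratio(observed, expected):
    """Total observed / total expected (incidence or mortality ratio)."""
    return observed / expected


# Hypothetical age strata (person-years and deaths per person-year).
person_time = {"40-49": 1000, "50-59": 2000}
reference_rates = {"40-49": 0.002, "50-59": 0.005}

expected = expected_count(person_time, reference_rates)
print(expected)                          # expected deaths in the cohort
print(standardized_ratio(18, expected))  # observed deaths = 18 (made up)
```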

Answer 17

When your study adjusts for a variable or set of related variables, but the adjustment does not completely remove the confounding by that variable or those variables. Causes:
- Coarse categorization: categories that are too broad leave heterogeneous groups of people within each stratum. This is problematic because these heterogeneous groups could also differ with respect to their exposure prevalence and risk of the outcome.
- Suboptimal modeling of the confounder in a multivariable model (e.g. modeling a covariate as continuous when the true dose-response curve is U-shaped)
- Inadequate adjustment for complex, multidimensional confounders, such as smoking, SES, and health status
- Inadequate measurement of the confounder (measurement error - unvalidated data collection instrument), or collection of insufficiently detailed information
**If confounding remains because a particular confounder was not adjusted for at all, this is NOT considered residual confounding.

Answer 18

Healthy vaccinee effect - seniors who are unvaccinated tend to be those at high short-term risk of death

Answer 19

Measurement - measure potential confounders as carefully as the exposure under study, especially if they are multidimensional.
Data analysis - use sufficiently fine covariate categorization, optimize modeling of covariates in multivariable models, and strive to capture the full dimensionality of multidimensional confounders in multivariable models.
*However, need to take into account the statistical imperative of model parsimony - the ratio of # of outcomes to # of covariates should be more than 10.
Interpretation - be transparent about the residual confounding in interpretation and how it could be better accounted for.

Answer 20

Adjust for one or more potential confounders in the design phase of your study.
Select non-exposed participants who are similar to the exposed participants with respect to the distribution of one or more potential confounders. These potential confounders are called matching factors.
When matched, there is no need to account for the matching factors in the analysis phase, but only if there is complete follow-up.

Answer 21

To adjust for one or more potential confounders in the design phase of the study.
Selection of controls who are similar to cases with respect to their distribution of one or more potential confounders.
However, matching in the design phase alone does not completely remove confounding, so you still need to adjust in the analysis phase.
Matching intentionally introduces selection bias and creates a new, superimposed confounding toward the null.
Matching on a true confounder increases statistical efficiency by optimizing precision.

Answer 22

Selection of controls such that the distribution(s) of one or more potential confounders is/are similar in cases and controls.
Often used when matching factors are demographic variables (e.g. age, sex, race).
For example, if some strata have 0 individuals, you risk not being able to use the data from all subjects in the study, leading to reduced statistical efficiency.

Answer 23

Selection of one or more controls that are identical to a given case with respect to one or more potential confounders.
Useful for:
- controlling for a confounder using "fine stratification" (mini strata)
- matching factors that are multidimensional confounders
- using risk-set sampling of controls in nested case-control studies
The matched set is the stratum.
Cannot do twin studies with unmatched case-control.
Must use:
- conditional logistic regression - don't need to include matching factors, OR
- stratification - Mantel-Haenszel matched analysis (McNemar test) - this gives the matched OR

Answer 24

For each case, N matched controls can be randomly sampled from the case's risk set - can restrict the risk set by matching factors.
Enables selection of controls from the same risk set as the case - same concurrent time at risk for development of the outcome.

Answer 25

Four possible combinations of matched pairs: concordant, concordant, discordant, and discordant. Only need to look at the discordant pairs.
q  r
s  t
r and s are the discordant pairs.
Matched odds ratio = r/s
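A rough sketch of the matched-pair calculation. It assumes r counts pairs where the case is exposed and the control is not, and s counts the reverse; the card does not label the cells, so treat that assignment as an assumption. The pair counts are invented.

```python
def matched_odds_ratio(pairs):
    """Matched OR from 1:1 matched pairs.

    pairs: list of (case_exposed, control_exposed) booleans.
    Assumption: r = case exposed / control unexposed,
                s = case unexposed / control exposed.
    """
    r = 0
    s = 0
    for case_exposed, control_exposed in pairs:
        if case_exposed and not control_exposed:
            r += 1
        elif control_exposed and not case_exposed:
            s += 1
        # concordant pairs carry no information and are ignored
    return r / s


# Hypothetical data: 10 + 45 concordant pairs, 30 and 15 discordant pairs.
pairs = ([(True, True)] * 10 + [(True, False)] * 30
         + [(False, True)] * 15 + [(False, False)] * 45)
print(matched_odds_ratio(pairs))
```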

Answer 26

No, because matching forces controls to be the same as cases with respect to the matching factor. Therefore, there is no way to estimate the association between the matching factor and the outcome.

Answer 27

Overmatching generally refers to matching that is counterproductive, by either causing bias or reducing efficiency. It creates a new, superimposed confounding toward the null and leads to loss of statistical efficiency. Overmatching must be corrected in the analysis phase.
Examples:
- matching on a mediator in the causal pathway between exposure and disease biases the effect estimate toward the null
- matching on a non-confounder that is associated with exposure, but is not a risk factor for disease, reduces statistical efficiency

Answer 28

Study of the distribution of time elapsed from a baseline time to an outcome (event).
Study of how exposures (including treatments) affect the distribution of time to event.
Used for two study designs: cohort studies and RCTs.
Baseline examples: date of entry into a cohort, birth date, etc.
Outcome examples: death, incident disease, disease cure, etc.
It is better to experience a beneficial outcome earlier than later.
It is better to experience an adverse outcome later than earlier.

Answer 29

CI (0 to 1) is the proportion of a specified population at risk that experienced the outcome under study during a specified time period.
- probability (risk) of experiencing the outcome under study in the specified time period
CS (0 to 1) is the proportion of a specified population at risk that does NOT experience the outcome under study (i.e. "survives") during a specified time period.
- probability of NOT experiencing the outcome under study in the specified time period
Both can be calculated directly if the cohort is closed.

Answer 30

CI curve is the proportion of subjects who have experienced the event as a function of time since baseline CS curve is the proportion of subjects who have NOT experienced the event as a function of time since baseline

Answer 31

where CI = CS = .5

Answer 32

1. rank survival times from lowest to highest
2. create intervals that start when one or more events occur
3. calculate cumulative incidence during each interval (cumulative events / total)
4. cumulative survival is calculated the same way, except that events are subtracted from the total (survivors / total) instead of added up (events / total)

Answer 33

Cumulative incidence will be underestimated because the calculation assumes that those who withdrew, were lost to follow-up, or died did not experience the outcome.

Answer 34

1. Rank survival times from lowest to highest
2. Divide survival time into intervals that start when one or more events occur (ei = # of events, ci = # censored) and calculate the # at risk at the start of each interval (ni)
3. Calculate the probability of surviving each interval: pi = (ni - ei)/ni
4. Calculate cumulative survival during each interval: Si = Si-1 x pi (cumulative survival before the first event is always 1)
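The four steps can be sketched in plain Python as follows. This is an illustrative implementation, not a replacement for a statistics package; the function name and the toy data are made up, and it assumes subjects censored at the same time as an event are still counted as at risk at that time.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier table.

    times:  follow-up time for each subject
    events: 1 if the outcome occurred at that time, 0 if censored
    Returns rows of (time, n_at_risk, n_events, p_interval, cumulative_survival).
    """
    data = sorted(zip(times, events))        # step 1: rank survival times
    n_at_risk = len(data)
    survival = 1.0                           # cumulative survival starts at 1
    table = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0
        n_censored = 0
        while i < len(data) and data[i][0] == t:   # group ties at time t
            if data[i][1]:
                n_events += 1
            else:
                n_censored += 1
            i += 1
        if n_events > 0:                     # step 2: intervals start at event times
            p = (n_at_risk - n_events) / n_at_risk   # step 3: pi = (ni - ei)/ni
            survival *= p                            # step 4: Si = Si-1 x pi
            table.append((t, n_at_risk, n_events, p, survival))
        n_at_risk -= n_events + n_censored   # drop events and censored subjects
    return table


# Hypothetical follow-up times (months) and event indicators.
for row in kaplan_meier([2, 3, 3, 5, 7, 8], [1, 1, 0, 1, 0, 1]):
    print(row)
```

Censored subjects never count as events; they only leave the risk set, which is how the estimate takes censoring into account.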

Answer 35

Termination of follow-up for a subject on a specified date because it is unknown whether the outcome occurred or would have occurred after that date.
Kaplan-Meier survival estimates calculate cumulative incidence/survival taking censoring into account, but assume that censoring is unbiased.

Answer 36

Compares K-M curves for 2 or more groups.

Answer 37

Compares K-M curves for 2 or more groups using stratification to control for confounding.
Limitation: the method breaks down if the data become too sparse.

Answer 38

The further to the right, the fewer subjects at risk and the more uncertainty.
Good practice is to end the plot at a follow-up time when only 10-20% of subjects are still at risk.

Answer 39

KM survival curves (descriptive and cannot readily calculate RR or adjust for multiple covariates) and cox proportional hazards regression

Answer 40

- allows the baseline hazard to vary over time
- assumes the hazard ratio is constant over time, which is equivalent to stating that the exposure-outcome relationship is NOT modified by follow-up time (follow-up time is therefore not an effect modifier)
- if the PH assumption is violated, then follow-up time is a modifier and stratification by follow-up time would be needed
- allows adjustment for multiple covariates and provides an RR

Answer 41

the instantaneous incidence rate at a point in time (change in number of new cases at time point) - basically the slope between two points on the curve. Incidence rate could change with time

Answer 42

When the proportional hazards curves cross one another.

Answer 43

less than or equal to

Answer 44

more than or equal to

Answer 45

direct methods (gold standard) - determine the cause of death for each decedent. This can be done by review of medical records or death certificates, but medical records are better.
indirect methods - take overall mortality estimates and apply a correction to them in order to estimate the number of deaths due to a specific cause - done through relative survival

Answer 46

Provides an estimate of cause-specific survival in a cohort. Corrects for deaths from causes other than the disease under study.
RS = observed OS / expected OS
If expected OS = 1, RS = observed OS
If expected OS < 1, RS >= observed OS
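A tiny numeric illustration of the RS formula, with invented survival proportions; the function name and the input check are assumptions added for the example.

```python
def relative_survival(observed_os, expected_os):
    """RS = observed overall survival / expected overall survival."""
    if not 0 < expected_os <= 1:
        raise ValueError("expected OS must be in (0, 1]")
    return observed_os / expected_os


# Hypothetical 5-year figures: 60% observed OS in the diseased cohort,
# 80% expected OS in a comparable general population.
print(round(relative_survival(0.60, 0.80), 2))
# With expected OS = 1, RS equals observed OS.
print(relative_survival(0.60, 1.0))
```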

Answer 47

Usually the expected OS of persons of the same demographics and calendar period, taken from publicly available vital statistics data.
Key assumption: OS in the diseased cohort would be the same as the OS of the comparison population if the cohort members did not have the disease (i.e. the only difference between the two groups is the disease).

Answer 48

Variation in the magnitude of the association between an exposure and an outcome across strata of a second exposure (the effect modifier).
Has an underlying public health, clinical, biologic, or psychosocial basis. Not merely a statistical phenomenon.
Can be assessed through stratified analysis and multivariable models.
Effect modification is reciprocal, since there is an interaction.

Answer 49

If you stratify and the RRs for each stratum are not similar, there is potential for effect modification. In that case, calculate a p-value for heterogeneity (interaction); this is a likelihood ratio test. If the p-value for heterogeneity is significant, there is effect modification/interaction; if not, there is no effect modification/interaction.
In a model, calculate the p-value of the interaction term or, if there are multiple interaction terms, of the interaction terms in aggregate.


(75 cards)