Biostatistics Flashcards
(55 cards)
How do you calculate annual incidence
Number of new cases in a given year/the number of people in that population at risk of developing that condition
Note: people who already have the condition or who are not able to get the condition are not included in the denominator
Note: if there are 200 new cases but all of them die before the end of the year the incidence would still use 200
Standardized mortality ratio
Observed number of deaths/expected number of deaths
Note: SMR of 2 indicated that mortality of that population is twice that of the general population
How do you calculate maternal mortality rate
Number of maternal deaths/number of live births
Note: women who were pregnant but had miscarriages are not included in this calculation
When would you use odd ratio vs relative risk
You can only use relative risk in prospective studies that follow pts over time so that you can calculate the risk of developing a disease over a certain period of time
An odds ratio is used during case control studies where you record data at a single point in time (recording who has been exposed to the exposure of interest and who has developed the disease in question). You can’t calculate risk in this situation because you didn’t follow a group of people after being exposed, so you calculate the odds that a pt with the disease had the exposure vs the odds that a pt without the disease had the exposure.
How do you calculate relative risk?
(Number of exposed pts who developed the disease/total number of exposed pts) divided by (number of unexposed pts who developed the disease/total number of unexposed pts)
How do you calculate an odds ratio?
(Number of exposed people who have the disease/number of unexposed people with disease) divided by (number of exposed people without disease/number of unexposed people without disease)
Note: you are calculating the odds of exposure in disease pts and dividing that by the odds of exposure in pts without the disease
Coefficient of determination
A value used to describe degree of correlation that expresses the percentage of variability in the outcome factor that is explained by the predictor factor
Note: If folic acid intake and plasma homocysteine levels have a correlation coefficient of 64%, then 64% of the variability in plasma homocysteine levels is due to changes in folic acid intake
How do you calculate the coefficient of determination?
It is the square of the correlation coefficient
E.g. two factors that have a correlation coefficient of -0.8 have a coefficient of determination of 0.64 (or 64%)
If 30 of 60 smokers develop lung cancer and 10 of 40 non smokers develop lung cancer, what is the attributable risk of smoking?
Risk in smokers is 0.5 and risk in nonsmokers is 0.25
Attributable risk = 0.5 - 0.25 = 0.25
Note: Atributable risk is the risk of developing disease in exposed pts - risk of developing disease in unexposed pts
Attributable risk percent
The percentage of disease cases in exposed pts that can be attributed to that exposure
Note: if 20% of smokers develop lung cancer and 10% of nonsmokers develop lung cancer, then the attributable risk percent is 50% (50% of smokers who developed lung cancer got it because they were smokers)
Population attributable risk
The percentage of total cases that are attributable to a certain exposure
Note: 30 of 60 smokers develop lung cancer and 10 of 40 nonsmokers develop lunch cancer, the population attributable risk is 37.5% (because 50% of smokers who got lung cancer, 15 people, got it because of smoking. Out of 40 people who developed lung cancer, 15 of these can be attributed to smoking: 15/40 = 37.5%)
How to calculate the Number needed to treat
1/absolute risk reduction
Note: absolute risk reduction = risk of untreated pt developing outcome - risk of treated pt developing outcome
25 of 50 pts treated with a new drug survive 5 years and 25 of 100 pts treated with standard therapy survive 5 years. What is the absolute risk reduction?
Mortality in those treated with standard therapy is 0.75 and mortality in those treated with the new drug is 0.5, so the absolute risk reduction is 0.75 - 0.5 or 0.25
Note: the number needed to treat is then 1/0.25 or 4
Disability-adjusted life years
DALYs are a way to measure disease burden that estimates the total number of years of life lost due to that disease. It can be calculated by adding the years of life lost due to decreased quality of life PLUS the years of life lost due to premature mortality
If a pt develops depression at age 30 and commits suicide at age 50 (assuming life expectancy at age 50 is 84 and that the disability weight for depression is 0.35), then:
DALY = years of life lost (84-50) + years of life with disability (30 + (20 x 0.35)) = 41 years (lost 7 years due to loss of quality of life during disability and lost 34 years due to premature mortality)
Note: DALYs should be minimized. Disability weights are similar to time trade offs used for quality-adjusted years of life but are standard values used for populations, whereas TTOs are an individual pts self-reported number
Quality-adjusted life years
QALYs is way to measure the burden of a disease. The pt thinks about how many years with their current disease they would be willing to trade for 1 year of life at full health.
If a pt states that 5 years in their current state is equivalent to 1 year at full health, then the time trade off is 1/5 or 0.2, which can then be used to calculate QALYs. If this pt was healthy until age 30, then had disease until present at age 40. The quality adjusted life years would be 32 (30 years at full health + 10 years at 0.2)
Note: The goal is to maximize QALYs through treatments that increase the time trade off factor (treatments that increase quality of life)
What graph is often used in survival analysis (e.g. determining whether peritoneal or hemodialysis prolongs survival)?
A Kaplan-Meier survival curve, which reports the proportion of subjects surviving at each time point through the study (the slower the decline in the curve the more likely those subjects were to survive)
Note: There is only statistical significance if the p-value reported from a log-tank test of the curve is < 0.05
What does an odds ratio < 1 indicate
The exposure being studied is associated with a lower odds of the outcome
When would you use any odds ratio rather than relative risk?
In case control or cross sectional studies (where you already know who has the outcome and who doesn’t and you’re trying to figure out what exposures may be associated with higher odds of the outcome)
Note: Relative risk is used for observational or experimental follow-up studies where subjects are tracked through time to determine whether an outcome occurs
Multiple linear regression vs multiple logistic regression
Multiple linear regression is used to evaluate associations between 1 quantitative dependent variable (a primary outcome with a numerical value such as LDL level) and 2 or more independent variables that can be either quantitative or qualitative
E.g. Use multiple linear regression to evaluate the association between statin use and LDL levels while also adjusting for BMI, creatinine clearance, and sex (dependent variable, LDL levels, is quantitative)
Multiple logistic regression is used to evaluate the association between 1 dichotomous dependent variable (primary outcome has only 2 options, such as alive or dead) and 2 or more independent variables
E.g. Use multiple logistic regression to evaluate the association between obesity and the presence of diabetes while adjusting for age and sex (dependent variable, presence of diabetes, is dichotomous because they either have or do not have diabetes)
What is it called when a study is designed to evaluate multiple interventions and all possible combinations of those interventions?
Factorial study (or fully crossed design)
E.g. A study that randomized pts to treatment with placebo, glutamine alone, antioxidants alone, or glutamine + antioxidants
Statistical power
The ability of a study design to identify a statistically significant difference between two groups if a difference actually does exist
Power = 1 - beta
Note: Beta is the rate of type II errors (failure to reject a false null hypothesis)
Case fatality rate vs mortality rate
The case fatality rate is the proportion of known cases of a particular condition who end up dying from that condition
The mortality rate is the probability of dying from a particular disease in the general population
Standardized mortality ratio
An epidemiological parameter used to determine whether there is an unusually high number of deaths in a given group by comparing it to how many deaths would be expected in a similar group of the general population (controlling for age, sex, etc)
It is calculated using the mortality rates for the general population of the disease in question to estimate the number of expected deaths, which are compared to the observed deaths to give the standardized mortality ratio
How do you calculate specificity?
True negatives/(true negatives + false positives)
Note: This is the probability of a non-diseased person actually testing negative