Biostat Flashcards

(117 cards)

1
Q

What is surveillance?

A

The systematic (ongoing) collection of relevant data (disease, injury, hazard) and their constant evaluation and dissemination to all who need to know (for the purpose of prevention)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the surveillance cycle?

A
  1. Plan a change or test
  2. Do the change or test
  3. Observe effects
  4. Study the results
  5. Repeat
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the goal of surveillance?

A

Continuous improvement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the levels of prevention?

A

Primary, secondary, tertiary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Described primary prevention

A

Predisease (no known risk factors or disease susceptibility)
Examples: health promotion activities such as exercise and specific protections such as immunizations, automobile safety measures, recommended nutritional supplements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe secondary prevention

A

Latent disease
Example: screening (in populations and of individuals) for early detection of disease and early treatment of disease (e.g. mammography)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Describe tertiary prevention

A

Symptomatic disease (initial care, subsequent care)
Examples: Disability limitation (e.g. medical or surgical treatment to limit damage from a disease)
Rehabilitation (e.g. rehabilitation after a stroke)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sentinel health event

A

An unnecessary disease, disability, or untimely death which is preventable and whose occurrence serves as a warning signal that preventative and/or medical care may need to be improved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are some goals of surveillance?

A

Estimate magnitude and determinants, targeted intervention, track trends and distribution, identify failure of prevention (sentinel health events)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What morbidity measures are used to describe disease occurrence?

A

Incidence (cumulative incidence and incidence density) and prevalence (period prevalence and point prevalence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is incidence?

A

An estimate of the risk or probability of developing a disease during a specified time period

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Incidence density

A

of new cases during a specified time period
___________________________________
population at risk of disease during the same time period (also measured as person-time)
(x 1,000)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Cumulative incidence

A

of new cases during a specified time period
___________________________________
population at risk
(x 1,000)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which (incidence density or cumulative incidence) is more precise?

A

Incidence density

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is prevalence?

A

Describes the burden of disease in a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Prevalence calculation

A

total # of cases of disease during a time period (or at one point in time)
____________________________________________
total (usually mid-period) population during the same time period

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the relationship between incidence and prevalence?

A

When the disease is stable:

Prevalence = incidence x duration of disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What happens when new treatments that increase longevity of a particular disease are discovered?

A

The prevalence of the disease will increase since, even if incidence rates remain the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are the main sources of morbidity data?

A

Public health surveillances, health surveys, registries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are the 2 main surveillance systems?

A

Active and passive
Active involves outreach by some public authority (most complete and accurate, but expensive)
Passive relies on physician to report

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Sentinel surveillance

A

A surveillance system that uses a prearranged sample of sources who have agreed to report all cases of one or more notifiable diseases
Often uses largest hospitals in a geographic area
Data are not generalizable to the geographic population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Syndromic surveillance

A

Developed for early detection of a large-scale release of a biological agent, current surveillance goals reach beyond terrorism preparedness
Focuses on the early symptom (prodrome) period before clinical or laboratory confirmation of a particular disease
Gathers information about patients’ symptoms during the early phases of illness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are health surveys also called?

A

Prevalence studies

Since they allow for the estimation of the proportion of the population with a particular health problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are the limitations of morbidity data in the U.S.?

A

Severity of illness (only more severe are likely to be reported), access to care, validity of screening test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is ICD-10?
Codes that are used to classify all causes of death on the death certificate Promotes international comparability of mortality statistics
26
What effects accuracy of death information?
Who fills out the form If they follow the instructions If they were the patient's private physician If an autopsy was performed
27
What is crude death rate? What is it poor in?
A very rough measure of the level of morality in a population A particularly poor measure when comparing 2 or more populations which have differing age distributions
28
What is the calculation for crude death rate?
of deaths in one year ___________________ (x 1,000) total mid-year population
29
Where is info needed to calculate crude death rate taken from?
``` # of deaths ----> Vital Registration System Total mid-year population ----> Census Bureau ```
30
Age-specific death rate
of deaths in one year to age group a ______________________________ (x 1,000) mid-year population of age group a
31
What is the calculation for infant morality rate? What is it a good indication of?
number of deaths to children under 1 year ________________________________ (x 1,000) total live births Good indicator of health of a population because it tells you about services available to mothers and babies
32
Cause-specific death rate
Deaths due to a cause during a specified time period ______________________________________(x 100,000) total population during that time period
33
What is case fatality rate?
Presents the risk of dying during a defined period for those who have a particular disease Often used during a disease outbreak Can be used for non-infectious diseases
34
Case fatality rate calculation
``` # of deaths during a specified time period after disease onset _____________________________________ (x 100) # of individuals with that disease during that time period ```
35
What is proportionate mortality?
Presents the proportion of total deaths that are due to a specific cause Tells us, within a population, the relative importance of specific cause of death in the total mortality picture Each cause is expressed as a percentage of all deaths, and the sum of the causes must add to 100% These proportions are not mortality rates, because the denominator is all deaths rather than the population in which the deaths occurred
36
Proportionate mortality calculation
of deaths due to cause x during a specified time period ______________________________________ (x 100) total # of deaths during that time period
37
Proportionate mortality ratio (PMR)
Comparison of 2 proportionate mortalities A PMR greater than 1 indicates that a particular accounts for a greater proportion of deaths in the population of interest than you might expect
38
What is a confounder?
A variable which is related to both study variables and obscures the relationship b/w the variables
39
What are the 2 methods for controlling for a confounding factor?
1. Calculate specific rates (stratification) 2. Use an adjustment or standardization procedure; these procedures allow for adjustment of confounders while providing a summary measure that is easy to work with
40
What are the 2 types of standardization procedures?
1. Direct method of rate adjustment - choose a standard population distribution; calculate adjusted rates by applying the age specific death rates to a standard age distribution 2. Indirect method of rate adjustment - choose a standard set of rates; calculate standardized mortality ratios by applying a standard set of rates to the age distribution of populations of interest
41
What are the outcomes of the 2 standardization procedures?
Direct method of rate adjustment - directly adjusted rate | Indirect method of rate adjustment - Standardized Mortality Ratio (SMR)
42
Standardized Mortality Ratios (SMR)
Ratio of the number of observed deaths to expected death (if your groups experienced the mortality rates of a standard population) often expressed as a percentage
43
What are advantages and disadvantages of crude rate?
Advantages - Simple to calculate | Disadvantages - Does not calculate for the impact of confounders
44
What are advantages and disadvantages of specific rate?
Advantages - Controls confounders; can see more details | Disadvantages - Need detailed data for rates; cumbersome
45
What are advantages and disadvantages of directly-adjusted?
Advantages - controls confounders; summary measures | Disadvantages - need detailed data for rates; can miss details; not a real rate (relative)
46
What are advantages and disadvantages of indirectly-adjusted (SMR)?
Advantages - controls confounders; summary measure; fewer data needs Disadvantages - SMR is not a rate; can miss details
47
When to use which method of rate adjustment?
Specific rates - if data are of good quality and few comparisons are needed Direct method - if summary measure is preferred and data quality is good Indirect method - use with very small numbers, because rates are unstable
48
What factors can rates be adjusted for?
Any factor that might confound results, not just age
49
What are the 2 general types of life tables?
Cohort life tables | Period life tables
50
Cohort life tables
A real group of people or cohort who are followed over time to profile their mortality experience
51
Period life tables
A hypothetical population of 100,000 who experience current mortality trends The probability that an individual will die during any particular year is calculated, using age-specific death rates End result of this table is the life expectancy
52
Descriptive epidemiology
Generates hypotheses | Why this distribution in person/place/time?
53
Case series
Several cases strung together of a certain time
54
Analytic epidemiology
Tests hypotheses 1. Concerned with the determinants of disease-etiological factors 2. Concerned w/ explaining different disease rates in different populations 3. May form the basis for control of disease
55
Common characteristics in cohort
Date of birth Exposure Disease Treatment
56
Relative risk
Rate in exposed persons/rate in non-exposed persons
57
Attributable risk
``` Incidence rate (exposed group) - incidence rate (unexposed group) Attributes risk to that exposure ```
58
Population attributable risk proportion
``` Incidence rate(total population) - incidence rate (unexposed group) ______________________________________________ Incidence rate (total population) ```
59
What is the null hypothesis regarding relative risk?
Relative risk = 1
60
What is the null hypothesis regarding attributable risk?
Attributable risk = 0
61
Relative risk asses what of a risk factor?
Etiological strength
62
Attributable risk asses what of a risk factor?
Public health impact
63
What is effects of non-response in a cohort study with respect to exposure (yes) and outcome (no)?
Incorrect estimate of exposure
64
What is effects of non-response in a cohort study with respect to exposure (no) and outcome (yes)?
Incorrect estimate of outcome
65
What is effects of non-response in a cohort study with respect to exposure (yes) and outcome (yes)?
Incorrect estimate of association, exposure, and outcome
66
What is effects of non-response in a cohort study with respect to exposure (no) and outcome (no)?
No problem
67
Confounding variable
A variable associated w/ both outcome and risk factor(s) of interest
68
How can confounders be dealt with?
Matching (efficient, difficult, $$$) Stratification (less efficient, less difficult) Regression (after the fact)
69
Matching to deal w/ confounders
○ For every person who is a smoker and works in a coal mine, they are matched w/ a non-smoker who works in a coal mine; they are paired ○ Differ only in this one thing (smoking vs. non-smoking) ○ Can see differential in lung cancer rates and now it isn't due to confounding variables since the smokers and non-smokers are matched in all the other things ○ Efficient since w/ relatively few people can pick up effect of smoking
70
Stratification to deal w/ confounders
○ Not as powerful statistically ○ Instead of waiting for one-to-one matches, just do the study any way you want and then divide it into stratas at the end ○ Can make batches (such as those who don't have coal mining as occupation, and comparing smoking and non-smoking) ○ Ratio of death holds up because looking at them in different batches (one batch could be coal-mining and another batch could be non-coal-mining)
71
Regression to deal w/ confounders
○ Can do a regression equation that predicts risk w/ and w/o the confounding variable ○ Can get the statistical significance of things
72
Case-control study
Assemble data based on outcomes; start w/ batch of cases and batch of controls and ask them if the suspected cause was present
73
Odds ratio
Odds ratio = (AD)/(CB)
74
Population attributable risk (case-control studies)
p (odds ratio - 1) _____________ p (odds ratio - 1) + 1 where p = b/(b+d) and is an estimate of the proportion of the population exposed
75
Compare case-control studies and cohort studies
Case-control studies: done w/ smaller samples, relatively cheap, relatively fast to complete, better for rare diseases, subject to more bias in exposure info, subject to incomplete data due to lack of recall or record, able to provide data on the odds ratio as an estimate of relative risk, but no incidence Cohort studies: done w/ larger samples, relatively expensive, relatively slow to complete, better for rare exposures, subject to more bias in disease diagnosis, subject to incomplete data due to loss of follow-up, able to provide data on the relative risk, as well as the incidence of disease outcome
76
Cross-sectional study
Presence of disease or infectious agent int a given population at a given time. Gives population prevalence. At best, may describe temporal association (at that time). Provides no info about: 1. risk factors 2. transmission of disease 3. duration of disease 4. outcome of disease
77
Selection bias
Choosing which drug you take, based on symptoms ("confounding by indication" is a subset of this) - people with worst symptoms pick most-active drug
78
Sampling bias
Who volunteers to be in a study in the first place? Health-aware or needing money/treatment
79
Recall bias
People w/ disease might be motivated to remember risk factors
80
Lead-time bias
Can detect tiny nodules, so time from detection/diagnosis to death increases, but disease treatment might not necessarily be more effective
81
Surveillance bias
Knowing that you are in a study improves your health behaviors/adherence
82
Late-look bias
Using clinic patient population, dead people are not included; less severe symptoms/cases/outcomes are over-represented
83
Sample size considerations in clinical trials
If sample size is too small (underpowered), are exposing people to risk w/o reasonable prospect of an informative experiments (and also wasting resources and polluting the literature with a study affected w/ Type II error, which will delay further progress) If sample size is too big (excess statistical power), you are exposing more people than necessary to risk than would result in a definitive study; and if new treatment is valuable, you are unnecessarily delaying progression to an available, effective treatment
84
Randomization
Patients are randomized into treatment groups, so that high-risk and low-risk pts are just as likely to end up in one treatment as the the other treatment Evens out the distribution of risk factors (even unknown risk factors) Gets rid of conscious or unconscious bias in assignment of treatments
85
Double-blind
Those assessing the results are unaware of which treatment was employed in the specific patient
86
Phases of clinical trials
Phase I - Human pharmacology Phase II - Therapeutic exploratory Phase III - Therapeutic confirmatory Phase IV - Therapeutic Use
87
Cross-over design
Instead of being on either drug A or drug B, each person takes drug A and then drug B or vice versa
88
NNT
"Number needed to treat" How many ppl do you treat before you expect to effect one additional positive outcome? To compute NNT, need to subtract the rate in the treatment group from the rate in the control group and then invert it (divide the difference into 1)
89
When do you reject the null hypothesis?
When p < α
90
Type I error
Also called α error | Declaring that there is something significant when there isn't
91
Type II error
Also called β error | Failing to detect that alternate hypothesis is true
92
Statistical power
Ability to detect | = 1 - (β error rate)
93
How does statistical power vary?
Increases with increased sample size Decreases with increased standard deviation (bc more difficult to distinguish a difference b/w groups when there is increased inherent variability)
94
Nominal measurement scale
Categorical data w/ no order | E.g. what is your gender? Male/female
95
Ordinal measurement scale
Ranked or scaled data | A 4 doesn't necessarily mean twice as bad as 2
96
Numerical quantitative
Underlying continuous distribution | Weight, height, choelsterol
97
Numerical discrete
Number of office visits, number of dental carriers | can't have 1/2 office visit
98
Median
(N+1) x 0.50 = position of the 50% value in an ordered array | This is a common percentile value
99
Range
Difference b/w the largest and smallest values
100
Interquartile range
Difference b/w the 75% and 25% values
101
Standard deviation
Measurement of the deviation of each data point from the arithmetic mean of that set of data
102
When are the mean, median, and mode the same?
When data is unimodal and symmetrical
103
Right skew
Positive skew | Mean is larger than median
104
Left skew
Negative skew | Median is larger than mean
105
Choice of summary measures
Normally distributed data: Mean and SD | Skewed data: Median and interquartile range
106
Choice of statistical tests
Normally distributed data: parametric tests | Skewed data: non-parametric tests
107
Box and whiskers plot
Bottom of box is 25% Top of box is 75% Middle line is 50% Interquartile range: top value (75%) minus the bottom value (25%) of the box Whiskers extend from the ends of the box and to the outermost data points that all within the distance computed: quartile value +/- 1.5 x (interquartile range)
108
Two types of box plots
Quantile (all % values) and outlier (w/ outliers)
109
Sensitivity
P(T+ / D+) | Proportion of people who test positive given that disease positive
110
Specificity
P (T- / D-) | Proportion of people who test negative given that disease negative
111
Predictive value of a positive test
P(D+ / T+) Proportion of people who are classified as positive by the screening test for a particular disease and who in fact have the disease
112
Predictive value of a negative test
P(D- / T-) Proportion of people who are classified as negative by the screening test for a particular disease and who in fact don't have the disease
113
What determines if a test is reliable?
Prevalence of disease in population
114
Likelihood ratio for positive results from a test
LR positive = TPR/FPR = Sensitivity/(1-Specificity) TPR = true positive ratio FPR = false positive ratio
115
Likelihood ratio for negative results from a test
LR negative = FNR/TNR = (1-Sensitivity)/Specificty FNR = false negative ratio TNR = true negative ratio
116
What does an ROC plot tell you?
Which diagnostic test offers the best trade-off b/w true and false positives Describes test performance in terms of area under the ROC plot Allows comparison of different tests' performance across a range of potential positivity criteria
117
In an ROC plot, which curve is better?
Upper, leftmost curve