Biostatistics Flashcards

1
Q

What are 3 principles of clinical trial design?

A

Physician-Patient relationship must exist, Physician must act in patient’s best interest, Written consent must be obtained from patient after full disclosure of clinical trial

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When should median be used in place of mean?

A

When data is highly-skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Relationship between mean, median, mode in setting of negative skew?

A

Mean < Median < Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Relationship between mean, median, mode in setting of positive skew?

A

Mode < Median < Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Benefit of pooling data from several different studies in a meta-analysis?

A

Increases power of study … by increasing N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe the difference between effect modification and confounding?

A

Effect modification = SIGNIFICANT difference between strata; Confounding = NO significant difference between strata … (in confounding, the significance disappears)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Equation for Positive Predictive Value (PPV)?

A

A/(A+B) … TP/(TP+FP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Definition of PPV?

A

Probability that patient actually has a disease, given a (+) test result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Observer bias typically occurs in which setting?

A

Absence of blinding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Observer bias is particularly likely when the studied outcome is …

A

Qualitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Evaluation of multiple endpoints during a study increases the risk of …

A

Type 1 error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe a Type 1 error?

A

Reject the null, when the null was correct … (say there’s a difference, when there really wasn’t)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

___ refers to increased likelihood of making a Type 1error when evaluating multiple end points?

A

Multiplicity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define net clinical benefit?

A

Measure of a medication’s possible benefit, minus its possible harm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the most important aspect of a study, according to the Intention To Treat (ITT) principle?

A

Randomization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Intention To Treat (ITT) principle attempts to avoid the effects of …

A

Crossover, Dropout

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the goal of Phase 4 clinical trials?

A

Post-marketing surveillance to monitor safety of medications that have already been released on market

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Why might some clinical trials fail to detect serious side effects of new medications before Phase 4?

A

Inadequate power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Calculations for Positive Predictive Value (PPV) requires knowledge of …

A

Prevalence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

___ is an expression of sensitivity and specificity that can be used to assess the value of a diagnostic test, independent of prevalence

A

Likelihood ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Equation for (+) likelihood ratio?

A

Patient with disease testing (+) / Patient without disease testing (+) … A / B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Equation for (–) likelihood ratio?

A

Patient with disease testing (–) / Patient without disease testing (–) … C / D

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Smaller LR corresponds to ___ disease prevalence

A

Less likely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Most valid measure of central location for normally distributed data?

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Most valid measure of central location for date with skewed distribution?
Median
26
___ refers to bias in which working populations are generally healthier than general population, exhibit lower mortality rates
Healthy worker effect
27
Healthy worker effect is a type of ___ bias
Selection
28
Describe sensitivity?
Probability of detecting a disease if it is present
29
Area under curve (AUC) if a Reciever-Operating Characteristic (ROC) is a reflection of …
Diagnostic accuracy
30
Describe statistical power?
Ability to detect an association, if that association exists
31
Value of meta-analysis?
Increase power of study
32
Higher SN means fewer …
False (–)
33
Higher SP means fewer …
False (+)
34
Relationship between SP and PPV?
Higher SP will increase PPV
35
Description of length-time bias?
Survival benefits of a diagnostic test are overstated … due to detection of a disease that is benign, slowly-progressive
36
Description of lead-time bias?
Test diagnoses a disease earlier … time from diagnosis to death appears prolonged … appears like early detection improves survival … BUT actually, disease was just diagnosed EARLIER in course
37
Randomized clinical trials are designed to test ___, but not ___
Efficacy; AEs
38
___ occurs in a meta-analysis when studies with significant (+) results are included, but studies with (-) non-significant results are not included
Publication bias
39
Greatest source of bias in case-control studies?
Recall bias
40
Quadriplegic patient remarks that that 5 years in his current state are equivalent to 1 year in full health – what is he referring to?
Quality-adjusted life years (QALYs)
41
Quadriplegic patient remarks that that 5 years in his current state are equivalent to 1 year in full health – what is his Time Trade-Off?
1/5 = 0.2
42
A patient had full health until 30 yo, then developed disability with TTO = 0.2; What is QALY at 35 yo?
30(1) + 5(0.2) = 31
43
Question for calculating Disability-Adjusted Life Years (DALYs)?
Years of Life Lost (YLL) + Years Lived with Disability (YLD) = YLL + YLD
44
Description of a factorial study design?
Study design that utilizes 2+ interventions with ALL possible combinations of those interventions … in a study with 2 interventions, a factorial design has 4 possible groups … Treatment A, Treatment B, Treatments A+B, Placebo (neither)
45
Alternate name for factorial study design?
Fully-crossed study
46
Equation for NNT?
1/ARR (absolute risk reduction)
47
What is the NNT for control group rate = (8/1000) and treatment group rate (6/1000)?
1/ ((8/1000) – (6/1000)) = 1/(0.002) = 500
48
In which situation would you need a larger sample size to show a significant difference between 2 groups … decreasing power, increasing a from 0.01 to 0.05, showing difference between groups = 5 instead of 8, using 1-tailed test instead of 2-tailed test?
Showing difference between groups = 5 instead of 8
49
Does using a 1-tailed or 2-tailed test require MORE power (aka larger sample size)?
2-tailed test
50
___ refers to repeating primary analysis calculations in a study by modifying certain criteria or variable ranges to determine whether those modifications significantly affect the results initially obtained
Sensitivity analysis
51
___ refers to bias that occurs when test A diagnoses a disease earlier than test B; As a result, the time from initial diagnosis to death appears prolonged with test A, although there is no actual improvement in survival
Lead-Time Bias
52
Recall bias is also known as a misclassification of ___
Exposure
53
Equation for attributable risk percent (ARP)?
[(Risk in exposed) – (Risk in unexposed)] / (Risk in total population)
54
Prevalence of smoking in a population is 50%; 5-year risk of ischemic CVA is 1:1000 in smokers, 0.5:1000 in non-smokers – what is the attributable risk percent of smoking on ischemic CVA?
(0.001-0.0005)/[(0.001*.50) + (0.0005*.50)] = 33%
55
Equation for Absolute Risk Reduction?
ARR = control rate – treatment rate
56
Equation for Relative Risk Reduction?
RRR = ARR/control rate
57
Equation for Relative Risk?
RR = treatment rate/control rate
58
Equation for NNT?
NNT = 1/ARR
59
Definition of positive likelihood ratio?
(probability of patient with disease testing +) / (probability of patient without disease testing +)
60
Definition of negative likelihood ratio?
(probability of patient with disease testing –) / (probability of patient without disease testing –)
61
Relationship between likelihood ratios and prevalence of disease?
LRs are independent of prevalence
62
Equation for Positive LR?
Positive LR = SN/(1-SP)
63
Equation for Negative LR?
Negative LR = (1-SN)/SP
64
LR < 1 indicates …
Decreased likelihood that disease is present
65
LR = 1 indicates …
No change in likelihood of disease
66
LR > 1 indicates …
Increased likelihood that disease is present
67
Main measure of association for case-control studies?
OR
68
Main measure of association for cohort studies?
RR
69
Main measure of association for cross-sectional studies?
OR
70
How can you tell if a difference between 2 groups is significant, looking at standard error bars?
Overlap of standard error bars = non-significant difference
71
Definition of Hazard Ratio?
Likelihood of an event occurring in a treatment group relative to a control group
72
What is the null value for Hazard Ratio?
1
73
Interpretation of Hazard Ratio < 1.0?
Event is less likely to occur in a treatment group vs. control group
74
Interpretation of Hazard Ratio > 1.0?
Event is more likely to occur in a treatment group vs. control group
75
Value of multiple linear regress?
Evaluate association between 1 quantitative dependent variable and 2+ independent variables (either quantitative or qualitative)
76
How can you make a confidence interval more narrow?
Increase the power (sample size)
77
Asymmetry in a funnel plot suggests …
Publication bias
78
Utility of the ANOVA test?
Comparing mean value of continuous variable across several (3+) groups
79
Which statistic does ANOV test generate?
F statistic … which is then used to calculate a p-value
80
___% of data in a normal distribution lies within 1 standard deviation
68
81
___% of data in a normal distribution lies within 2 standard deviations
95
82
___% of data in a normal distribution lies within 3 standard deviations
99.7
83
Relationship between mean, median, mode in a R-skewed distribution?
Mode < Median < Mean
84
___ refers to probability of an individual testing positive given the presence of disease
Sensitivity
85
___ refers to probability of an individual testing negative given the absence of disease
Specificity
86
___ refers to probability that an individual who tests positive for disease actually has the disease
Positive Predictive Value
87
___ refers to probability that an individual who tests negative for disease actually does not have the disease
Negative Predictive Value
88
___ refers to the probability of an individual with the disease testing positive divided by the probability of individual without the disease testing positive
Positive likelihood ratio
89
___ refers to the probability of an individual with the disease testing negative divided by the probability of individual without the disease testing negative
Negative likelihood ratio
90
Equation that relates Positive likelihood ratio to SN and SP?
LR+ = SN/(1-SP)
91
Equation that relates Negative likelihood ratio to SN and SP?
LR- = (1-SN)/SP
92
2 values that must be known to calculated the PPV and NPV?
Prevalence, Pre-test probability
93
Relationship between sample size (n) and Type 1 error?
Type 1 error increases as sample size increases
94
Equation for attributable risk percentage?
[(Risk exposed) - (Risk nonexposed)] / (Risk exposed)
95
Equation for hazard ratio?
(Hazard rate in intervention group) / (Hazard rate in control group)
96
RCT investigates anti-inflammatory intervention as treatment for neuropathic pain; Prelim results shows significant reduction in pain score among treatment group (p=0.06); Significant reduction in pain among non-obese subjects (0.01), but non-significant increase in pain scores among obese subjects (p=0.06) - what is best explanation for difference in finding between obese and non-obese subjects?
Effect modification
97
Relationship between PPV and R (correction coefficient)?
Higher R, Higher PPV
98
What is non-ABCD equation for RR?
(rate of outcome in group 1) / (rate of outcome in group 2)
99
Measure of association for case-control studies?
OR
100
Primary endpoint of study is 30% reduction in PTH after treatment with Drug X or Drug Y; Primary efficacy for Drug X = 74%, for Drug Y = 85.4% with p<0.001; What is an appropriate 95% confidence interval for this study?
11.4 must be included in CI and at center of CI; CI cannot include 0
101
DHS investigates an outbreak of Gi illness; Total of 25 individuals with GI bug and 75 individuals without GI bug are studied; 96% of patients with GI bug ate a local restaurant 2-4 days prior, 4% of patients without GI bug ate at same restaurant - what is the study design here?
Case-Control
102
Highly sensitive tests are useful for___; highly specific tests are useful for ___
Screening; Confirmation
103
Measure of association for case-control studies?
OR
104
Measure of association for cohort and RCT studies?
RR
105
Description of a nested case-control study?
Case-control study that draws its cases and controls from people participating in a follow-up (cohort or RCT) study
106
A screening test with high ___ is the best for early detection of breast CA in a population with stable incidence of disease
SN
107
Group of researchers wants to investigate an acute outbreak of acute diarrhea; Approximately 50 people developed severe hemorrhagic diarrhea, thought to be related to local seafood; Which study design is most appropriate to test this hypothesis?
Case-control
108
Difference between indications for case-control vs. cross-sectional studies?
Case-control = investigate outbreaks … Cross-sectional = compare prevalence
109
Utility of case-control studies?
Investigate outbreaks
110
Cross-sectional studies begin with …
Risk factor/exposure … investigate outbreaks
111
Utility of cross-sectional studies?
Compare prevalence
112
Which study design is helpful for studying rare exposures?
Cohort
113
Equation for Standardized Mortality Ratio?
(Observed # of Deaths) / (Expected # of Deaths)
114
Purpose of pre-clinical trials?
Explore if & how new treatment may work
115
Study population of pre-clinical trials?
Laboratory and animal models
116
Purpose of phase 1 clinical trials?
Evaluate safety
117
Study population of phase 1 clinical trials?
Healthy subjects, small number
118
Purpose of phase 2 clinical trials?
Explore efficacy (does it work?)
119
Study population of phase 2 clinical trials?
Small number of affected subjects
120
Purpose of phase 3 clinical trials?
Compare new treatment to current Standard of care
121
Study population of phase 3 clinical trials?
Large number of affected subjects randomly assign to treatment or control
122
Purpose of phase 4 clinical trials?
Identify rare and long-term adverse effects
123
A study has power of 80%; what is the probabilty of finding a significant effect if none actually exists?
Type 1 error = alpha; alpha = P value
124
Description of (+) LR?
(probability of diseased patient testing positive) / (probability of healthy patient testing positive)
125
Description of (-) LR?
(probability of diseased patient testing negative) / (probability of healthy patient testing negative)
126
Equation for (+) LR?
SN / (1-SP)
127
Equation for (-) LR?
(1-SN)/SP
128
What is the meaning of the kappa statistic?
Inter-rater reliability (concordance)
129
11 mo female presents shortly after diagnosed with rare disease; Parents ask about factors that mayhave contributed to developing of disease - which study design would be best?
Case Control
130
___ studies are best at identifying potential risk factors for rare diseases
Case Control
131
Description of “per protocol” analysis?
Compares treatment groups in an RCT by including ONLY patients who strictly adhered to protocol
132
___ refers to an analysis that compares intervention groups by including all subjects as initially allocated after randomization, regardless of what happens during the study period
Intention to Treat (ITT)
133
Intention to Treat (ITT) analysis leads to a ___ estimate of intervention effect
Conservative
134
The results of a Intention to Treat (ITT) analysis reflect the __ effect of intervention as intended in the population
Real
135
___ refers to an analysis that compares intervention groups by including data only from subjects who complete the intervention originally allocated at time of randomization
Per protocol
136
___ refers to an analysis in which subjects are evaluated based on intervention they received, rather than the intervention to which they were randomized
As-treated analysis
137
Which type of analysis is used to help preserve the benefits of randomization in superiority trials?
Intention to Treat (ITT)
138
When comparing performance measures of 2 different tests, which statistic can show that there is no significant difference between the tests?
Overlapping CIs
139
Which type of screening test will identify individuals without disease?
Screening test with high SP and high NPV
140
2 tests for heterogeneity?
Q statistic, I2 index
141
Which result of Q statistic suggests no heterogeneity?
P > 0.05
142
Which result of I2 index suggests mild heterogeneity?
25%
143
Which result of I2 index suggests moderate heterogeneity?
50%
144
Which result of I2 index suggests high heterogeneity?
75%
145
What is an epidemiologic parameter that does not vary with disease prevalence and provides useful information for individual patients?
Likelihood ratio (LR)
146
Equation for positive LR?
Positive LR = SN/(1-SP)
147
Equation for negative LR?
Negative LR = (1-SN)/SP
148
___ refers to type of measurement bias that occurs when a study uses gold standard testing selectively in order to confirm a positive (or negative) result of preliminary test
Verification bias
149
Effect of confounding variable on statistical significance?
Difference disappears when patients are stratified (based on the confounding variable)
150
Confounding threatens the ___ of a study
Internal validity
151
An epidemiologic study looks at adverse events in 10,000 patients who underwent PCI; According to results, 2% of patients who received metal stents developed thrombosis; 40% of patients who developed stent thrombosis passed away – the risk of death in a patient who develops stent thrombosis is best described as …
Case fatality rate
152
How would you describe mortality rate?
Likelihood in general population of dying from disease
153
Equation for FPR?
FPR = 1 – SP
154
Equation for FNR?
FNR = 1 – SN
155
Definition of standardized incidence ratio?
Observed # of cases / Expected # of cases
156
What is the value of standardized incidence ratio?
Determine if occurrence of disease in a small population is high or low compared to expected value derived from a larger comparison population
157
___ refers to the probability of detecting a true difference between normal and pathologic groups
Power
158
Equation for power?
1 - b
159
Relationship between variability and power?
Lower the variability, higher the power
160
Relationship between precision and power?
Higher the precision, higher the power
161
Diagnostic test shows that 92% of all skin tissue samples with (+) result were cancerous, while 99% of all skin tissue samples with (-) result were normal – what is best interpretation of these findings?
92% is PPV, 99% is NPV
162
If RR of an outcome in group A as compared to group B is x, then RR in group B compared to group A is …
1/x
163
RR of SCD in low-dose HCTZ group as compared to high-dose HCTZ is 0.4; how much does high-dose HCTZ increase risk of SCD compared to low-dose group?
Increased by factor of 2.5
164
SN of study is 50% (2/4), SP is 98% (394/400) – what is PPV?
25%
165
Ecologic (correlational) studies are based on ___ level
Population
166
___ refers to the mistake of applying population-level information to individual level
Ecologic fallacy … results of ecologic study cannot provide individual-level conclusions
167
___ refers to misclassification of events that results from knowing exposure status of patient in RCT
Observer bias
168
Best way to reduced observer bias?
Blinding
169
Description of Hawthorne Effect?
Patients change behavior when they know they’re being studied
170
Hawthorne Effect poses a threat to the ___ of a study
Internal validity
171
Association between risk factor and disease is more likely to be causal if ...
Strength increases as exposure level increases (dose response relationship) 
172
__Refers to the incidence rate of an event and intervention group divided by the incidence rate of the same event in the control group
Rate Ratio