General Biostats Flashcards

(93 cards)

1
Q

Sensitivity is best described as:

A

True Positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

False negative probability can be calculated from sensitivity by:

A

1-sensitivity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Specificity is best described as:

A

True negative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The false positive probability can be calculated by

A

1-specificity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define Recall Bias

A

Cases are more likely to recall exposures than controls.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

relative risk

A

RR = experimental event rate / control event rate

A/(A+B) / C/(C+D)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Regression to the mean

A

Tendency for extreme values of a variable to fall closer to the group mean when retested

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Hawthorne Effect

A

The tendency of subjects to act differently because they are being studied.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why use person years?

A

Accounts for variable follow up periods and number under observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

___ percent of the population falls within 1 standard deviation on the curve.

A

68%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

2 standard deviations contain

A

95% of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

3 standard deviations?

A

1%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

chi-square test

A
  • analyze categorical data
  • no cell has less than 1
  • no more than 20% of cells have less than five
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

fischer’s exact test

A

same as for chi-sq but w/ SMALL sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

McNemars test

A
  • paired dichotomous data without fractions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

student t test

A
  • independent continuous variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Differential misclassification is___

A

Not random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Differential misclassification cause an ____of an association.

A

overestimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Non-differential classification is ____

A

random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Non-differential misclassification causes an ___ of an association.

A

under estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Selection bias

A

systematic errors in the way subjects are included in a study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Odds ratio is good for _____ conditions.

A

Rare

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

With rare conditions, the OR approaches__

A

RR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

The lower the confidence, the ___ the confidence interval.

A

narrower

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Increasing the sample size ____ the confidence interval
Decreases
26
The probability that two independent events will occur is:
p(A) + p(B) - p(A+B)
27
Case control studies are best suited for ___ conditions.
Rare
28
Length bias
Rapidly progressing cancer are less likely to be detected when asymptomatic.
29
Lead time bias
screening gives the appearance that the person diagnosed earlier survived longer.  Even when survival is to same age
30
Information bias
systematic difference in the way data is collected
31
Surveillance bias
over detection of disease because one population is monitor more closely
32
Positive predictive value
Measures how often the test is right when positive or negative
33
Positive predictive value 
TP/(TP+FP)
34
Negative predictive value
TN/(TN+FN)
35
You cannot calculate predictive value of a test without__
prevalence
36
Sensitivity and specificity ___ depend on prevalence
do not
37
Decreasing prevalence causes _____ in positive predictive value
decrease
38
Decreasing prevalence causes ____in negative predictive value
increase
39
Correlation studies
compare disease frequencies among entire populations
40
Attributable risk
Ie - Iue
41
Attributable risk percentage
(Ie-Iue)/Ie
42
Population attributable risk
Attributable risk X Prevalence
43
Intent to treat analysis
The inclusion of all subjects randomized to drug and placebo.
44
Cross-sectional study
study of a population at one point in time
45
Prevalence
of cases/population at risk
46
categorized data
ie. ill vs not ill
47
Power ___ not important with significant results.
is not
48
secondary attack rate
of new cases among those exposed to index case/# of all those exposed to index case
49
vaccine efficacy
(Iuv - Iv)/Iuv
50
In decision analysis, utilities refers to
the relative values placed on outcomes by patients
51
A method of controlling for confounding after data collection is
stratification and multivariate analysis
52
A bar chart is useful for
depicting the frequency of nominal or ordinal data
53
A pie chart is useful for 
depicting frequency of categorical data
54
A frequency polygon is useful for
illustrating frequency distributions for discrete or continuous data. Overlapping. 
55
Line graphs are useful for
presenting continuous data over time. 
56
A curve skewed to the left has a mean ___ than the median
Less, closer to the tail,or to the left. 
57
A curve skewed to the right has a mean ____to the median
Right, or greater than the median. Again, closer to the tail. 
58
Curves are skewed to the 
tail
59
Is there a test 100% sensitive or specific?
No.
60
The x axis is Receiver-operator curves is
1-specificity.
61
If alpha is set to .10, then the confidence interval is 
90%
62
Linear regression examines
the association between two continuous variables. 
63
Multiple regression is used to 
examine the relationship between multiple dependent variables and the independent variable
64
Logistic regression is used 
when the dependent variable is is dichotomous.
65
Correlation analysis is used
to determine a linear relationship between continuous variables in correlation studies of populations.
66
Which gives a better picture of an intervention's impact: ARR or RRR? Why?
AR. RR depends on mortality in control group. Low mortality in a control group means little improvement with intervention, even if RRR is large. 
67
Numbers needed to treat =  
1/ARR
68
If the crude and adjusted rates are the same, the stratification variable is
not a confounder.
69
The Kappa statistic is used 
to assess interrater/intrarater reliability.
70
Wilcoxon rank sum tests, Wilcoxon signed rank test, and signed tests are used to 
study non normally distributed sample sizes
71
Kaplan-Meier Survival Curves
Estimates the probability of survival each time an event occurs. 
72
Advantage of Kaplan Meier curve:
Takes into account "censored data" which occurs if a patient withdraws before the outcome is reached. 
73
Log rank test
Compares the survival distributions of two samples. Non parametric.
74
Hazard ratio
Risk of an event an instantaneous point in time between two groups. 
75
Cox proportional hazards model
Multivariate survival analysis used in studies in which participants are followed for unequal amounts of time.
76
Cox proportional hazards model relies on:
the assumption that the proportional effect of study factors does not change over time
77
Why are randomized clustered samples not as good as individual randomization?
Similarities of individuals in the clusters reduces variability, there by reducing the power.
78
Clustered data results when:
preexisting group structure is used to select participants, but the researcher is interested in individual level data
79
intercluster correlation coeffiient (rho) is a measure of
Relatedness of clustered data, can range from 0-1, 1 is all the same.
80
Nominal Data
No intrinsic order. Ie Race.
81
Sensitivity
Positive in disease
82
Specificity
Negative in health.
83
Ordinal
Have an order, but no set numerical relationship between values. "Worse, same.."
84
Interval
Ordered with numerical units, but no actual zero. Dividing them does not make sense. Ie one date of birth cannot be twice another date of birth.
85
Ratio scales
Order, numerical, and a real zero. Can divide them. Kelvin.
86
Sensitivity
Chances of test being positive in those with disease.
87
Specificity
Chances of test being negative in those without disease.
88
Calculate TP:
Sensitivity x prevalence
89
Calculate TN
Specificity x (1-prevalence)
90
Calculate false positive
(1-specificity)x(1-prevalence)
91
Calculate false negative
(1-sensitivity)x prevalence
92
In a normal distribution, the mean, median, and mode are?
The same
93
What happens to PPV as prevalence increases.
It increases.