Stats Flashcards

1
Q

Berksons bias

A

False observation of negative correlation between two positive traits. Ie only have one. ACTUALLY = UNEQUALLY OBSERVED

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Hawthorne effect

A

Observer effect!
Individuals modify behaviour in response to being observed. BUT OBSERVER EFFECT

Also secondary observer effect ie researchers looking at q responses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Neyman bias

A

Selection bias eg v sick or ill excluded from study!!

Ie died excluded - less severe
Ie recovered exluded - more severe

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Case control

A

Compare groups of cases and controls re exposure factors often nested eg sample of controls
Clear case definition
ODD RATIO = EST RELATIVE RISK rare disease assumption
Ie cases: exp/non exp divided controls: exp/non exp

Care with matching ie over match/factors affecting exposure as it should be equal in groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cohort study

A

Compare exposed and non exposed re incidence of disease
Longitudinal v restrospective/historical

Measure exposure
Match or id confounders for analysis ie counterfactual ideal
Measure outcomes
Analyse
GENERAL PMR
Proportional mortality ratio deaths given cause/total deaths in same time period

SMR - deaths in study pop/expected deaths in general pop. Standardise w age bands/gender

Types of comparison
Internal ie unexposed IN cohort ie exp unexp in cohort of factory workers
External ie similar cohort of factory workers or gen pop. May differ! Gen pop needs to have low rate of exposure!!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cohort study weaknesses

A

Healthy worker effect ie gen pop sicker if used as controls
Loss to FU ie TIME - bias = difference between groups DIFFERNTIAL
Difficulties w long latency or rare diseases. TIME/RESOURCES!
Limited records/info ie on exp in retrospective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cohort studies strengths

A

Longitudinal - clarity re sequence CHAIN OF EVENTS
Calculate INCIDENCE and RISK
RARE EXPOSURE
MULTIPLE EFFECTS
Longitudinal avoid enrolment SELECTION BIAS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Case control advantages

A
RARE DISEASES
Long LATENCY - resources!!
Easier to obtain exp data
DYNAMIC POP
MULTIPLE CAUSES/exp
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Case control disadvantages

A

SELECTION BIAS
Inefficient RARE EXPOSURE
OBSERVER BIAS
ODDS RATIO not relative risk or CAUSATION

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Confounder definition

A

A confounding factor independently determines the risk of disease/outcome studied and is UNEQUALLY distributed between groups ie is correlated with exposure

NOT PART OF CAUSAL CHAIN OF EVENTS
Positive = stronger
Negative = weaker assoc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Sources of bias

A

Unrepresentative of pop/inaccurate info

Selection bias
Information bias
Observer bias
Recall bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Interpretation of associations

A

Bias
Chance
Confounding
Effect modifier - other causative factor part of causal chain of events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Linear regression

A

X axis = independent variable
Y axis = dependent variable ie effect being studied

Y=mx+c
Correlation coefficient +- 1 = strongest association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Distribution

A

Mode = most freq value ie uni bimodal
Positive skew - long upper tail, pulls mean up
Negative skew - long lower tail, pulls mean down

Mean - weighting or geometric ie log, mean, anti log

Median

Range

Percentiles ie 5th = 5% values below
Quartiles ie 25-75th in box plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Standard deviation

A

Dispersion of values about mean
NORMAL DISTRIBUTION 95% values within +-2SDs of mean

Coefficient of variation SD/mean x 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Probability ie confidence intervals

A

Ie of RR or OR
95% confidence that value is located within range
Assume random sample, normal distribution

17
Q

Significance v chance ie P values

A

Test validity of null hypothesis
P value 0.05 / 5%

Two means - chi2/students T test
Non parametric - distribution free

18
Q

Adjustment re confounders

A

Stratify
Direct standardisation - eg weight. same weight ie same group size /denominator. Need all age etc specific rates.
Indirect standardisation ie SMR - different WEIGHTS ie age structure of pop. Only need total deaths and pop size. Useful if small numbers

Stats ie mantel henizel
Regression models
Pooled adjustment for residual confounders

19
Q

Measures of occurence

A

Incidence = new cases/1000/yr
Prevalence ie point/period = cases in given time
Approx incidence x ave duration

20
Q

Sensitivity

A
Sensitivity = True positives/positive CASES ie true positives + false negatives
PPV = true positives/positive RESULTS ie true positives + false positives
Specificity = true negatives/negative cases ie true negatives + false positives
NPV = true negatives/negative results ie true negatives + false negatives