Stats Flashcards

(45 cards)

1
Q

Count

A

Cannot be compared b/c they arise from populations of different sizes
Use when important to public health or to allocate resources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Ratio

A

Shows relative size of 2 values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Proportion

A

Numerator is subset of denominator
Dimensionless
Between 0 and 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Rate

A

a/a+b (can be proportion, always a ratio) over an amount of time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Incidence

A

Frequency of the occurrence of new cases over a specified period of time
Measures appearance of disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cumulative incidence

A

Risk of probability of an individual getting a disease
Proportion: # of new cases of disease/# at risk at beginning of follow up or over a specified time period
Fixed populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Incidence rate

A
# of new cases/sum of disease-free person-time over specified time period
Takes into account population differences in periods of follow up
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Person-time at risk

A

Sum of disease-free time in population

  1. Add individual risk periods (exact)
  2. Use average number of people multiplied by study duration
  3. Use average duration per person
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Prevalence

A

Proportion of people in a population w/ the disease at a specified point in time
Measures existing disease
Describes health burden

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Point prevalence

A

Proportion: # of existing cases/total population at a specified point in time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Period prevalence

A

Proportion: (# of existing cases + # of cases that occur during the interval)/population at midpoint of interval or avg population size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Prevalence-Incidence relationship

A

Prevalence depends on incidence and disease duration
P = ID
If a disease is of short duration, I ~ P
If a disease is chronic, P > I
Prefer incidence b/c interested in etiology and you don’t want to vary too many factors at the same time (birth defect problem)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Binary data

A

One of two answers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Nominal data

A

Categorical data w/ no order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Ordinal data

A

Categorical data w/ order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Continuous data

A

Data measured continuously or on integer scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Frequency distribution

A

Means of describing categorical data

Must add up to 100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Mean

A

Average

Limitations: sensitive to extreme values, not ideal for skewed data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Median

20
Q

Mode

21
Q

Variance

A

Average of square of deviations about the sample mean

S^2 = (sum(xk -xbar)^2)/(n-1)

22
Q

Negative skew

A

Number of outlying values on low end (hump is on right)

23
Q

Positive skew

A

Number of outlying values on high end (hump is on left)

24
Q

Standard deviation

A

Square root of variance

Std = sqrt((sum(xk -xbar)^2)/(n-1))

25
Normal distribution
Theoretical probability distribution that is symmetric about its mean and is "bell" shaped Mean = Median = Mode
26
Standard normal distribution
Specific distribution with mean of 1 and Std of 1 68% of data w/in 1 std 95% of data w/in 2 std 99.7% of data w/in 3 std
27
Shapiro-Welk test
Null hypothesis = data are normally distributed | p < 0.05 means data are NOT normally distributed, reject null hypothesis
28
Screening
Presumptive identification of unrecognized disease or condition by application of tests, examination, or other procedures Attempts to classify asymptompatic people as likely or unlikely to have disease Goal is to delay onset of symptoms and prolong survival Only done for healthy people
29
Primary Prevention
prevent disease before it starts
30
Secondary Prevention
delay symptoms
31
Tertiary Prevention
slow disease progression
32
Lead Time
duration of time by which diagnosis is advanced as a result of screening
33
Validity
Does the test measure what it's supposed to measure? | Bullseye
34
Internal validity
Does the test measure what it's supposed to measure?
35
External validity
Generalizability, how well does the result generalize to the population?
36
Reliability
Does the test give the same result over and over?
37
Sensitivity
Sensitivity = a / a + c Number of people who screen positive over number of people who actually have the disease Increase to prevent disease transmission Sensitivity + FN = 1
38
Specificity
Specificity = d / b + d Number of people who screen negative amongst those who don't have disease Increase for fatal disease w/ no treatment Specificity + FP = 1
39
True positive
Individuals who test positive and have disease
40
True negative
Individuals who test negative and don't have disease
41
False positive
Individuals who test positive and don't have disease Increased w/ increasing sensitivity FP = b / b + d Specificity + FP = 1
42
False negative
Individuals who test negative and half disease Increased w/ increasing specificity FN = c / a + c Sensitivity + FN = 1
43
Overall Accuracy
Assesses proportion of true test results among all test results Overall accuracy = A + D / A + B + C + D = TP + TN / TP + FP + TN + FN
44
Positive predictive value
Number of people w/ true disease who tested positive divided by number of people who tested positive Likelihood of having true disease if you test positive PPV = a / a + b
45
Negative predictive value
Number of truly non-diseased people who tested negative divided by number of people who tested negative Likelihood of not having disease if you test negative NPV = d / c + d