Epi/Biostats Flashcards

(47 cards)

1
Q

Beta

A

Regression Coefficient, expected (average) change in Y when X (explanatory variable) changes by one unit and the other explanatory variables stay the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Wald Statistic

A

Test whether regression coefficient of a variable is zero

Beta squared over var(B)
p-value = P(chi-squared > Wald statistic)

If p is small, the variable associated with the regression coefficient is important (statistically significant)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Likelihood Ratio Test

A

Test to compare two models: one with q (“null”), the other with p variables with p>q (nested models)

If p < 0.05, the group of p-q variables in the extended model is important (statistically significant)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Type I error

A

Probability of receiving a significance result (rejecting the null) when it is not true - False positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Type II error

A

Probability of failing to reject the null when it should be rejected - false negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sensitivity

A

Probability of positive test given case (A / A+C)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Specificity

A

Probability of a negative test given a non-case (B/B+D)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

PPV

A

Probability that the case is actually a case given that it tested positive (A/A + B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

NPV

A

Negative Predictive Value probability that a non-case is true given a negative test result (D/C+D)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Residual variance in regression equation

A

Error term

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Types of bias

A
  1. Confounding
  2. Selection Bias
  3. Information Bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Propensity Score

A

Probability of a unit being assigned to a particular treatment or exposure given an observed set of covariates.

Used to reduce selection bias by equating groups on these covariates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When to use log-binomial

A

When risk or prevalence is >10% risk odds ratio and prevalence odds ratio will overestimate the prevalence ratio so need to use log-binomial to directly estimate the prevalence ratio or risk ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Risk vs odds

A

Risk = probability of occurrence of an event or outcome
Odds = probability of occurrence of an event or outcome / probability non-occurrence of the event or out come

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

P-value

A

Probability of obtaining results as extreme as those observed under the null hypothesis. Protects from type I error or false positives, which lead us to conclude there is an association that isn’t really there.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

ICC

A

Intraclass correlation coefficient- the degree to which the variance of the cluster explains the variance of the whole. The between individual variance / the total variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Vaccine effectiveness formula

A

(1 - adjusted OR) x 100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Risk ratio formula

A

(a / (a+b)) / (c / (c+d))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Residuals

A

The difference between the observed outcome and the mean in each group

20
Q

Kappa statistic

A

Determines percent of the inter-rater reliability agreement beyond what would be expected by chance

Po - Pc / 1 - Pc

> .8 is an almost perfect level of agreement beyond chance

21
Q

Interaction coefficient

A

Measures how much an association between Y and one predictor (X1) differs across levels of another predictor (X2)

22
Q

Marginal

A

Does not include other covariates in the model

23
Q

Structural model

A

Model for counterfactual outcome

24
Q

Wilcoxon rank sum vs t-test

A

WRS compares medians. T-test compares means. WRS more appropriate for data with outliers.

25
Conditional / random effects model
Include other covariates
26
Frequentist
Parameters are the truth
27
Bayesian
Parameters have a distribution
28
Covariance
Measure of joint probability of two random variables. If both variables are high at the same time covariance is positive. If one is high when the other is low, covariance is negative The sign of covariance thus shows the tendency of the linear relationship
29
Correlation Coefficient
Normalized version of covariance. Shows the magnitude and thus the strength of the linear relationship
30
Survey Weight
Value assigned to each case to indicate how much each case will count in a statistical procedure
31
Problems with survey weights
Almost always increase standard errors
32
Design Effect
Variance from survey / Variance estimate with SRS
33
AIC
Chooses the best model from a set using : -2(log-likelihood) + 2K
34
Variance
Average of the squared differences of observations from the sample mean
35
Normalize distribution
Subtract the mean from each value and divide by the sd (Z = X - u/o)
36
Central Limit Theorem
If you have a population with mean mu and standard deviation sigma and take a sufficiently large sample then the distribution of the means will be approximately normally distributed
37
CI
Probability that a population parameter will fall between a set of values for a certain proportion of times
38
F-statistic
Test if sample variances are equal. P<0.05 means they are not equal
39
Horvitz-Thompson estimator
Inverse probability weighting applied to samples to account for differences between the sample and target population
40
Log-linear model
Betas are the derivative of the log of expected y | derivative x -> (log(E(y|x)) Parameters are linear but the data isn’t B*100 Measures the percentage change in y when x increases by one unit keeping other variables constant
41
Maximum Likelihood Estimation
MLE is a method that will find values of mean, u, and sd, o, that result in the curve that best fit the data
42
Bayesian Inference
The process of deducing properties about a population or probability distribution from data using Bayes theorem: P(A|B) = P(B|A)*P(A)/ P(B)
43
Poisson Offset
Population or person-time
44
ANCOVA
Analysis of Covariance - used to test for interaction or effect measure modification -> whether means of a dependent variable are equal across levels of a categorical independent variable Generate an interaction term for the model Test if the interaction term = 0 If the interaction term is not significant, reduce to MLR/simpler model
45
Correlation vs Covariance
Two terms that are opposed but related. Correlation shows how two variables are related, covariance shows how two variables differ
46
Attributes of Surveillance System
Simplicity Flexibility Acceptability Sensitivity Positive Predictive Value Representativeness Timeliness
47
Sources of measurement error in surveys
1. The tool 2. The method of data collection 3. The interviewer 4. The respondent