stats Flashcards

(51 cards)

1
Q

Mean greater than median, which skew?

A

positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Median greater than mean, which skew?

A

Negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

SD formula?

A

Square root: Sum of (x1-xmean)squared /n-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

…% data is within 1SD of the mean?

…% data is within 2SD of the mean?

A

68% (32 outside)

95% (5 outside)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Equation for the Z value?

A

Z=x1-xmean/SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Confidence Interval equation?

A

Mean + or - (1.96x SE)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

SE equation?

A

SD/ square root of n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SE of the difference between two means equation?

A

SE= square root [(SD^2/n1) + (SD^2/n2)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

SE of proportions equation?

A

SE= square root [p(1-p)/n ]

where P is the risk of outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

SE of difference between proportions equation?

A

SE= square root [p(1-p)/n + p(1-p)/n2]

where P is the risk of outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Statistically significant if..?

A

P value less than 0.05 (or 0.01 if that’s the cut off)

CI doesn’t include zero (or the value specified in the null hypothesis e.g. if its a risk ratio not 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

false negative rate=

false positive rate=

A

false negative rate(type II)=1- sensitivity (Neg and seN)

false positive rate (type 1)=1-specificity (Positive and sPec)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Normal or reference range?

A

the 95% CI range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Standard deviation vs SE use?

A

SD- describes variability of the data/ spread

SE Measure if the precision of an estimate, used to form a CI, which can infer significance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

If binary independent variable and continuous dependent variable?

A

T-test, paired or unpaired according to whether it is the same individual or different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If paired binary independent variable and continuous dependent variable but not normally distributed?

A

Wilcoxon matched pairs test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

If paired samples but the data is ordinal dependent variable, and binary independent variable?

A

Wilcoxon matched pairs test/signed rank

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

If paired samples but both data is binary?

A

Mcnemars test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

If unpaired binary independent variable and continuous dependent variable but not normally distributed?

A

Mann- whitney U test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

If multiple unpaired categorical independent variable and continuous dependent variable?

A

ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Two unordered categorical variables (nominal or binary)?

A

Chi squared test

e.g. outcome or not is binary- makes a contingency table of rates etc

22
Q

Chi squared test requirements?

A

Unordered categorical data
Can form into a contingency table
80% expected n’s above 5
all expected counts over 1.

23
Q

If 3+ ordered categorical variables and a nominal/ binary independent outcome data?

A

Chi squared test for trend (e.g. continancy table lots of ordered exposures e.g. in dose response)

24
Q

If two continuous variables normally distributed?

A

Regression/pearsons correlation coefficient.

25
If two continuous variables not normally distributed?
Spearmans correlation coefficient (or ordinal X variable0
26
If continuous independent variable but binary dependent variable?
Logistic regression
27
if more than 2 independent variables but continuous dependent?
multiple linear regression
28
If pared sample- degrees of freedom?
n-1
29
If pared t-test sample- degrees of freedom?
n-1
30
if non paired t-test sample- degrees of freedom?
n-2
31
Chi squared degrees of freedom?
number of categories-1.
32
test options for continuous data?
Outcome continuous:paired t-tests, Wilcoxon (non paramtric) or independent t-tests and Mann-whitney if non parametric. ANOVA if multiple exposures. both: Regression/pearsons correlation. Spearmans rank if not normal. Multiple linear regression if multiple. Continuous exposure data: Logistic regression (binary outcome) or spearmans rank (ordinal)
33
Test options for categorical data?
Two unordered: Chi squared Ordered outcome: chi squared test for trend. Paired proportions: McNemar
34
Multiple linear regression equation?
Y=b0 (intercept) + (b1x1) +(b2+ x2)…. where b1=the regression of the first variable and X1 is the subjects exposure i.e. (Male=0 and female=1) with r=0.2 if female = (1x0.2)
35
R squared is..
The variation of the dependent variable that is explained by the independent variable. 1-r2 = the unexplained variation E.g. r2=0.66 then 66% of variance is explained
36
Regression assumptions?
X does not need to be normally distributed, but the residuals (deviations from the regression line) need to be normally distributed for the X values. Variance the same for X and Y Linear relationship
37
specificity is=1- | Sensitivity is=1-
Sensitivity= 1-proportion of false negatives (HAVE DISEASE) | specificity=1-proportion of false positives (WITHOUT THE DISEASE)
38
Likelihood ratio positive equation? WHat is it?
sensitivity/1-spec | Likelihood of positive result with the disease vs without
39
Likelihood ratio negative equation? WHat is it?
1-sens/spec | Likelihood of negative result with the disease vs without
40
standardised effect size calculation?
Mean of treatment- mean of control/ SD, where 1 means 1SD away
41
What determines sample size? (4)
1. Target anticipated effect size/ difference 2. The variability/ SD of the outcome data 3. Power (typically 80%) 4. The significance level (0.05)
42
sample size increases with? (4)
Binary outcomes, large SD, lower signifincance level, higher power.
43
what is needed for a sample size calculation for a normally distributed RCT? (4)
1. Type 1 error 2. Type 11 error 3. SD of outcome 4. Size of difference to detect
44
what is needed for a sample size calculation for a binary RCT? (4)
1. type 1 error 2. Type 2 error 3. anticipated % with the outcome on new and standard treatment (read difference off table) 4. size of difference want to be able to detect. (SD not needed)
45
what is needed for a sample size calculation for estimating the prevalence of a disease in population (survey)? (4)
1. Estimate of the disease prevalence 2. The confidence level (e.g. 5% either side) 3. How precise the estimate should be. 4. Estimated response to survey
46
If it says the trials results are unadjusted which test to use?
Chi squared, if is adjusted logistic regression
47
what is needed for a sample size calculation for a cluster RCT? (4)
1. Average no. of patients per cluster 2. Risk of type I error 3. Size of difference to detect 4. Intracluster coefficient (not correlation coefficient)
48
ROC curve plot?
1-spec on X axis, sensitivity on y
49
LR (+) if high or low use?
If low useful for ruling out a disease, if high useful for ruling in a disease
50
T test T statistic equation?
Paired: mean difference/SE=t (n-1) Unpaired: Mean X1-Mean X2/SE=t (n-2)
51
CI for relaive risks?
SE= square root [1/a-1/a+c + 1/b -1/b+d]