Lectures Notes Flashcards

Question

your 95% CI for a relative risk includes 1.00 - what does this mean?

Answer 1

there is NO difference between groups

Answer 2

no. people with disease / no. people without

Answer 3

odds (exposed) / odds (unexposed)

Answer 4

they'll basically be the same - but for a common event, they can be really different

Answer 5

standard deviation

Answer 6

peak of data is at the left, tail extends to the right. the mean will be greater than the median.

Answer 7

mean = median : symmetrical mean > median : positive skew mean < median : negative skew

Answer 8

peak of data is at the right, tail extends to the left. mean will be less than the median.

Answer 9

median and interquartile range

Answer 10

P(A or B) = P(A) + P(B)

Answer 11

P(A and B) = P(A) x P(B)

Answer 12

``` 0 = it can never happen 1 = it definitely happens ```

Answer 13

estimate of the precision of a sample estimate - a measure of how far from the true population value a sample estimate is likely to be.

Answer 14

SD / sq root of n

Answer 15

square root of: p(1-p) / n (p = sample proportion)

Answer 16

SD of first sample / n for that sample + SD of second sample / n for that sample square root the answer

Answer 17

that your estimate of a population mean is imprecise

Answer 18

that your estimate of a population mean is precise

Answer 19

down - we get a more precise estimate

Answer 20

mean ± (1.96xSE)

Answer 21

if the study were to be repeated 100 times, of the 100 resulting 95% CIs, we would expect 95 of these to include the population mean

Answer 22

We are 95% confident that the true population mean sys. BP lies between 120 and 130, but the best estimate we have is 125.

Answer 23

SD describes the variability of the observations in the sample, whereas SE is a measure of the precision of an estimate of the population mean.

Answer 24

1. state null hypothesis 2. choose a significance level 3. obtain P-value 4. use P-value to decide whether to reject your null hypothesis

Answer 25

the probability of observing your results, or more extreme, if the null hypothesis is true

Answer 26

carry out a statistical significance test, and that generates a test statistic. we then use the test statistic and distribution tables to find the P-value.

Answer 27

observed value - hypothesis value | all divided by standard error

Answer 28

the probability of rejecting the null hypothesis when it is actually false. i.e. probability of concluding that there is a difference, when a difference truly does exist. = 1 - beta (beta = type II error)

Answer 29

same as a false negative - probability of not rejecting the null hypothesis, when it is in fact false.

Answer 30

this is the P-value! | same as false positive - probability of rejecting null hypothesis when it is in fact true

Answer 31

NO. but if it doesn't - don't need the P-value, as it shows that it is statistically significant to the 5% level

Answer 32

a type of statistical test, that assume data are distributed according to a specific distribution (e.g. Normal distribution)

Answer 33

t-test analysis of variance (ANOVA) linear regression techniques

Answer 34

type of statistical test that does not make any assumptions about the shape of the data. used when you can't meet assumptions for parametric test, data is skewed, or there are outliers. useful for data that is skewed, ranked or ordinal. robust to outliers. based of ranks of the data, not the actual data.

Answer 35

paired data = same individuals studied at two different times. independent = data collected from two separate groups.

Answer 36

- that the differences between values are Normally distributed (e.g. difference in PHQ9 score at 0 and 4 months) - that the differences are independent of each other

Answer 37

non-parametric equivalent of the paired t-test! | used when you can't meet the assumptions of the paired t-test.

Answer 38

- analysis of variance (ANOVA) - parametric. | - Kruskal-Wallis test (non-parametric version)

Answer 39

Chi-squared, difference in proportions, or Fisher's exact test

Answer 40

when the sample is large enough. np and n(1-p) should both be greater than 5. ``` n = total no. individuals in both samples. p = proportion of individuals with the condition (regardless of group) ```

Answer 41

- two nominal categorical variables that can form a r x c contingency table - at least 80% of expected cell counts >5 - all expected cell counts >1

Answer 42

should be used for all chi-squared tests on 2x2 tables

Answer 43

when values are too small to do chi-squared!

Answer 44

McNemar's test

Answer 45

data where there are two variables, either categorical or numerical

Answer 46

when you aren't implying an order or causation, just an association

Answer 47

when one variable (Y) is a response to another variable (X) - you could use value of X to predict Y

Answer 48

it's a measure of the linear association between two variables (cannot use to predict one variable from another)

Answer 49

r must be between -1 and +1 +1 = perfect positive linear association. -1 = perfect negative linear association. 0 = no linear relation at all.

Answer 50

plot scatter plot with the X (predictor/explanatory variable) on X axis, and the Y (response) variable going up Y axis. Then finds line of best fit using "least squares" model. equation from that line can be used to predict Y from X.

Answer 51

Y = a + bX ``` a = intercept b = slope ```

Answer 52

form of regression used when there are multiple variables influencing the outcome variable.

Answer 53

1. to identify any explanatory variables that may be associated with the Y variable 2. to investigate extent to which 1+ variables are linearly related to Y, after adjusting for other variables 3. to predict value of Y from X variables

Answer 54

Y = a + then multiple 'b's (coefficients), which you multiple by each corresponding variable (X)

Answer 55

difference in means between intervention and control group, divided by the standard deviation of the outcomes

Answer 56

1. target/anticipated effect size (δ) 2. standard deviation of the outcome data (σ) 3. power (typically 80-90%) 4. significance level (0.05)

Answer 57

goes up. | this is the same as saying type II error decreases

Answer 58

1. how precise should estimate be? e.g. within ±5% 2. probability that estimate is close to the population parameter 3. some idea of the prevalence in the population under study

Answer 59

= true positives / no. people with disease

Answer 60

= true negatives / no. people without disease

Answer 61

given that the patient has the disease, sensitivity is the proportion of times the test is positive

Answer 62

given that the subject doesn't have the disease, specificity is the proportion of times the test will be negative

Answer 63

probability that someone has the disease when the test is positive

Answer 64

true positives / no. positive results

Answer 65

probability that someone is without disease when the test is negative

Answer 66

true negatives / no. negative results

Answer 67

true positives + true negatives / no. people tested

Answer 68

can use the Receiver Operating Characteristic (ROC) curve - plots sensitivity vs 1-specificity for each distinct cut-off value. best cut-off point is the one nearest the top left-hand corner. an ROC curve lying on the 45 degree line is no better than chance!

Answer 69

sensitivity / 1-specificity this is the probability of getting this result, if patient is truly diseased vs if they were healthy. interpret as you would any other ratio!

Answer 70

inverse of LR(+) 1-specificity / sensitivity

Answer 71

a large LR(+) e.g. >10 = test could be useful in ruling IN a diagnosis. small LR(-), close to 0 = test could be useful in ruling OUT a diagnosis

Answer 72

Independent samples t-test

Answer 73

Mann-Whitney U

Answer 74

Mann-Whitney U | or, Chi-squared test for trend

Answer 75

Chi-squared test

Answer 76

Comparison of two proportions OR Chi-Squared

Answer 77

Chi-squared with Yates' correction OR Fishers' exact test

Answer 78

paired t-test

Answer 79

Wilcoxon matched pairs test

Answer 80

Sign test or Wilcoxon matched pairs test

Answer 81

McNemar's test

Answer 82

trick question! consult statistician!

Answer 83

(n1 + n2) - 2

Answer 84

1. two independent groups 2. continuous outcome variable 3. outcome data Normally distributed in both groups 4. outcome data in both groups have similar SDs

Answer 85

1. the differences between pairs are plausibly Normally distributed (not the actual data itself) 2. the differences between pairs are independent of each other

Answer 86

the probability of NOT making a type II error

Answer 87

1. variance of Y is same at each value of X 2. standard deviation of Y is same at each value of X 3. relationship between two variables is linear 4. residuals are Normally distributed for each value of X

Answer 88

1. target/anticipated effect size 2. SD of outcome 3. power 4. significance

Lectures Notes Flashcards

(124 cards)