Key Concepts Flashcards

Question

What features define a frequentist approach to statistical inference?

Answer 1

1. Using p-values, confidence intervals, maximum likelihood 2. Inference is based on the observed data. 3. Make probability statements about the data, given the value of a parameter: “The probability of observing data as extreme as this, given there is no treatment effect is 3%.” 4. Different people will get the same results applying the same analysis to the same data.

Answer 2

1. Credible intervals, priors, posterior probability 2. Incorporates prior beliefs into statistical inference 3. Allows probability statements about parameters, given the data (and prior beliefs) e.g. “Given the data we have observed, there is a 97% chance the treatment is effective” 4. Different people will get different results depending on their prior beliefs

Answer 3

The sample size is large enough and the strength of prior beliefs weak.

Answer 4

1. Confidence interval 2. p-value:

Answer 5

An interval of uncertainty around an estimate for a parameter

Answer 6

Intervals that, under repeated sampling, would contain the true value alpha percent of the time .

Answer 7

95% confidence intervals

Answer 8

The standard deviation of an estimate’s sampling distribution

Answer 9

Divide standard deviation by the square root of the sample size

Answer 10

Get smaller as n increases

Answer 11

False The standard deviation does not change systematically with sample size

Answer 12

95% 𝐶𝐼=𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒 ±1.96×𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑟𝑜𝑟 E.g. for a mean 95% 𝐶𝐼=𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒 ±1.96 (𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛)/√𝑛

Answer 13

t-distribution This leads to a different multiplier for the standard error to 1.96, usually fairly close to 2

Answer 14

The probability of observing the data, or data more extreme given the parameter of interest takes a given value.

Answer 15

The value the parameter is set to take Typically the null hypothesis is for no effect or association.

Answer 16

for parameters to be equal to zero

Answer 17

Statistical test of hypothesis

Answer 18

A statistical test of hypothesis consists of five parts 1 . The null hypothesis, denoted by H0 2. The alternative hypothesis, denoted by H1 1. One tailed: H1 d: parameter > H0 2. Two tailed: H1 : parameter ≠ H0 Two tailed p-values are almost always used 3. The p-value 4. A significance threshold (0.05)

Answer 19

If the p-value is below the significance threshold we reject the null hypothesis and conclude that the alternative hypothesis is true

Answer 20

This does not mean the null hypothesis is true A non-significant p-values tells us we do not know much

Answer 21

There is evidence that there is a difference: If there was no difference we’d have been unlikely to see the data we did.

Answer 22

There is insufficient evidence to conclude there is a difference. If there was no difference our results would not be unexpected.But we cannot rule out a difference. If a p-value is not statistically significant we cannot conclude that there is no difference.

Answer 23

Type 1 error (α) Type 2 error (β)

Answer 24

Falsely conclude there is a difference Controlled with significance threshold If the significance threshold is 0.05 we expect a type 1 error rate of 5%

Answer 25

Fail to conclude that the there is evidence for a difference when there is a true difference Sample size, magnitude of true difference, and variability of data effect type 2 error rates Power = 1 - β Power is the probability of concluding there is a difference, when true. Low powered test: Unlikely to be significant even if there is a difference

Answer 26

Whether the p-value is statistically significant at the α level. i.e. given a 95% confidence interval you can tell if the p-value would be significant at the 5% level

Answer 27

For example if the null hypothesis is 0: 95% CI of -1.1 to -0.1 would correspond to a statistically significant result -1.1 to 0.1 would correspond to a result that was not statistically significant.

Answer 28

Multiple testing & p-hacking

Answer 29

-Selective reporting enhances the problem, eg: Only report significant results and ignore non-significant results - Place more emphasis on significant results -Selective reporting can occur at the study level: studies with non-significant findings are less likely to be published

Answer 30

1. Bonferroni correction: divide significance threshold by number of tests - This can be conservative - Leads to larger sample sizes being required 2. Pre-specification of outcomes, analysis methods, and studies - Can specify primary outcome – stops emphasis being shifted to significant results - Makes visible the number of tests conducted - Compulsory in randomised controlled trials e.g. All trials campaign http://www.alltrials.net/ - Harder to do in more exploratory studies

Answer 31

1. Can be manipulated with multiple testing 2. They are often misinterpreted People often interpret p > 0.05 as meaning “no effect” 3. Over reliance on significance thresholds p = 0.04 given wildly different interpretation to p = 0.06 4. Bayesian argument: p-value tells us probability of observing the data given no effect What we want to know is probability of an effect. This can only be achieved with Bayesian inference.

Key Concepts Flashcards

(57 cards)