significance and power Flashcards

(24 cards)

1
Q

what are the principles of null hypothesis significance testing (NHST)?

A

assume H0 is true - fit a model to get data, get a test statistic - calculate the probability of getting test statistic, assuming H0 is true (p)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

how do you get the test statistic in NHST?

A

comparing amount of “signal” to “noise”

or “systematic variation” to “unsystematic variation”

or “effect” to “error”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how can NHST be mised?

A

p-value not measuring probability to getting results by chance or that a specific hypothesis is true

statistical significance is not the same as practical importance

p-value alone is not a good measure of evidence regarding a model of hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

when do you get a type I error?

A

experiment result - H1 true

reality - H0 true

α

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

when do you get a type II error?

A

experiment result - H0 true

reality - H1 true

β

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is power?

A

probability of finding an effect assuming one exists in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

how do you calculate power?

A

1 - β

1 = absolute certainty
β = usually how much type II error you are happy to accept, probability of not finding effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is β typically?

A

0.2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what factors affect power?

A

effect size

number of participants

alpha level

variability

design

test choice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is effect size?

A

objective and standardised measure of magnitude of an effect

larger value = bigger effect size

can help to know how many participants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are is the name of the effect size for the different tests?

A

Cohen’s d = t-test

Pearson’s r = correlation

Partial eta squared = ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how does a larger number of participants affect power?

A

more “signal”, less “noise”

more powerful study

more population have, less remove for sample error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how should choose the effect size?

A

depending on expected effect size

larger effect size = fewer participants needed to get “real” effect

smaller effect size = more participants needed to detect “real” effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is alpha level?

A

probability of obtaining a type I error

compare p value to this criteria when testing significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

when should an alpha level be chosen?

A

before running study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is the general choice for alpha level?

17
Q

when are results statistically significant?

A

if p-value < α

18
Q

what are the problems with alpha level?

A

balance of type I vs type II error

if run multiple tests, will increase rate at which might be get type I error - Familywise experimental error rate

can account for this by limiting number of tests or by using corrections such as Bonferroni correlation but this reduces power

19
Q

how does design affect power?

A

within-subjects more powerful than between-subjects studies

design depends on type of study

20
Q

what is a one-tailed test?

A

hypothesise will be difference in scores

specific about which score will be higher

α = .05 at one end

21
Q

what is a two-tailed test?

A

hypothesise will be difference in scores

could be either direction

α = .025 at both ends

22
Q

why does p-value change between one and two tailed test?

A

two-tailed hypothesis tries to assess in both directions

23
Q

how does type of test affect power?

A

one-tailed test more powerful as α higher

several caveats and considerations

most recommended that run two-tailed tests

24
Q

why does power and the factors that affect power matter?

A

want to calculate power obtained in study post-hoc

want to calculate how many participants we need to collect for a study a priori (can be done using statistical programs like G* power)