Quiz 3 Flashcards
Quiz 3 for Stats I (85 cards)
Flip side of power ( __%) is ___ (__%)
80%, type II error / beta (20%)
directional hypothesis is to _____ as non-directional is to ____
one tailed, two tailed
What is the mean of the distribution of a z statistic when H0 (null) is true
mean of zero
When null is false, H1 is ____
true
Power
- The greater the number of subjects in a sample, the more power (generally)
- Though the incremental increase in power becomes smaller as the sample size increases
- Power analysis is a statistical method to calculate the sample size needed to achieve desired power
Influences on Power
- Significance level (alpha), more lenient alpha = more power
- One-tailed vs two tailed test (one tailed has more power)
- As standard error decreases, there is more power
more lenient alpha = _____ power
more
which has more power: a one-tailed t-test or a two tailed
one tailed
As standard error decreases, power ________
increases
why is there greater likelihood for us to reject the null hypothesis when there is less standard error?
there is less overlap between the graphs of the null and alternative distributions
One-tailed test
Tests a directional hypothesis (specific direction of an effect)
E.g. change > 0
Two-tailed test
Test a non-directional hypothesis (doesn’t specify the direction of an effect
E.g. Change ≠ 0
Why does a one-tailed test provide us with greater power?
One tailed test provides us with greater power because the critical value is smaller than the cutoff for a two-tailed test
Researchers tend to use a two tailed test in order to ______.
discriminate between a zero effect and a negative effect
How are one-tailed tests often misused?
One-tailed tests are often misused in a way which increases type 1 error rate.
If people incorrectly reject the null after there is the illusion of an effect at the wrong side of the curve for a directional hypothesis.
Effect Sizes
- A standardized measure of the size of an effect:
- Standardized =
comparable across
studies- Not (as) reliant on the sample size - Allows people to objectively evaluate the size of the observed effect
- Amount that two
populations do not
overlap
- Amount that two
E.g. cohen’s d, correlation
Types of effect sizes
The family of effect size measures has been categorized into TWO broad groups:
Measures of mean differences (e.g. Glass’s Delta, Cohen’s d)
Measures of strength of relations (e.g. r, R^2, eta squared)
Two groups of measurement of effect sizes
- Measures of Mean differences
- Measures of strength relations
Measures of mean differences examples
Glass’s Delta, Cohen’s d, Hedges g
Measures of strength of relation examples
r, R^2, eta squared
Measures of Mean Differences
Largely calculated in the same manner
E.g. d = mean1/mean2 / population SD
We don’t know the population SD,
Differ in how they estimate the population SD
Glass’s delta
- uses the SD using the control group because it is “untainted” by the treatment
- Strength of this metric depends on the size of the control group, the larger the control group, the more appropriate the SD as it estimates the population SD
Cohen’s d
- combines the SD of both groups
- Because cohen’s d pulls information from sd of both groups, when the sd’s across groups are very different, it makes more sense to use glass’s delta
Hedges g
- Multiply cohen’s d with some scaler
- Cohen’s d tends to overestimate the amplitude of effect in smaller samples
- Corrects for cohen’s d bias, so better to use in smaller samples (20 or less)