Comparing 2 Means Flashcards
Hypothesis testing
Basic idea , example you out two opposing things against each other, your favourite team is your research hypothesis vs your enemy team . You want them to be different, you want your team to win ( reject the null)
Null hypothesis
There is no difference between group scores
You expect to find:
1. No difference between group scores ( t-tests, ANOVA families ) u
2. No relationships between variables( regression families p, chi- square
Alternative hypothesis
There is a difference in group scores
Relationship between variables
Hypothesis testing, rockem sockem robots if statistics
How do we determine which robot won?
P values
Cut off scores
But we usually use p values , cut off scores is a different way of doing hypothesis testing
- we want to reject the null hypothesis , this is the success, this is when we can say something is statistically significant
What do we do if we do not have enough evidence to reject the null ?
We say it’s not statistically significant, this doesn’t mean that there is not difference between the groups but we won’t have enough evidence to support rejecting the null hypothesis
P-values ?
The probability of getting that results ( t values , f values , chi square ) if the NULL were true if there is no difference between the groups the NULL is true
- you want the null to be false , so you want the probability of being wrong to be low
Alpha
The probability of type 1 error , alpha does not equal p- value
- alpha set as a criterion for a low type 1 error, it is not same as your p actual found in your experiment
A false positive or type 1 error is …
Is alpha
Here you reject the null although the null is true but we get a small p value thus we think it’s significant and reject the null
Beta
The probability of type 2 error, the opposite of power
Type 2 error , failing to reject the null when you should reject it
( aka your research hypothesis is supported but you missed it )
- this is a false negative
Power
The probability of rejecting the null when you should reject the null
- your hypothesis is supported and you showed that
Normally we use the power to calculate our sample size, how many individuals we need in an experiment to find statistical significance
A power of 80% means that even though there is a real difference in the world, your analysis will only show that 80% of the time
Assumptions
Things that must be true for your test to return an answer that is reasonably correct
- when assumptions aren’t met you do not know what the answer you got means
Effect size
Is a measure of how big an effect was in your experiment
Ex: you might reject the null hypothesis, the experiment worked, but then determine that the group differences of relationships were small
- effect size is seen as a measure of strength of a phenomenon for a technical definition
Cohen’s d
Is one of the most well known calculations for effect size
The general formula gives the standardized distance between the two populations means,
- formula is very adaptive and can be used for many differences between sample and within sample tests
- based on difference between the mean, and pooled standard deviation
Small: 0,02
Medium: 0,05
Large: 0.8
Variance overlap for anova and regression
N^2 for ANOVA
R^2 regression
These stats are based on the amount of variance that you have accounted for by your manipulation ( groups ) or independent variable ( like predictors in regression) out of the total variance
Different values will be used in different analysis
Effect size ( n squared)
There are partial versions of all three of these statistics for when you have more than one IV
Small :0.01
Medium:0.09
Large:0.25
Since the statistic is the proportion of variance over a total, it ranges from 0 to 1 and cannot be negative