Stats 2 Flashcards
(50 cards)
What is the null and study hypothesis’s ?
Null= no dif between exposed and unexposed Study = there is a dif between exposed and unexposed
What is a p-value?
The chance that what you saw happened by chance
P= 0.02 means probability it happened by chance is 2 in 100
3 steps to hypothesis testing
1) specify the study hypothesis and the null hypothesis
2) assume the null hypothesis is true
3) calculate the p-value- if low (<0.05) reject null hypothesis
What is a type 1 error?
False positive -reject the null hypothesis even though it is actually true
What is a type 2 error?
False negative - fail to reject null hypothesis even though it is actually false.
How do you calculate the T statistic?
Observed mean difference/
Standard error of difference between means
What assumptions does the t-test make?
The outcome is continuous and normal distribution
And variance in the two groups is equal
What does the levene’s test measure?
That variance between two groups is equal (one of the assumptions needed for the t-test)
In spss what is the p-value called?
Sig
Chi- squared tests are used to calculate p-values from what kind of data?
Categorical data
T- tests are used to calculate the p-value from what kind of data?
Continuous data
What is the student t-test?
Independent samples t-test
Used when u want to compare two groups of continuous, normally distributed variables
(Assumes that variance/scatter in two groups are similar)
How does the t-test work?
Uses the observed difference in sample means and the standard error (sampling error) for the dif in means to calculate the p value
What is this t-statistic?
Difference in sample means/ standard error of the dif in means
What would result in a small t-statistic and what does it mean?
Either small difference in means or a large SE.
Means probability that the observed dif happened by chance.= large = large p-value
What is a parametric test? Name one.
Only valid when the data is normally distributed and 2 populations have equal variance
Example= t-test
What are the negatives and positives of NON- parametric tests?
Pos: makes no assumptions of underlying distribution of data.
Neg: less powerful than parametric test- difficult to get confidence intervals
How to transform moderately positively skewed data?
Logarithm of each number
How do you transform strongly positively skewed data?
Reciprocal (l/x)
How do you transform weakly positively skewed data?
Square root
How do you transform moderately negatively skewed data?
Square
How do you transform strongly negatively skewed data?
Cube x3
How do you transform data with unequal variation
Log, reciprocal or square root
How to describe skewed variables if data not normally distributed?
1) present medians
2) present interquartile range
3) difference between two medians (CI not easy)