Statistical tests Flashcards
(10 cards)
What tests do we use for normally distributed univariate analysis of two independent samples of quantitative variables?
Pooled (two sample) t test or Welch’s t test (no need for equal variance but normality). Both compare means.
pooled is same as outcome ~ treatment
If we have a balanced design (equal sample sizes in the two groups) then the pooled t approach and the Welch’s t approach yield essentially the same results
What tests do we use for a univariate analyse of two independent samples of quantitative variables if we can’t assume normality?
- Bootstrap mean difference (compares means)
- Wilcoxon-Mann Whitney rank-sum test (no equal variance/normality, estimated median of difference)
What are characteristics of a paired sample?
Every value is connected to a corresponding value for the same subject
Calculate the difference for each subject, so long as there aren’t any missing data
What paired sample test do you use if normally distributed?
Paired t-test (one sample t test)
What paired sample test do you use if not nomally distributed?
- Bootstrap population mean (Hmisc::smean.cl.boot)
What test do you use for analysis of two or more means when normally distributed with equal variances?
Analysis of variance (ANOVA)
test is fairly robust to violations of the Normality assumption
and variance is less of an issue in a balanced design
What test do you use for analysis of two or more means if difference in variance or skew?
- identify pairwise differences while correcting for multiple comparisons - Bonferroni or Tukey’s Honestly Significant Difference (HSD)
- severely skewed data - use Kruskal-Wallis test
What tests are used to test association between categorical variables
Pearson chi-square and Fisher’s Exact
compare observed cell counts to what would be expected if variables were independent.
What assumptions must hold true for Pearson chi-squared test?
- Assumes that the expected frequency will be at least 5 (and ideally 10) in each cell
- Cochrane conditions: no cells with zero counts and at least 80% of the cells in our table with expected counts of 5 or higher
- If assumptions not met then collapse categories
- If can’t collapse then just describe data without a statistical comparison
When should we use Fisher’s exact test instead of Pearson squared test?
- When sample size is small
- expected cell count < 5
- often for 2x2 tables
- No assumptions about sample size
exact probability of getting table as extreme as observed if independent