Stats Flashcards
(56 cards)
Null hypothesis (H0)
No difference between the two groups (default hypothesis)
Alpha (type I) error
Incorrect rejection of the null hypothesis - conclude there is a difference when there is no true difference
Beta (type II) error
Incorrect acceptance of the null hypothesis - conclude there is no difference when there is
Standard deviation
Measure of the variability of the data from the mean, equal to the square root of the variance
Standard error
Measures the variability of means when many similar samples were taken from the population of possible measurements - how close the sample mean is likely to be to the true population mean
Power
The ability to detect a significant result where a true difference exists (the probability of rejecting the null hypothesis when the alternative hypothesis is true)
Accuracy
How close a value is to the true value
Precision
How consistent results are when measurements are repeated
Validity
Suitability of the experimental method to address the aim of the experiment
Internal validity
If outcome is related to aim
External validity
If results are applicable to the wider population
Odds ratio
How much more likely one group is to be in an outcome group than another
What is the alpha (type I) error also known as?
Specificity
What are alpha (type I) errors directly related to (statistically)?
P value
What are beta (type II) errors also known as?
Sensitivity
What is the equation for calculating power (in simple terms)?
1-beta error
What power value do we typically want an experiment to have?
Between 0.8 and 0.9
What does power relate to (statistically)?
Sample size and effect size. If sample size is large enough, even a tiny effect size may be statistically significant (but not necessarily clinically or functionally significant)
When may precision and accuracy become uncoupled?
Systematic error
When is mean vs median usually used?
Mean - when data are normally distributed with no major outliers
Median - when data are skewed or non-normally distributed
If the median of data is used, what should be used to measure spread?
Interquartile range
If the mean is used, what should be used to measure spread?
Standard deviation - spread of direct data
Standard error - where the real mean may be based on the population
X% confidence intervals - the range containing the population mean X% of the time based on the data collected
How can we test for normality?
Visually of q-q plots
Using tests such as the Shapiro-Wilk test
If data are normally distributed, what type of test is used?
Parametric