Stats Flashcards
(15 cards)
SD & ND
Mean (average)
Sum of all data points divided by the number of data points
SD & ND
Variance
How far each number in the data set is from the mean. Subtract the “x” value from the mean, square the output, then divide by n.
(x - Mean)^2 / n
SD & ND
Standard deviation
Standard deviation is simply the square root of the variance and gives an idea of how spread out the values in a data set are around the mean.
SD & ND
Normal distribution
- A probability distribution that is symmetric around the mean.
- In a graph, it looks like a “bell curve.”
- The mean, median, and mode of a normal distribution are equal.
SD & ND
Key features of normal distribution
- 68% of data falls within one standard deviation of the mean (μ ± σ)
- 95% of data falls within two standard deviations (μ ± 2σ)
- 99.7% of data falls within three standard deviations (μ ± 3σ)
SD & ND
Z Score
Tells you how many standard deviations it is away from the mean:
Z = (x − μ) / σ
SD & ND
A smaller standard deviation indicates…
Data is more tightly clustered around the mean/is more consistent
SD & ND
A larger standard deviation indicates…
Greater variance in the data set
SD & ND
How does the data change if you change the mean?
If you add a constant to every data point, the mean would shift by that constant, but the standard deviation would stay the same
How does the data change if you change the standard deviation?
Multiplying each score by a constant would scale the standard deviation by the absolute value of that constant. E.g., if you double every score, the new standard deviation would be 2 x SD = 2SD. So, e.g., if SD = 1.41, then 2 x 1.41 = 2.82
Is it possible to make predictions about other percentiles?
No
Box and whisker plot
Middle line in the box represents the median, or middle, of the dataset. The outsides of the box are the medians of the data below and above the median, respectively, which mark the first and third quartile boundaries, or Q1 and Q3
Interquartile range
Distance between Q1 (median of the first half of the dataset) and Q3 (median of the second half of the dataset)
T/F: Normal distributions are always centered on and symmetrical around the mean
True
For any evenly spaced set..
The median equals the mean