Interpreting Statistics Flashcards
(33 cards)
What does a box plot show in relation to data?
The shape, central tendency, and variability of the data
What are the best conditions for using a box plot?
When the sample size is greater than 20
Can box plots be used to identify outliers?
Yes
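A minimal sketch of one way outliers can be flagged numerically. The cards do not name a rule, so the 1.5 × IQR fences used here are an assumption (a common box plot convention). The function name `iqr_outliers`, the parameter `k`, and the sample data are illustrative only, and quartile values depend on the method the software uses.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR beyond the quartiles."""
    # statistics.quantiles sorts the data and uses its default
    # "exclusive" method; other tools may give slightly different quartiles.
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]

# Hypothetical sample with one suspiciously large value
wait_times = [12, 14, 15, 15, 16, 17, 18, 19, 20, 21, 58]
outliers = iqr_outliers(wait_times)
print(outliers)  # [58]
```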
What is the coefficient of variation (CoefVar)?
A measure of spread relative to the mean: the standard deviation divided by the mean, often expressed as a percentage
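A short sketch of the CoefVar calculation, assuming the usual definition (sample standard deviation divided by the mean, times 100). The function name `coef_var` and the two data sets are made up for illustration.

```python
import statistics

def coef_var(values):
    """Coefficient of variation as a percentage: sample SD / mean * 100."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("CoefVar is undefined when the mean is 0")
    return statistics.stdev(values) / mean * 100

# Both samples have the same SD, but different means
heights_cm = [150, 160, 170, 180, 190]
weights_kg = [50, 60, 70, 80, 90]

cv_heights = coef_var(heights_cm)
cv_weights = coef_var(weights_kg)
print(f"heights: {cv_heights:.1f}%  weights: {cv_weights:.1f}%")
```

Because CoefVar has no units, it can be used to compare the spread of data measured on different scales; here the weights vary more relative to their mean.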
How do you interpret the variation of data?
The larger the variation, the greater the spread in the data.
What are quartiles?
Values that divide the sample of ordered data into four equal parts
What does the 25th percentile mean?
It indicates that 25% of the data are less than or equal to this value
How do you calculate the interquartile range (IQR)?
Quartile 3 - Quartile 1
*The Descriptive Statistics tool in Excel does not report this
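Since the IQR may have to be worked out separately, here is a minimal sketch using Python's standard library. The data are invented, and the quartiles come from the default method of `statistics.quantiles`; other software may interpolate differently and give slightly different values.

```python
import statistics

# Hypothetical sample; quantiles() orders the data internally
data = [7, 3, 11, 1, 9, 5, 2, 10, 4, 8, 6]

# n=4 splits the ordered data into four equal parts,
# returning the three cut points Q1, Q2 (the median), and Q3
q1, q2, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1

print(q1, q2, q3)  # 3.0 6.0 9.0
print(iqr)         # 6.0
```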
What does a histogram do?
Divides the sample values into many intervals and represents the frequency of data values in each interval with a bar
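The binning step behind a histogram can be sketched in a few lines. This is an illustration only: the function name `histogram_counts`, the bin width of 10, and the scores are assumptions, and plotting tools usually choose the intervals automatically.

```python
from collections import Counter

def histogram_counts(values, bin_width):
    """Count how many values fall in each interval [start, start + bin_width)."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))

# Hypothetical test scores
scores = [52, 55, 61, 64, 67, 68, 70, 71, 73, 75, 78, 82, 85, 91]
bins = histogram_counts(scores, 10)

# Draw each interval's frequency as a text bar
for start, count in bins.items():
    print(f"{start}-{start + 9}: {'#' * count}")
```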
How can you tell if the data is normally distributed?
If the data is symmetric and bell-shaped (a peak in the middle with equal tails to the left and right)
A curve overlaid on the histogram can indicate this
What charts can display skewness easily?
Histograms and boxplots
What is kurtosis?
It indicates how the peak and tails of a distribution differ from the normal distribution
What does a kurtosis value of 0 mean?
Baseline: a value of 0 indicates that the peak and tails of the data match those of the normal distribution.
What does a positive kurtosis value mean?
This indicates that the distribution has heavier tails and a sharper peak than the normal distribution.
What does a negative kurtosis value mean?
This indicates that the distribution has lighter tails and a flatter peak than the normal distribution.
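To make the sign of kurtosis concrete, here is a rough sketch using the simple moment-based formula (fourth central moment divided by the squared variance, minus 3). This is an assumption about the definition: statistical packages typically apply a sample-size correction, so their output will differ somewhat from these numbers. The function name and both data sets are illustrative.

```python
def excess_kurtosis(values):
    """Moment-based excess kurtosis: m4 / m2**2 - 3 (0 for a normal distribution)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m4 = sum((x - mean) ** 4 for x in values) / n  # fourth central moment
    return m4 / m2 ** 2 - 3

# Evenly spread values: light tails, flat shape
flat = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
# Mostly identical values with two extreme points: heavy tails
heavy = [-10, 0, 0, 0, 0, 0, 0, 0, 0, 10]

flat_k = excess_kurtosis(flat)
heavy_k = excess_kurtosis(heavy)
print(f"flat: {flat_k:.2f}  heavy: {heavy_k:.2f}")  # flat is negative, heavy is positive
```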
What can the maximum value be used for?
To identify possible outliers or data entry errors, especially if the maximum is very high compared to the rest of the data
How can you interpret the mean?
- Used as a standard measure of the centre of the distribution of the data
- Measures central tendency
- If the data is symmetric, the mean and median are similar
What is the median?
Midpoint of the data set.
Half of the data is below the median and the other half is above the median
How can you interpret the median?
- Measures central tendency
- Outliers affect the median less than they affect the mean, so it is useful to compare the two
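A quick illustration of how a single outlier moves the mean much more than the median. The salary figures are invented for the example.

```python
import statistics

# Hypothetical salaries (in thousands), then the same data plus one extreme value
salaries = [30, 32, 35, 36, 40]
with_outlier = salaries + [400]

mean_before = statistics.mean(salaries)
median_before = statistics.median(salaries)
mean_after = statistics.mean(with_outlier)
median_after = statistics.median(with_outlier)

print(mean_before, median_before)  # 34.6 35
print(mean_after, median_after)    # 95.5 35.5
```

A large gap between the mean and the median, as in the second line, is a hint that the data are skewed or contain outliers.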
What is the mode?
The value/number in the data set that occurs most frequently
There can be multiple modes, so make sure to check
How can you interpret the mode(s)?
- Can be used to identify problems in the data
- If the data has two modes, it is called bimodal
- If the data has more than two modes, it is called multimodal
- The mode can be used with the mean and the median to provide an overall view of the data
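A small sketch for checking whether a data set has more than one mode, using `statistics.multimode` from the standard library (Python 3.8 or later). The helper name `describe_modes` and the data are illustrative. Note that when every value occurs exactly once, `multimode` returns all of them, so the label is only meaningful when some values repeat.

```python
import statistics

def describe_modes(values):
    """Return the list of modes and a label based on how many there are."""
    modes = statistics.multimode(values)  # all most-frequent values
    if len(modes) == 1:
        label = "unimodal"
    elif len(modes) == 2:
        label = "bimodal"
    else:
        label = "multimodal"
    return modes, label

# Hypothetical data where 3 and 7 each occur twice
modes, label = describe_modes([2, 3, 3, 5, 7, 7, 9])
print(modes, label)  # [3, 7] bimodal
```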
How can you interpret the range?
A large range indicates greater dispersion in the data
A small range indicates that there is less dispersion in the data
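As a brief illustration, the range is simply the maximum minus the minimum. The two samples below are made up; they share the same mean but have very different dispersion.

```python
def value_range(values):
    """Range = largest value minus smallest value."""
    return max(values) - min(values)

# Both hypothetical samples have a mean of 50
tight = [48, 49, 50, 51, 52]
wide = [10, 30, 50, 70, 90]

tight_range = value_range(tight)
wide_range = value_range(wide)
print(tight_range, wide_range)  # 4 80
```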
What does SE stand for?
Standard error of the mean
What does standard error of the mean do?
Estimates the variability between sample means
(if you took repeated samples from the same population, it estimates how much their means would differ)
(The SD, by contrast, measures the variation within a single sample)
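A minimal sketch of the standard calculation, SE = SD / sqrt(n), where SD is the sample standard deviation and n is the sample size. The function name and sample values are illustrative.

```python
import math
import statistics

def standard_error(values):
    """Standard error of the mean: sample SD divided by sqrt(n)."""
    return statistics.stdev(values) / math.sqrt(len(values))

# Hypothetical sample of 8 observations
sample = [2, 4, 4, 4, 5, 5, 7, 9]

sd = statistics.stdev(sample)   # variation within this one sample
se = standard_error(sample)     # estimated variation between sample means
print(f"SD = {sd:.3f}, SE = {se:.3f}")
```

Because the SD is divided by sqrt(n), the SE shrinks as the sample size grows, even when the SD stays about the same.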