Chapter 3: Summarising Data Flashcards
What is a measure of central tendency?
A single value that represents the ‘centre’ of a set of data. The mode, median, and mean are all measures of central tendency.
Define mode in data.
The value that appears most often in the data; the most common value.
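As an illustration of the definition, here is a minimal Python sketch that counts how often each value appears and returns whichever value (or values, if there is a tie) appears most. The function name `modes` and the sample data are made up for this example.

```python
from collections import Counter

def modes(values):
    """Return every value tied for the highest count, in first-seen order."""
    counts = Counter(values)          # value -> number of appearances
    highest = max(counts.values())    # the largest count in the data
    return [v for v, c in counts.items() if c == highest]

# Illustrative data only (not from the flashcards).
print(modes([3, 5, 5, 7, 8]))                   # works with numbers
print(modes(["red", "blue", "red", "green"]))   # and with qualitative data
```

Returning a list rather than a single value is a design choice here, so that data with two equally common values is not silently reduced to one.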
What is a modal class?
The class with the highest frequency.
What is the median?
The middle value of a dataset.
How do you find the median of discrete data?
1. Put the numbers in order from smallest to largest.
2. Find the ((n + 1) / 2)th value, where n is the number of values. This is the median position.
What is the formula to find the median position?
(n + 1) / 2.
What should you do if the median position is a decimal?
Find the two surrounding values and average them.
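The discrete-data method (order the values, find position (n + 1) / 2, and average the two neighbours when that position is a decimal) could be sketched in Python as follows. The function name `median` and the sample lists are assumptions for illustration, and the sketch assumes the list is not empty.

```python
def median(values):
    """Median of a non-empty list using the (n + 1) / 2 position rule."""
    ordered = sorted(values)            # step 1: smallest to largest
    n = len(ordered)
    position = (n + 1) / 2              # step 2: 1-based median position
    if position.is_integer():
        # Odd n: the position lands exactly on one value.
        return ordered[int(position) - 1]
    # Even n: the position ends in .5, so average the two surrounding values.
    below = ordered[int(position) - 1]
    above = ordered[int(position)]
    return (below + above) / 2

# Illustrative data only.
print(median([7, 1, 3]))       # 3 values, position 2
print(median([4, 1, 3, 2]))    # 4 values, position 2.5
```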
How do you find the median in grouped data?
Identify the median class: the class that contains the median position.
What is the estimated median using linear interpolation?
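Use ½ n as the median position, then interpolate within the median class: lower class boundary + ((½ n − cumulative frequency before the median class) / frequency of the median class) × class width.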
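A minimal Python sketch of linear interpolation for a grouped median is shown below. It assumes each class is given as a (lower boundary, upper boundary, frequency) triple in ascending order; the function name `estimated_median` and the sample table are hypothetical.

```python
def estimated_median(classes):
    """Estimate the median of grouped data by linear interpolation.

    classes: list of (lower_boundary, upper_boundary, frequency) in order.
    """
    n = sum(freq for _, _, freq in classes)
    position = n / 2                    # median position for grouped data
    cumulative = 0                      # frequency before the current class
    for lower, upper, freq in classes:
        if freq > 0 and cumulative + freq >= position:
            # This is the median class: move the right fraction of the
            # way through it.
            fraction = (position - cumulative) / freq
            return lower + fraction * (upper - lower)
        cumulative += freq
    raise ValueError("no data in any class")

# Illustrative table only: 20 values in three classes of width 10.
table = [(0, 10, 5), (10, 20, 10), (20, 30, 5)]
print(estimated_median(table))
```

In the sample table the median position is 10, which falls in the 10 to 20 class after 5 earlier values, so the estimate sits halfway through that class.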
What is the mean (arithmetic mean)?
The sum of all values divided by the number of values.
Provide the formula for mean.
𝑥̅ = ∑𝑥 / n.
How do you calculate the mean from a frequency table?
Add an extra column for f × x, sum it, and divide by total frequency.
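The frequency-table method can be sketched in Python as below: build the f × x values, total them, and divide by the total frequency. The function name and the sample table are assumptions made for this example.

```python
def mean_from_frequency_table(table):
    """table: list of (value, frequency) pairs."""
    total_fx = sum(x * f for x, f in table)   # the extra f × x column, summed
    total_f = sum(f for _, f in table)        # total frequency
    return total_fx / total_f

# Illustrative table only: value 1 occurs twice, 2 three times, 3 five times.
table = [(1, 2), (2, 3), (3, 5)]
print(mean_from_frequency_table(table))
```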
What is the formula for weighted mean?
Weighted Mean = ∑(weight × value) / ∑weights.
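As a worked illustration of the weighted mean formula, here is a short Python sketch. The function name `weighted_mean` and the marks and weights used are invented for the example.

```python
def weighted_mean(values, weights):
    """Sum of (weight × value) divided by the sum of the weights."""
    total = sum(w * v for v, w in zip(values, weights))
    return total / sum(weights)

# Illustrative data only: two marks, the second counting three times as much.
marks = [60, 80]
weights = [1, 3]
print(weighted_mean(marks, weights))   # (1×60 + 3×80) / (1 + 3)
```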
What is the geometric mean?
The nth root of the product of all n values.
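A minimal Python sketch of the geometric mean is given below, assuming all values are positive. It uses `math.prod` from the standard library (Python 3.8 or later); the function name and sample data are illustrative.

```python
import math

def geometric_mean(values):
    """nth root of the product of n positive values."""
    n = len(values)
    product = math.prod(values)     # multiply all the values together
    return product ** (1 / n)       # take the nth root

# Illustrative data only: the product is 16 and there are 2 values.
print(geometric_mean([2, 8]))
```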
Why is transforming data useful?
To simplify calculations with large numbers.
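One common way to transform data is to subtract a constant from every value, work with the smaller numbers, and then add the constant back at the end. The Python sketch below shows this for the mean; the choice of transformation, the function name, and the data are all assumptions for illustration.

```python
def mean_by_coding(values, constant):
    """Find the mean by first subtracting a constant from every value."""
    coded = [x - constant for x in values]    # smaller, easier numbers
    coded_mean = sum(coded) / len(coded)      # mean of the coded values
    return coded_mean + constant              # undo the transformation

# Illustrative data only: subtracting 1000 leaves 2, 5 and 8.
data = [1002, 1005, 1008]
print(mean_by_coding(data, 1000))
print(sum(data) / len(data))                  # direct calculation agrees
```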
What happens to the mode when new values are added?
It could change if the new value affects which value appears most.
How does adding a value greater than the median affect the median?
The median might increase. It either increases or stays the same; it cannot decrease.
What is the range in statistics?
The difference between the largest and smallest values.
What is the formula for range?
Range = Largest Value - Smallest Value.
Define interquartile range (IQR).
The range of the middle 50% of the data when in order.
What is the formula for the interquartile range?
IQR = Upper Quartile - Lower Quartile.
What is the lower quartile (LQ)?
The value 25% of the way through the ordered data.
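The range and interquartile range can be sketched in Python as below. Quartile conventions differ between textbooks; this sketch assumes the default "exclusive" method of `statistics.quantiles` (Python 3.8 or later), which places the lower quartile at position (n + 1) / 4. The function name `spread` and the data are illustrative.

```python
import statistics

def spread(values):
    """Return (range, interquartile range) for a list of numbers."""
    data_range = max(values) - min(values)    # largest minus smallest
    # quantiles(..., n=4) returns the three cut points: LQ, median, UQ.
    lower_q, _, upper_q = statistics.quantiles(values, n=4)
    return data_range, upper_q - lower_q      # IQR = UQ - LQ

# Illustrative data only: 7 ordered values, so LQ is the 2nd and UQ the 6th.
print(spread([1, 2, 3, 4, 5, 6, 7]))
```

A different quartile rule may give slightly different quartiles for small datasets, so the convention used in the course should take priority.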
True or False: The mean is always affected by extreme values.
True.
List the advantages of using mode.
- Easy to use
- Always a value in the data
- Unaffected by extreme values
- Can be used with quantitative and qualitative data