1 - Numerical Measures, Graphs and Diagrams Flashcards
(12 cards)
What is the median and its notation?
The median is the middle value when the data are arranged in rank order. If there is no single middle value (an even number of values), it is the mean of the two middle values. It is denoted ( Q_2 ), the second quartile (50% of the way through the data).
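The rule above can be sketched in a few lines of Python. The function name `median` and the sample lists are illustrative only and are not part of the original cards.

```python
def median(values):
    """Return the middle value of the data once it is in rank order."""
    ordered = sorted(values)          # arrange in ranked order
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:                    # odd count: one middle value
        return ordered[mid]
    # even count: mean of the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2


odd_median = median([7, 3, 9, 1, 5])   # ranked: 1 3 5 7 9
even_median = median([7, 3, 9, 1])     # ranked: 1 3 7 9
print(odd_median, even_median)
```

With an odd number of values the result is one of the data points; with an even number it may fall between two of them.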
What is the mode and its limitations?
The mode is the value with the highest frequency in a dataset. For grouped data, the modal class is the class (group) with the highest frequency.
Limitation: the mode may not be representative of the dataset, for example when the most frequent value lies far from most of the other values.
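A short sketch of both cases, using the standard-library function `statistics.multimode` (Python 3.8+). The data values and class labels are made up for illustration.

```python
from statistics import multimode

# Ungrouped data: the most frequent value is 1, even though
# most of the values sit around 50.
data = [1, 1, 48, 50, 52, 55, 60]
modes = multimode(data)               # list of the most frequent value(s)

# Grouped data: the modal class is the class with the highest frequency.
# Hypothetical classes and frequencies.
grouped = {"0-10": 3, "10-20": 8, "20-30": 5}
modal_class = max(grouped, key=grouped.get)

print(modes, modal_class)
```

Here the mode of the ungrouped data is 1, which illustrates the limitation: it says little about where most of the values lie.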
Key Measures of Spread
Range, Interquartile Range (IQR), Standard Deviation, Variance
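As a rough illustration, the four measures can be computed with Python's standard `statistics` module. The dataset is invented, the variance and standard deviation shown are the population versions, and quartile conventions differ between textbooks and software, so the IQR may not match a hand calculation that uses a different method.

```python
from statistics import pstdev, pvariance, quantiles

data = [2, 4, 4, 4, 5, 5, 7, 9]       # illustrative values only

data_range = max(data) - min(data)    # range: largest minus smallest

# quantiles(..., n=4) returns the three cut points Q1, Q2, Q3
# (default "exclusive" method; other conventions give other values).
q1, q2, q3 = quantiles(data, n=4)
iqr = q3 - q1                         # interquartile range

variance = pvariance(data)            # population variance
std_dev = pstdev(data)                # population SD = sqrt(variance)

print(data_range, iqr, variance, std_dev)
```

For sample (rather than population) measures, `statistics.variance` and `statistics.stdev` could be used instead.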
What is an Outlier and how can it be identified?
An outlier is a data point that differs significantly from the other observations. A common rule identifies outliers using the quartiles:
Lower outlier: any value below ( Q_1 - 1.5(Q_3 - Q_1) )
Upper outlier: any value above ( Q_3 + 1.5(Q_3 - Q_1) )
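The quartile rule can be sketched as follows. The function name `find_outliers`, the data, and the quartile values (taken as given rather than computed) are all assumptions made for the example.

```python
def find_outliers(values, q1, q3):
    """Return the values lying outside the 1.5 x IQR boundaries."""
    iqr = q3 - q1
    lower_bound = q1 - 1.5 * iqr      # Q1 - 1.5(Q3 - Q1)
    upper_bound = q3 + 1.5 * iqr      # Q3 + 1.5(Q3 - Q1)
    return [v for v in values if v < lower_bound or v > upper_bound]


# Hypothetical data with quartiles assumed to be Q1 = 10 and Q3 = 20,
# so the boundaries are 10 - 15 = -5 and 20 + 15 = 35.
data = [-8, 3, 12, 15, 18, 22, 34, 36]
outliers = find_outliers(data, q1=10, q3=20)
print(outliers)
```

Values exactly on a boundary are not treated as outliers in this sketch; some courses define the rule slightly differently.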
What is a Box and Whisker Plot, and what is an example of one?
Definition: A box plot summarizes data using five key figures: minimum, lower quartile ( Q_1 ), median ( Q_2 ), upper quartile ( Q_3 ), and maximum.
Example: Compare masses of male and female turtles using box plots.
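A minimal sketch of the five figures behind such a comparison. The turtle masses are invented for illustration, and the quartiles use the default method of `statistics.quantiles`, which may differ from the convention used in a given course.

```python
from statistics import quantiles

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a box plot."""
    q1, q2, q3 = quantiles(values, n=4)
    return (min(values), q1, q2, q3, max(values))


# Hypothetical masses, in arbitrary units.
male = [10, 12, 14, 15, 17, 19, 22]
female = [14, 16, 19, 21, 24, 26, 30]

male_summary = five_number_summary(male)
female_summary = five_number_summary(female)
print("male:  ", male_summary)
print("female:", female_summary)
```

Comparing the two summaries side by side (medians for average mass, Q3 minus Q1 for spread) is the same comparison the two box plots would show visually.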
What is a Cumulative Frequency table, and what is an example of one?
Definition: Cumulative frequency is the running total of frequencies, useful for estimating medians and quartiles.
Example: Heights of giraffes summarized in a cumulative frequency table.
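The running total, and its use for estimating the median, can be sketched like this. The height classes and frequencies are hypothetical, and the estimate assumes values are spread evenly within each class (linear interpolation).

```python
from itertools import accumulate

# Hypothetical giraffe heights in metres: (lower, upper) class bounds.
classes = [(4.0, 4.2), (4.2, 4.4), (4.4, 4.6), (4.6, 4.8), (4.8, 5.0)]
frequencies = [4, 9, 15, 8, 4]

# Cumulative frequency: running total of the frequencies.
cumulative = list(accumulate(frequencies))
total = cumulative[-1]

# The median is at position total / 2; find the first class whose
# cumulative frequency reaches that position.
target = total / 2
index = next(i for i, cf in enumerate(cumulative) if cf >= target)
lower, upper = classes[index]
below = cumulative[index - 1] if index > 0 else 0

# Linear interpolation within the median class.
median_estimate = lower + (target - below) / frequencies[index] * (upper - lower)

print(cumulative, classes[index], round(median_estimate, 2))
```

The same approach gives estimates for the quartiles by using `total / 4` and `3 * total / 4` as the target positions.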
Advantage and Disadvantage of Mode
A - Easy to identify
D - May not represent data well
Advantage and Disadvantage of Median
A - Not affected by outliers
D - Less informative for small datasets
Advantage and Disadvantage of Mean
A - Uses all data points
D - Affected by outliers
Advantage and Disadvantage of Range
A - Simple to calculate
D - Ignores distribution of data
Advantage and Disadvantage of IQR
A - Robust against outliers
D - Requires ordered data
Advantage and Disadvantage of SD
A - Comprehensive measure of spread
D - Complex to calculate