Statistics 2 Flashcards
What is a measure of location?
Single value that describes a position within a data set.
If this value is describing the centre of the data?
This is a measure of central tendency.
Mode/modal class?
Value/class that occurs most often.
Median?
Middle value when all values ordered.
Mean calculated by?
Sum of data values (∑x)/ number of data values (n)
When is the mode an appropriate measure?
-Qualitative/quantitative data
-Single mode/bimodal data.
Inappropriate to measure mode?
Each value only occurs once.
Median usage?
Quantitative data only.
Advantage of median vs mean?
Not affected by extreme values, so can be used in data with such values.
Mean usage?
-Utilises all pieces of data, giving a true measure of the data.
-Used for quantitative data only.
-Is affected by extreme values.
For data values of a frequency table, mean calculated by?
Frequency Density (Midpoint x Frequency) ∑xf/
Frequencies (∑f
Median?
-Describes the middle of the data set, splitting the data into 2 50% halves.
For effects on measure by a new data value, how is this evaluated?
Compare the previous value with the new one, if its larger, it increases etc.
Lower quartile?
1/4 of the way through the data set.
Upper quartile?
3/4s of the way through the data set.
Percentiles?
-Split data into 100 parts.
(e.g. 10th percentile is 10/100 (1/10) of the way through the data).
Calculate lower quartile for discrete data?
-n/4
-If integer, lower quartile halfway between this data point + next above.
-If not integer, round up and utilise this data point.
Upper quartile of discrete data?
-3/4 of n
-If integer, upper quartile halfway between this data point + one above.
-If not integer, round up and utilise this data point.
If data is presented in a group frequency table, how can medians, quartiles and percentiles be estimated?
Using process of linear interpolation.
Why is there assumption involved in process of linear interpolation?
Assumed that data values are evenly distributed within each class/range.
Lower + upper quartile + median in grouped continuous/cumulative frequency data calculation?
Q1: n/4th value
Q2: n/2th value
Q3: 3n/4th value
Measure of spread?
Measure of how spread out data is.
Range?
Difference between largest and smallest values of data set.
IQR?
Difference between the upper quartile and lower quartile, Q3-Q1.