Topic 3: intro to measurment of basic summary stats Flashcards
(12 cards)
1
Q
Define measure of central tendency
A
- Single/central/typical value for given variable around which other values cluster
2
Q
Give examples of central tendency
A
- Mean
- Median
- Mode
3
Q
Define measure of dispersion
A
- Extent of spread of values of given variable
4
Q
Give examples of dispersion
A
- Variation
- SD
- Interquartile range
5
Q
Describe mode
A
- Most common value of variable
- Highest frequency in sample
6
Q
Describe median
A
- Middle point of a distribution
7
Q
Describe the mean
A
- Add all the values > divide by number of values
- In normal distribution = most values clustered around mean
8
Q
Describe range
A
- Largest value - smallest value
- Easy to compute
- Not very informative = considers only 2 observations = effected by extreme values
9
Q
Describe quantiles
A
- Solve the problem of range only having 2 values
- Sorts values from min-max then split into parts
- Tertiles = 3 categories
- Quartiles = 4 categories
- Quintiles = 5 categories
- Q1 = 25%
- Q2 = 50% median
- Q3 = 75%
- Interquartile range = Q3-Q1
10
Q
Describe SD
A
- Used to quantify spread of variation around mean cluster
- S = square root of variance
FROM GRAPH - Calculate mean
- Calculate how much each point deviates from mean
- Calculate average of deviations
- SD = how much data differs from mean on average
11
Q
Describe a box plot
A
- 5-number summary
1) Min
2) Q1
3) Median
4) Q3
5) Max
12
Q
Describe histogram
A
- Summary graph for single numeric variable
- Use = understand pattern of variability in data
- Range of values = divided into equal intervals
- Shows = number of individual data points in each interval