Chapter 5: Measures of Variability Flashcards
(17 cards)
Variability (Dispersion)
- Degree to which an individual data point is distributed around the mean
- Statistics attempts to sort out what the ‘causes’ of variability in behaviour are
Range: median
- Distance from lowest –> highest score
- -Influenced by outliers
- -(Xf - Xi)
- Not a good measure of spread b/c extremes (high/ low) can have big effect
Interquartile Range (IQR): median
- 2 inner quartiles/ center most 50% of population
- -Range is taken w these groups to eliminate outliers
- Is a trimmed stat b/c it is a stat calculated based on a trimmed sample
- Goal = eliminate outliers that are due to measurement errors
Advantage
-Based on middle 50%, less influenced by extreme values
Disadvantage
- Discards too much data
- -Creates a very trimmed sample
Trimmed Sample
Samples with a % of extreme scores removed
Trimmed Statistics
Statistics calculated based on trimmed samples
Sample Variance (s^2): mean
Sum of squared about the mean, divided by (N-1)
-Is bias unless calculated correctly
Population Variance
Variance of a population is usually estimated, rarely computed
-A difference in variance reflects the differences in distribution
Standard Deviation: mean
- +’ve square root of the variance
- Measure of average of the deviations of each score from the mean
- -How much, on average, scores differ from the mean
Bias
-Proportion of statistic whose range average isn’t a parameter it estimates
Degrees of Freedom (df)
# of independent pieces of information remaining after estimating 1+ populations -(N-1) = df
Expected Value (E)
Long-range average of a statistic over repeated samples
Boxplot: median
- Graphical representation of the dispersion of a sample & extreme scores
- Hinges: scores cutting off top & bottom 1/4
- H-spread: range b/w hinges (Hf - Hi)
- Median: line inside the box
- Whiskers: line indication = 1.5xH-spread
- -If biggest/ smallest score = less than a whisker, the whisker line should only extend to that value
- Outliers: any score beyond the whisker
Quartile Location
[ (Median Location +1) / 2 ]
-Location of the quartile in an ordered series
Whisker
Line from top & bottom of the box to the farthest point that’s no more than 1.5X the IQR from the box
Winsorized Variance
Variance of a Windsor sample
-Take out ie. greatest & lowest 20% & replace with the lowest/ highest values remaining
Winsorized Standard Deviation
Standard deviation of a winsorized sample
Average Deviation
- Sum of the difference b/w scores & mean
- We don;t use this b/c the -ve #s cancel out
- -Therefore we square it, get rid of -ve deviations & obtain variance
- -We then sqr root variance & obtain standard deviation
- –> v = s^2