Stats Key Info Flashcards
Population
Entire set of items (sampling units) in the group being studied.
Census
Measuring every member of a population
Evaluation - Census
+ Accurate
- Expensive
- Some testing destroys the item
Sampling frame
List of sampling units
(It is not always possible to create this, thus can be a disadvantage of some techniques)
Simple Random Sampling
Equal chance of being selected - done using random number generator/ lottery sampling alongside sampling frame.
Type of RANDOM Sampling
Evaluation - Simple Random Sampling
+ Bias-free
- Sampling frame required
Systematic Sampling
Taking every k^th unit (k = population / sample), pick random number between 1 and k for start point
Type of RANDOM Sampling
Evaluation - Systematic Sampling
+ Quick to use
- Sampling frame required
Stratified Sampling
Sample is proportionally representative of the strata (groups) of the population.
Formula: Sample / population x strata (for each strata)
(use either simple random/systematic to fill groups)
Type of RANDOM Sampling
Evaluation - Stratified Sampling
+ Reflects Population
- Need clearly classified strata (groups) for population
Opportunity Sampling
Sample based on who/what is available at the time.
Type of NON-RANDOM Sampling
Evaluation - Opportunity Sampling
+ Easy, cheap
- Unlikely to be representative
Quota Sampling
Similar to stratified sampling, but strata are filled by the researcher using opportunity sampling, thus are not necessarily representative of the population.
Type of NON-RANDOM Sampling
Evaluation - Quota Sampling
+ No sampling frame needed
- Not random, potential bias
Data Types
Qualitative: Non-numerical
Quantitative: Numerical
Types of Qualitative:
Discrete: Can only take certain values (often integers) => e.g. shoe size
Continuous: Can take any value in a range, must be grouped. => e.g. foot length
Median (Location)
LQ: n/4 th term
Median: n/2 th term
UQ: 3n/4 th term
xth percentile = x/100 n th term
Decile = 10% Chunk = Percentile/10
Mean (Location)
x̄ = ∑fx / ∑f (or ∑x / n)
Variance (Spread) σ^2
(∑f)(x^2) / ∑f) - x̄^2
MSMSM
Mean of the Squares Minus Square of the Mean
(Also = Sxx / n)
Coding
If y = ax + b…
then mean of y
= a(mean of x) + b
AND
σ of y = a x (σ of x)
Linear Interpolation (Location)
Using the assumption that all data values are evenly spread throughout each class, using proportion to find how far through each class the data value should be.
Remember to add on the lower-class boundary after finding the correct data value.
e.g.
Class Limits: 12.5 Q1 15.5
|——–|—–|
Cumulative Freq.: 5 10 13
12.5 + (10-5 / 13 - 5) x 3 = 14.375 = Q2
Sampling Units
Individuals of a population
Finding Quartiles (Location)
n/4 or n/2 or 3n/4
If decimal: round UP ALWAYS
If whole number, find midpoint with next value.
Outlier Boundaries (Representation)
Q1 - 1.5(IQR) or Q3 + 1.5(IQR)
(USUALLY)
Interquartile Range (Spread)
Q3 - Q1
+ Ignores extremes