Data Collection Flashcards
What is a population?
The entire collection of items you are investigating.
Size of population
N
Mean of population
μ (the Greek letter mu)
Variance of population
σ² (sigma squared)
Census
Survey of all members of population
Advantages of census
Accurate because all members considered
Unbiased
No worries about sampling frames or representative samples
Disadvantages of census
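Time-consuming and costly
Cannot be used when testing destroys the item (testing to destruction)
Difficult to ensure that every member of the population is surveyed, and surveyed accurately
Large data sets are difficult to process, and some of the data may not be relevant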
What is a sample?
A selected subset of the population.
Size of sample
n
Sample mean
x̄ (x with a bar on top)
Variance of sample
s² (s squared)
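The notation on the cards above (N, μ, σ², n, x̄, s²) can be tied to actual numbers with a minimal sketch. The data values are hypothetical, and the sketch assumes s² uses the n - 1 divisor, which is what Python's statistics.variance does; some courses define the sample variance with divisor n instead.

```python
import statistics

# Hypothetical data, invented purely for illustration.
population = [2, 4, 4, 4, 5, 5, 7, 9]   # N = 8
sample = [2, 4, 5, 9]                    # n = 4, a subset of the population

N = len(population)
n = len(sample)

mu = statistics.mean(population)             # population mean, mu
sigma_sq = statistics.pvariance(population)  # population variance, divides by N

x_bar = statistics.mean(sample)              # sample mean, x-bar
s_sq = statistics.variance(sample)           # sample variance, divides by n - 1 (assumption)

print(N, mu, sigma_sq)
print(n, x_bar, s_sq)
```

The sample statistics x̄ and s² are estimates of the population parameters μ and σ², so they will generally differ from them.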
Sample unit
A single item of the population.
Sampling frame
A list or database of all members of the population.
Sample survey
An investigation that collects data from every member of the sample.
A well-conducted survey can provide very good estimates.
Advantages of sampling
A large, well-mixed sample can represent the population
Cheaper and quicker than a census
Products can be tested to destruction
Data are readily available and can be more detailed and of better quality
Disadvantages of sampling
Can be biased because of poorly designed sample methods.
Can be inaccurate because of random variations.
The sample may be too small to give information about subgroups of the population.
There may be no sampling frame.
Sampling error
Error that arises from using a sample survey. There are two types: random error and bias (systematic error).
Sampling error
Random error
Error caused by random variation. These errors can be positive or negative, and on average they are zero.
Sampling error
Bias or systematic error
Error caused by a non-random method of choosing the sample. These errors can be positive or negative, but on average they are non-zero (the estimate is biased).
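The difference between the two kinds of sampling error can be seen in a small simulation. This is only a sketch under stated assumptions: the population (the values 1 to 1000), the sample size, and the "flawed" method that only ever reaches the upper half of the population are all hypothetical choices made for illustration.

```python
import random
import statistics

rng = random.Random(0)                 # fixed seed so the run is repeatable

population = list(range(1, 1001))      # hypothetical population
mu = statistics.mean(population)       # true population mean, 500.5

def sampling_error(frame, n):
    """Sample mean minus true population mean for one sample drawn from frame."""
    return statistics.mean(rng.sample(frame, n)) - mu

# Random method: every member of the population can be chosen.
random_errors = [sampling_error(population, 50) for _ in range(2000)]

# Flawed method (assumption): only the upper half of the population is reachable.
reachable = population[500:]
biased_errors = [sampling_error(reachable, 50) for _ in range(2000)]

avg_random_error = statistics.mean(random_errors)
avg_biased_error = statistics.mean(biased_errors)

print(avg_random_error)   # individual errors are + or -, the average is near 0
print(avg_biased_error)   # the average stays well away from 0
```

Individual errors from the random method are sometimes positive and sometimes negative and cancel out over many samples, whereas the flawed method is wrong in the same direction on average.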
Random sampling
Each item is chosen purely by chance.
Free from bias, but a sampling frame is required, and it is more difficult and more expensive than non-random sampling.
Three types of random samples
Simple random sampling
Stratified sampling
Systematic sampling
Simple random sampling
A sample of size n chosen so that every possible sample of size n has an equal chance of being selected.
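A simple random sample can be drawn from a sampling frame with Python's standard random.sample, which selects without replacement and gives every possible subset of the requested size the same chance. The frame of 100 named members and the helper name simple_random_sample are hypothetical, used only for this sketch.

```python
import random

def simple_random_sample(frame, n, seed=None):
    """Return n distinct members of the sampling frame, chosen purely by chance."""
    rng = random.Random(seed)       # seed is optional, useful for repeatable demos
    return rng.sample(frame, n)     # sampling without replacement

# Hypothetical sampling frame listing every member of the population.
frame = [f"member_{i}" for i in range(1, 101)]

chosen = simple_random_sample(frame, 10, seed=1)
print(chosen)
```

Note that a sampling frame is needed before this can be done at all, which is one of the practical costs of random sampling.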
Stratified sampling
A population of size N is divided into mutually exclusive subgroups called strata.
Strata can be decided by one or more criteria.
A random sample is taken from each stratum. The total sample size is n.
Sample size for each stratum
(population of the stratum) × n / N
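The stratum sample size formula can be worked through with a short sketch. The strata (three school year groups), their sizes, and the helper name stratum_sample_sizes are hypothetical. Rounding to whole numbers is an assumption of this sketch; with other figures the rounded sizes may not add up to exactly n and would need adjusting by hand.

```python
import random

def stratum_sample_sizes(stratum_sizes, n):
    """Apply (population of the stratum) * n / N to each stratum, rounded."""
    N = sum(stratum_sizes.values())                  # total population size
    return {name: round(size * n / N) for name, size in stratum_sizes.items()}

# Hypothetical strata and their population sizes (N = 600).
stratum_sizes = {"Year 7": 120, "Year 8": 180, "Year 9": 300}
n = 50                                               # total sample size wanted

allocation = stratum_sample_sizes(stratum_sizes, n)
print(allocation)

# Take a simple random sample of the allocated size from each stratum.
rng = random.Random(2)
strata = {name: [f"{name} pupil {i}" for i in range(1, size + 1)]
          for name, size in stratum_sizes.items()}
sample = {name: rng.sample(members, allocation[name])
          for name, members in strata.items()}
```

Here 120 × 50 / 600 = 10, 180 × 50 / 600 = 15 and 300 × 50 / 600 = 25, so each stratum is represented in the sample in the same proportion as in the population.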