sampling+large data set Flashcards
5 types of sampling
simple random, systematic, stratified, opportunity, quota
describe simple random
assign number to every unit. use random number generator to pick units, ignore repeats
pros and cons of simple random
+bias free, easy and cheap
- not suitable when population too big, sampling frame needed
describe systematic
find interval k (pop/sample), take every kth unit
pros and cons of systematic
+simple, quick, suitable for big pop
-sampling frame needed, can introduce bias if frame not random
describe stratified
divide population into strata, simple random in each strata
pros and cons of stratified
+reflects pop structure, proportional representation of groups
-must be clearly stratified, (same an simple random)
describe quota
pop divided into characteristics, quota of items in each group, recruit samples until quota is hit
pros and cons of quota
+representative samples, no sampling frame, easy comparison between groups
-introduces bias, non responses not recorded, needs to be divided
descirbe opportunity
sample taken from people available at time of study who fit criteria
pros and cons of opportunity
+easy, cheap
-unlikely to be representative, dependent on individual researcher
pros and cons of census
+accurate, representative
-time consuming, expensive
pros and cons of sampling
+quicker
-may not be representative
cities in large data set
heathrow, leeming, leuchars, camborne, hurn, jacksonville, beijing, perth
lds dates
mar-oct 1987 and 2015
cleaning data
replacing trace with 0 or 0.025. if n/as, remove the entry