sampling/ data Flashcards
(42 cards)
What is continuous data ?
any value in a range
What is discrete data ?
data that can only take certain values in a range
What is the population ?
complete set of items that share a common property
What is census ?
Where every member of the population provides data
What is a sample ?
a subset of the population which is used to collect data from
What is a random sample ?
Where members of the population have an equal chance of being picked
What is a biased sample ?
Where the sample is not random
What should the sample size of a population roughly be (using n as the population)
√n (square root population size)
Give an advantage of using a census
accurate results as representative of the entire population
Give a disadvantage of using a census
time consuming as large amounts of data to organise and analyse
Give an advantage of using a sample
quick to organise and analyse the data
Give a disadvantage of using a sample
small sample sizes can lead to inaccurate results as not representative of the entire population
What are the 5 different sampling techniques ?
1) random
2) stratified
3) cluster
4) quota
5) systematic
What is the method of how to use random sampling ? (2)
1) Uniquely number every member of a population
2) Randomly select n different numbers using a random number generator
Give an advantage of using random sampling
useful if the population is small
Give a disadvantage of using random sampling
if the population is very big, it can be time consuming to number/ list every member of the population
What is stratified sampling ?
Where the population is divided into groups based on demographics and then a random sample is conducted within each group.
What is the method on how to use stratified sampling ? (2)
1) calculate the number of members to be randomly sampled from each group
2) take a random sample from each group
How would you calculate the number of members to be randomly sampled within each group when using stratified sampling ?
The proportion of a group that is sampled is equal to the proportion of the population that belong to that group
What is an advantage of using stratified sampling ?
useful when very different groups of people within a population to make sure the data collected will still be representative of the population structure
What is a disadvantage of using stratified sampling ?
groups may not be easy to select as the groups may overlap
What is cluster sampling ?
This is when the population already falls into groups / clusters, a random number of these clusters are then selected to be sampled.
What is the method on how to conduct cluster sampling ?
1) identify the clusters within the population
2) randomly select a number of clusters to use
3) use all the members in these selected clusters
Give an advantage of using cluster sampling
easy to complete as well as inexpensive