Stratified sampling Flashcards
(26 cards)
What is stratified random sampling?
a) a form of random sampling which involves splitting the population into distinct, non-overlapping sub-groups called strata
b) strata are homogenous
How do you perform stratified sampling?
a) identify potentially relevant strata
b) take independent random samples from each stratum
c) combine results to estimated T, p, or X
What are the advantages of stratification?
a) potential to reduce variance of estimates
b) more representative as each stratum is represented
c) costs may be lower
d) estimates for separate strata can be compared
How do you calculate π π sqrd? (sample variance of i-th stratum)
What does the sum of the Wiβs add to in stratified sampling?
1
Why do we use weighted average in stratified sampling estimates?
a) sample means evaluated from larger strata have more importance (weight) and vice versa
How do you calculate the variance of the total in stratified sampling?
What do we assume when estimating in stratified sampling?
that the estimators are normally distributed
How would you calculate confidence intervals with stratified sampling?
a) z- or t-statistics for estimating margins of error/confidence intervals for a mean or total.
b) non-normal = a z-statistic can only be used given that each ππ > 30.
c) t-statistic degrees of freedom are (πβπ), i.e. the (overall sample size β number of strata).
d) use Z-statistic for estimating margin of error/confidence interval for proportion.
What is sample allocation in stratified?
splitting the overall sample size between the different strata to decide the sample size ππ for each stratum.
What are the methods of sample allocation?
a) proportional
b) neyman
c) optimal
What is proportional allocation?
a) simplest method
b) involves sampling each stratum in proportion to its size or weight respective to the population
c) sample more from larger strata, less from smaller
what are the requirements of proportional allocation?
a) means that sampling fractions are equal for each stratum (up to rounding errors)
b) ensures a representative sample;
c) straightforward and commonly used;
d) requires the stratum sizes ππ or at least the stratum weights ππ to be known.
When allocating sample size, how do you round the estimate?
the rounding of the overall sample size and sample sizes within strata should be to the nearest whole number
What is Neyman allocation?
a) chooses the ππ to directly minimise the variance of the estimator π₯Μ
ππ
b) sample more from strata with higher variability and/or larger strata.
what are the requirements of Neyman allocation?
a) Requires more information than proportional allocation
b) As well as the ππ (or the ππ), it requires ππ to be known or estimated
c) For equal ππ, the allocation reduces to proportional allocation, so the Neyman method is more general;
d) Where the ππ differ, this method is more precise than proportional allocation.
What is relative efficiency?
a) measurement of the improvement in the performance of one unbiased estimator over another
b) variance of estimator a over variance of estimator b
How do you interpret relative efficiency?
a) π
πΈ(π₯Μ
ππ/π₯Μ
ππ
π ) = 2
b) then stratified sampling was twice as efficient as SRS.
What is the design effect?
a) the reciprocal of the relative efficiency
b) another way to assess which estimator, and sampling scheme, is more efficient to use.
c) variance of parameter 1 over variance of parameter 2
How do you interpret design effect?
a) the design effect of stratified random sampling to SRS, when estimating a population mean would be:
ππππ(π₯Μ
ππ/π₯Μ
ππ
π ) = (πππ(π₯Μ
ππ)) / (πππ(π₯Μ
ππ
π))
b) Design effects less than 1 indicate an efficient design for the sampling scheme on numerator
What is optimal allocation?
a) chooses the sample sizes ππ to minimise the variance
b) takes into account situations where there is a budget for conducting a survey
c) sampling costs will be different in each strata
d) sample from strata which are larger and/or have greater variability ππ and/or have lower sampling costs.
How do you calculate total cost of sampling C?
where ci is the cost of sampling a single population element from the i-th stratum
what is the optimal allocation sample size formula?
a) where πΎπ are the fractions of the sample size π to be taken from the π-th stratum and βπΎπ =1.
What are the optimal allocation results under certain circumstances?
a) For equal costs ππ, this method gives the same as Neyman allocation.
b) For equal costs and equal ππ, this method gives the same as proportional allocation
c) so optimal allocation is the most general of the 3 methods of allocation