Surveys - Sample Designs Flashcards

1
Q

what is a controlled experiment?

A
  1. Comparitive (2+ things, groups, ideas etc.)
  2. Manipulative (manipulate one variable or more (ie treatment), to study relationships)
    - cause and effect
    - before and after
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an observational study?

A
  1. Absolute (no baseline comparison)
  2. Mensurative (measure natural variation between variables, with no manipulation)
    - survey
    - monitoring
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between a survey and monitoring?

A

Survey - (estimate a statistic, no temporal change in period or survey)

Monitoring - (estimate a change in statistic, temporal changes during period of observations)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 2 types of survey sampling?

A
  1. sampling with replacement (SIR)

2. sampling without replacement (SI)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mean for random sampling

A

Sum of all values divided by the number of samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Variance of mean for simple random sampling

A

SD squared of all samples, divided by # of samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a confidence interval?

A

The interval in which we are x% confident that the true pop mean u lies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What do confidence intervals consist of?

A
  • un upper and lower limit

- a degree of confidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the solution to bad random sampling pick?

A
  • divide population into sub-groups (strata)
  • don’t overlap
  • randomisation within strata
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the 2 types of stratified random sampling?

A
  1. Stratification - elements in pop divided into strata based on their variables
    * must be non-overlapping and together constitute the whole pop*
  2. Sampling within strata - samples selected randomly and independently from each stratum
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why do we stratify?

A
  1. Precision - more homogenous strata then more precise estimates.
  2. Captures individual strata characteristics - characteristics of each sample weighed proportional to entire pop - similar to weighted average.
  3. Practical - already know info may differ between groups/ strata is occuring (e.g suburbs)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is the mean for stratified random sampling (StR) calculated?

A

First calculate the mean of each strata, then multiply each mean by its weighting (usually a proportion)
Then add up weighted means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we calculate the variance of the mean for stratified random sampling?

A

First calculate varience for mean of each strata, multiply each varience value by square of weighting
Then add up weighted variences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Worked example: Stratified sampling

A
# definitions
A = c(90, 78, 86, 71) # define stratum A (4 samples)
B = c(48, 56, 42)     # defime stratum B (3 samples)
n = 7                 # total number of samples
tcrit = qt(.975, df = n-2) # t critical value for 95% CI
wt = c(A = .62, B = .38)   # define weights
# calculations:
wmean = sum(mean(A) * wt[1], mean(B) * wt[2]) # weighted mean
# weighted^2 variance of mean:
wvar = sum(var(A)/4 * wt[1]^2, var(B)/3 * wt[2]^2) 
L95t = wmean - tcrit * se # lower 95% CI
U95t = wmean + tcrit * se # upper 95% CI
c(lower95 = L95t, upper95 = U95t)
##  lower95  upper95 
## 61.04864 76.68803
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Worked example: Simple Random Sampling

A
# definitions
A = c(90, 78, 86, 71) # define stratum A (4 samples)
B = c(48, 56, 42)     # defime stratum B (3 samples)
n = 7                 # total number of samples
tcrit = qt(.975, df = n-1) # t critical value for 95% CI
# calculations:
mean_ab = mean(c(A, B))   # mean
var_ab = var(c(A, B))/n   # variance of the mean
L95s = mean_ab - tcrit * sqrt(var_ab) # lower 95% CI 
U95s = mean_ab + tcrit * sqrt(var_ab) # upper 95% CI
c(lower95 = L95s, upper95 = U95s)
##  lower95  upper95 
## 49.84627 84.72516
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does monitoring study?

A

the change in the mean overtime