Topic 10: sampling + random error Flashcards
(20 cards)
Define sample
- Selected subset of source population
- Representative of source population
Define source population
- Group of all individuals = we’re interested to assess parameters
- Can be general population or sub-population e.g. everyone with condition in country
What is the purpose of a sample?
- Study something we can’t study in whole population due to practical restrictions e.g. financials/time
Define sampling
- Process of selecting number of individuals from all individuals in source population
Define sampling frame
- List with all individuals in population = used for sampling
Define sampling units
- Individuals potentially to be selected
Describe sampling in descriptive research
- Investigate prevalence/incidence of condition in population
- Sample accuracy important = represents specific source population
Describe sampling in analytic research
- Investigate association between exposure + outcome
- Can be general with source population depending on research
Describe sampling when investigating biological effects
- E.g. effect of smoke on risk of cancer
- Can be more general with source population = no need to restrict specific region
Describe sampling when investigating social/cultural effects
- E.g. effect of social class on risk of heart disease
- Need to restrict source population to specific region from where sample is derived
Define an estimate
- When measuring the sample of a population = determine proportion of characteristic = estimate
- Sampling error inherent
- Measure of quantity in sample = represents true quantity in source population
Describe statistical inference
- Sample estimate used to draw inferences about population from the sample
- Use statistics to determine degree of uncertainty in estimate of interest
Describe the parameter
- Measure of quality/association in population of interest e.g.
> Mean age
> Prevalence of obesity
> Mean difference in bp between M/F
> Odds ratio for association between smoking + cancer - Sample is used to make an estimate about something true for the whole group = quantify corresponding to population parameter
- Thereform mean of sample size ≈ mean of population parameter
Define sampling variation
- Variation between different sample estimates from same source population
Define sampling error
- Difference between sample estimate + actual population parameter when measuring in sample rather than source population
- Due to chance = random error
- Sample size plays role on magnitude of error
Describe sampling distribution
- We take repeated samples with sample size = n → e.g. n=3 take many samples of 3 values from the population
- Calculate mean of each sample
- Plot means on histogram = will cluster around true mean = estimate population parameter
Describe standard error
- Uncertainty of how well sample estimate represents population parameter
- Estimates SD of sample distribution = average error that can occur with sample size = n
How to calculate SE?
SE = S/√n
- S = sample SD
- n = sample size
Define confidence intervals
- Range within which we are confident with degree of uncertainty = true population parameters lie
Describe how the 95% confidence interval calculated
LOWER CI:
- Sample estimate - 1.96 x SE
UPPER CI:
- Sample estimate + 1.96 x SE
> 95% confident population parameter within interval sample estimate +/- 1.96 SE