Producing Data Flashcards
observational study
observing individuals and measuring variables of interest without attempting to influence the responses
experiment
deliberately imposing some treatment on individuals in order to observe their responses. An experiment can help eliminate (or at least try to minimize the effects of) lurking variables
population
the entire group of individuals we are interested in
census
an observational study that attempts to include every member of the population
sample
subgroup of the population that is actually studied
sampling
studying a part in order to gain information about the whole
sampling error/variability
natural variation one would expect to see in sample statistics from sample to sample
voluntary response sample
people who choose to be part of a sample by responding to a general appeal (biased)
convenience sample
choosing the most convenient individuals from the population for your sample (biased)
bias
occurs when the sampling method systematically favors certain outcomes
simple random sample (SRS)
a sample of size n selected in such a way that every individual in the population has an equal chance of being selected and every possible subset of n individuals has an equal chance of being the sample (unbiased)
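A minimal Python sketch of drawing an SRS, assuming the population can be listed in full. The function name `simple_random_sample` and the population of ID numbers 1 to 100 are hypothetical, chosen only for illustration.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Draw n individuals so that every subset of size n is equally likely."""
    rng = random.Random(seed)  # a seed only makes the draw repeatable
    # sample() draws without replacement, so no individual is chosen twice
    return rng.sample(population, n)

population = list(range(1, 101))  # hypothetical population: ID numbers 1..100
sample = simple_random_sample(population, 10, seed=1)
print(sample)
```

Because the draw is made without replacement from the whole list, no individual or group of individuals is favored.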
stratified random sample
divide the population into groups of similar individuals called strata, choose a separate SRS from each stratum, and then combine all of those individuals chosen from all of the strata to make up the full sample (unbiased)
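The stratified procedure can be sketched in Python as follows. The strata names, the member labels, and the choice of the same sample size in every stratum are assumptions made for illustration; they are not part of the definition.

```python
import random

def stratified_sample(strata, n_per_stratum, seed=None):
    """Take a separate SRS from each stratum, then combine the results."""
    rng = random.Random(seed)
    sample = []
    for members in strata.values():
        # an independent SRS (without replacement) inside this stratum
        sample.extend(rng.sample(members, n_per_stratum))
    return sample

# hypothetical strata: students grouped by class year
strata = {
    "freshmen": [f"F{i}" for i in range(50)],
    "seniors": [f"S{i}" for i in range(30)],
}
sample = stratified_sample(strata, 5, seed=2)
print(sample)
```

Every stratum is guaranteed to appear in the final sample, which an SRS of the whole population cannot promise.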
cluster sample
divide the population into groups (or clusters), then randomly select some of these clusters (completely ignoring the others). All the individuals from chosen clusters are selected to be in the sample
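A short Python sketch of a cluster sample, assuming the clusters are stored in a dictionary. The homeroom names and student labels are hypothetical.

```python
import random

def cluster_sample(clusters, n_clusters, seed=None):
    """Randomly pick whole clusters and keep every individual in them."""
    rng = random.Random(seed)
    # choose which clusters to use; all other clusters are ignored entirely
    chosen = rng.sample(sorted(clusters), n_clusters)
    sample = [person for name in chosen for person in clusters[name]]
    return chosen, sample

# hypothetical clusters: four homerooms of five students each
clusters = {
    f"room{r}": [f"room{r}-student{i}" for i in range(5)] for r in range(4)
}
chosen, sample = cluster_sample(clusters, 2, seed=3)
print(chosen)
print(sample)
```

Unlike a stratified sample, randomness enters only in the choice of clusters; nobody inside a chosen cluster is left out.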
multistage sample design
select successively smaller groups within the population in stages, resulting in a sample consisting of clusters of individuals. Each stage may employ an SRS, or another type of sample
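One possible two-stage design, sketched in Python: stage 1 randomly selects clusters, and stage 2 takes an SRS inside each selected cluster. The use of exactly two stages, the school names, and the sample sizes are assumptions for illustration; a real multistage design may use more stages or other sampling methods at each stage.

```python
import random

def two_stage_sample(clusters, n_clusters, n_per_cluster, seed=None):
    """Stage 1: random clusters. Stage 2: an SRS within each chosen cluster."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)  # stage 1
    sample = []
    for name in chosen:
        sample.extend(rng.sample(clusters[name], n_per_cluster))  # stage 2
    return chosen, sample

# hypothetical clusters: six schools with twenty students each
clusters = {
    f"school{s}": [f"school{s}-student{i}" for i in range(20)] for s in range(6)
}
chosen, sample = two_stage_sample(clusters, 3, 4, seed=4)
print(chosen)
print(sample)
```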
systematic sample
sampling with a pattern (for example, inspecting every fifth bag of potato chips coming off an assembly line)
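The "every fifth bag" pattern can be sketched in Python with list slicing. The random starting point is an assumption added here (a common way to keep the pattern from always picking the same positions); the function name and the list of 50 bag numbers are hypothetical.

```python
import random

def systematic_sample(items, k, seed=None):
    """Take every k-th item, beginning at a random position in the first k."""
    rng = random.Random(seed)
    start = rng.randrange(k)  # assumed random start: index 0, 1, ..., k-1
    return items[start::k]

bags = list(range(1, 51))  # hypothetical: bags numbered 1..50 in line order
sample = systematic_sample(bags, 5, seed=5)
print(sample)
```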
probability sample
each member of the population has a known chance (greater than zero) to be selected
undercoverage
some groups in the population are left out of the process of choosing the sample; this happens when the sampling frame is not the same group as the population
nonresponse
an individual is chosen for the sample but cannot be contacted or refuses to cooperate
response bias
an individual in the sample gives the answer they think is best rather than the answer they truly believe
wording of question
leading questions push the individual being questioned toward one answer over another
sampling frame
group from which the sample is actually chosen
what must you do before you trust a poll result?
insist on knowing the exact questions asked, the rate of nonresponse, and the date and method of the survey
are larger samples or smaller samples better? why?
larger random samples are better because they give more accurate results than smaller samples: sample statistics vary less from sample to sample as the sample size increases
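A small simulation can illustrate this. The sketch below assumes a hypothetical population of 10,000 individuals, half of whom would answer "yes" (coded 1), and compares how much the sample proportion varies across repeated random samples of size 25 and size 400. The population, sample sizes, and number of repetitions are illustrative choices only.

```python
import random
import statistics

def spread_of_sample_proportions(population, n, reps, rng):
    """Standard deviation of the sample proportion over many SRSs of size n."""
    proportions = []
    for _ in range(reps):
        sample = rng.sample(population, n)  # one SRS of size n
        proportions.append(sum(sample) / n)
    return statistics.pstdev(proportions)

rng = random.Random(6)
population = [1] * 5000 + [0] * 5000  # hypothetical: 50% would answer "yes"

small = spread_of_sample_proportions(population, 25, 500, rng)
large = spread_of_sample_proportions(population, 400, 500, rng)
print(f"spread with n=25:  {small:.3f}")
print(f"spread with n=400: {large:.3f}")
```

The spread for the larger samples should come out noticeably smaller. Note that this only addresses variability: a larger sample does not fix bias from a poor sampling method.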
experimental units
individuals on which the experiment is being done (if units are people they are called subjects)