07. Quantitative methods Flashcards
(24 cards)
Primary data involves the collection or generation of data with a ___.
specific project or task in mind
Secondary data is collected for distribution and use by ___.
other interested parties
Cross-section data are onservations that come from ___ or ___ at a ___.
- different individuals or groups
- single point in time
Time-series data is a set of observations usually collected at ___ & ____ time intervals e.g. ___.
- discrete & equally spaced
- e.g. annually
A population is ___ members of a well-defined ___ or ___.
(Can be ___ or ___).
- all
- set or group
- finite or infinite
A sample is a ___ of a population.
subset
A parameter is a ___ describing a ___ e.g. the population mean.
- number
- whole population
A statistic is a number describing a ___
e.g. the ___ mean.
sample x2
We often have to work with samples because it is ___ to gather data on ___.
- cost-effective
- whole populations
What is it that is desirable for a sample to appropriately represent?
The key features of a population.
For a sample to be unbiased, what must it be?
Representative of the population.
Random vs non-random sampling
- Random: every member of a population has a chance of being selected for the sample.
- Non-random: involves some element of judgement in selecting the sample.
What is the issue if a sample is too small?
It may bias the population estimate.
Sampling methods can be divided into probability & non-probability methods.
Probability methods?
Have a known probability for each member of the population to be selected.
Name 3 techniques included in sampling probability methods.
- Random
- Systematic
- Stratified
Random sampling
Each member of population has equal & known chance of being selected.
Systematic sampling
Every nth record is selected from a list of population members.
Stratified sampling
1st identify characteristics of population that are already known, then selecting a random sample to represent those characteristics in the correct proportion.
Why is stratified sampling often considered superior to random sampling?
Because it reduces sampling error - the possibility of selecting an unrepresentative sample.
Sampling methods can be divided into probability & non-probability methods.
Non-probability sampling?
Members are selected from the population in some non-random manner.
Name 4 examples of non-probability sampling.
- Convenience sampling
- Judgement sampling
- Quota sampling (the non-probability equivalent of stratified sampling)
- Snowball sampling (may be used if the desired sample characteristic is very rare)
Pro & con of snowball sampling
- Reduces search costs
- Introduces bias as it reduces the likelihood that the sample will represent a good cross-section from the overall population.
Continuous data
Can take any value in an interval on the line from minus infinity to plus infinity.
Discrete data
Can only take a finite number of values.