- Nominal (qualitative) - Ordinal (qualitative) - Interval (quantitative) - ratio (quantitative)

- Likert scale: strongly agree/disagree - Semantic differential: Cold warm TREATED AS INTERVAL/RATIO so that you can use regression

L4 Statistical techniques and sampling designs Flashcards by Sonny Fridael

Descriptive statistics

Methods of summarizing the data in an informative way

central tendency: mean, median, mode
dispersion: range, stdev, variance, interquartile range

How well did you know this?

Not at all

Perfectly

Inferential statistics

Methods to draw conclusions (or to make inferences, test hypotheses)
• Mean difference test
• Chi-square test
• Analysis of variance (ANOVA)
• Regression analysis
• Logit analysis

How well did you know this?

Not at all

Perfectly

Four types of scales

Nominal (qualitative)
Ordinal (qualitative)
Interval (quantitative)
ratio (quantitative)

How well did you know this?

Not at all

Perfectly

Nominal scale

allows classifying data into groups/categories

e.g. gender

How well did you know this?

Not at all

Perfectly

Ordinal scale

rank orders in a meaningful way

e.g. education level

How well did you know this?

Not at all

Perfectly

Interval scale

Meaningful differences between values, but no natural zero point –> zero means something (0 degrees)

How well did you know this?

Not at all

Perfectly

Ratio scale

Meaningful differences and ratios between values due to a natural zero point –> zero is actually nothing (0 dollar is no money)

How well did you know this?

Not at all

Perfectly

Choosing between inferential statistics:

IV=nominal/ordinal DV=nominal/ordinal

Chi-square test

How well did you know this?

Not at all

Perfectly

Choosing between inferential statistics:

IV=nominal/ordinal DV=interval/ratio

T-test, Anova

How well did you know this?

Not at all

Perfectly

Choosing between inferential statistics:

IV=interval/ratio DV=nominal/ordinal

logit analysis

How well did you know this?

Not at all

Perfectly

Choosing between inferential statistics:

IV=interval/ratio DV=interval/ratio

regression analysis

How well did you know this?

Not at all

Perfectly

When to perform T-Test vs Anova

T-Test –> compare two means (two levels of IV)

Anova –> compare more than two levels

How well did you know this?

Not at all

Perfectly

Rating scales

Likert scale: strongly agree/disagree
Semantic differential: Cold warm

TREATED AS INTERVAL/RATIO so that you can use regression

How well did you know this?

Not at all

Perfectly

What is a population?

Entire group of people, firms, events, or things of interest for which you would like to make inferences

How well did you know this?

Not at all

Perfectly

What is a sample?

A subset of the population of interest

How well did you know this?

Not at all

Perfectly

What is a subject?

Study These Flashcards

Single member

What is low representativeness?

Study These Flashcards

= properties of the population are over- or underrepresented in the sample
= high sampling error

The sampling process

Study These Flashcards

define population
determine sampling frame
determine sampling design
determine sample size

define population

Study These Flashcards

e.g. students TISEM, dutch organ donors

determine sampling frame

Study These Flashcards

“Physical” representation of the target population

- where you can reach out to e.g. Donorregister

coverage error

Study These Flashcards

sampling frame ≠ population
• Under-coverage: true population members are excluded
• Miss-coverage: non-population members are included

solutions to coverage error

Study These Flashcards

If small, recognize but ignore

* If large, redefine the population in terms of the sampling frame

determine sampling design

Study These Flashcards

probability vs non-probability sampling

Probability sampling

Study These Flashcards

Each element of the population has a known chance
of being selected as a subject

–>Results generalizable to population
BUT more time and resource intensive

Nonprobability sampling

The elements of the population do not have a known chance of being selected as a subject --> less time and resource intensive BUT results not generalizable to population

Probability sampling techniques

- Simple random sampling (SRS) - Systematic sampling - Stratified sampling - Cluster sampling

Simple random sampling (SRS)

Each population element has an equal chance of being chosen e.g. out of a hat --> Highest generalizability BUT costly?

Systematic sampling

Select random starting point and then pick every nth element --> simplicity BUT low generalizability if there happens to be a systematic difference between every nth observation

Stratified sampling

Divide the population in meaningful (homogenous) groups, then apply SRS within each group e.g. level of income --> All groups are adequately sampled, allowing for group comparisons BUT more time consuming and Requires homogenous subgroups

Cluster sampling

Divide the population in heterogeneous groups, randomly select a number of groups and select each member within these groups e.g. geographic clusters (areas) --> Geographic clusters BUT Subsets of naturally occurring clusters are typically more homogeneous than heterogeneous

Nonprobability sampling

- Convenience sampling - Quota sampling - Judgment sampling - Snowball sampling

Convenience sampling

Select subjects who are conveniently available e.g. random on the street --> Convenient (inexpensive and fast) BUT lower generalizability

Quota sampling

Fix quota for each subgroup (percentage in population) --> When minority participation is critical BUT lower generalizability

Judgment sampling

Select subjects based on their knowledge/professional judgment e.g. experts --> Convenient (inexpensive and fast) when a limited # of people has the info you need BUT Lower generalizability

Snowball sampling

“Do you know people who...” e.g. people with rare disease --> For rare characteristics (“experts”) BUT first participants strongly influence the sample

Rules of thumb for sample size

• Sample size ≥ 75, < 500 • Multivariate research: ≥ 10 x parameters to be estimated • Subsamples (e.g., male/female): ≥ 30 per subsample

L4 Statistical techniques and sampling designs Flashcards

(36 cards)