Research Module Flashcards by K C

What is the definition of epidemiology?

The quantitative study of the distribution, determinants, and control of health problems in human populations.

How well did you know this?

Not at all

Perfectly

Who is known as the “father of epidemiology” and why?

John Snow—he linked cholera outbreaks to a contaminated water pump on Broad Street, ending the epidemic.

How well did you know this?

Not at all

Perfectly

What are the three main aims of epidemiology?

(1) Describe disease distribution, (2) Identify etiological factors, (3) Provide data for planning and evaluating services.

How well did you know this?

Not at all

Perfectly

What is a sample in epidemiology?

A selected group meant to represent the larger population.

How well did you know this?

Not at all

Perfectly

Define simple random sampling.

Every individual in the population has an equal chance of being selected.

How well did you know this?

Not at all

Perfectly

What is stratified random sampling?

The population is divided into subgroups (e.g., age/gender), and random samples are drawn from each.

How well did you know this?

Not at all

Perfectly

What is cluster sampling?

Groups (not individuals) are randomly selected, e.g., tribes or clinics.

How well did you know this?

Not at all

Perfectly

What is convenience sampling?

Selecting participants who are easy to access, without ensuring representativeness.

How well did you know this?

Not at all

Perfectly

What is snowball sampling?

Participants recruit others from their network—used for hard-to-reach populations like drug users.

How well did you know this?

Not at all

Perfectly

Define selection bias.

Bias from differences in how participants are selected into the study.

How well did you know this?

Not at all

Perfectly

Define information bias.

Systematic errors in measurement or data collection (e.g., recall or interviewer bias).

How well did you know this?

Not at all

Perfectly

What is confounding bias?

A third variable distorts the true relationship between exposure and outcome.

How well did you know this?

Not at all

Perfectly

What is meant by “central tendency”?

A measure of the center of a distribution—mean, median, or mode.

How well did you know this?

Not at all

Perfectly

When is the median preferred over the mean?

When data are skewed or have outliers.

How well did you know this?

Not at all

Perfectly

What is variance?

The average squared deviation from the mean—indicates data spread.

How well did you know this?

Not at all

Perfectly

What is standard deviation?

The square root of variance—measures spread of individual data points around the mean.

How well did you know this?

Not at all

Perfectly

What is the standard error of the mean?

An estimate of how much the sample mean is likely to differ from the population mean.

How well did you know this?

Not at all

Perfectly

What is kurtosis?

A measure of how “fat” or “heavy” the tails of a distribution are.

How well did you know this?

Not at all

Perfectly

What is skewness?

A measure of asymmetry in the distribution.

How well did you know this?

Not at all

Perfectly

What is a null hypothesis (H₀)?

A statement that there is no difference or association—assumed true until disproven.

How well did you know this?

Not at all

Perfectly

What is an alternative hypothesis (H₁)?

The hypothesis that there is a real difference or association.

How well did you know this?

Not at all

Perfectly

What is a p-value?

The probability of obtaining the observed result if the null hypothesis is true.

How well did you know this?

Not at all

Perfectly

What does it mean if p < 0.05?

The result is statistically significant; we reject the null hypothesis.

How well did you know this?

Not at all

Perfectly

What is a Type I error (α)?

False positive—rejecting the null when it is actually true.

How well did you know this?

Not at all

Perfectly

What is a Type II error (β)?

False negative—not detecting a true difference.

What is statistical power?

The probability of correctly rejecting a false null hypothesis (Power = 1 – β).

What determines statistical power?

Sample size, effect size, and significance level (α).

What is a confidence interval (CI)?

A range within which the true population value is expected to lie, with a given level of confidence (usually 95%).

What is validity in a screening tool?

The extent to which the tool measures what it’s intended to.

Define sensitivity and specificity.

Sensitivity = true positives / (true positives + false negatives); Specificity = true negatives / (true negatives + false positives).

Define health informatics.

The discipline that stores, retrieves, shares and uses healthcare information, data and knowledge for communication and decision-making.

What does the abbreviation PAS stand for in Maltese hospital systems?

Patient Administration System – the core electronic registry of patient episodes.

HL7 messages are primarily used for what?

Standards-based electronic exchange of clinical and administrative data between health-information systems.

Which 19th-century physician is hailed as a founder of modern epidemiology for tracing a cholera outbreak to a London pump?

John Snow.

Give the textbook definition of epidemiology.

The study of the distribution and determinants of health-related states or events in specified populations, and the application of this study to the control of health problems.

Differentiate qualitative vs. quantitative research in one phrase each.

Qualitative = explores why & how via non-numerical data; Quantitative = measures how much/many using numerical data and statistics.

What is lead-time bias in screening?

Apparent survival prolongation caused by earlier detection rather than true delay in death.

Rank these designs from highest to lowest evidence for causality: cross-sectional, cohort, RCT.

RCT > Cohort > Cross-sectional.

State one key attribute that makes an RCT the ‘gold standard’.

Random allocation balances known and unknown confounders between groups.

When do you choose a paired t-test instead of an unpaired t-test?

When the same subjects provide two related measurements (e.g., before-and-after).

Name two situations that call for a non-parametric test.

(1) Data are ordinal or categorical. (2) Continuous data are skewed and cannot be normalised.

Which non-parametric test replaces an unpaired t-test?

The Mann–Whitney U (Wilcoxon rank-sum) test.

The χ² (chi-square) test assesses the association between which type of variables?

Two categorical variables.

Define confidence interval (CI) in one sentence.

A range of values calculated from the sample that is expected, with a given probability (e.g., 95 %), to contain the true population parameter.

Formula for a 95 % CI of the mean (large n, known σ).

Mean ± 1.96 × (SE) where SE = σ/√n.

Define standard deviation (SD) in one phrase.

The average distance of individual observations from the sample mean.

Simple random sampling guarantees what statistical property?

Every individual in the population has an equal probability of selection, minimising selection bias.

Instrument mis-calibration produces which class of bias?

Measurement (information) bias.

Define the three measures of central tendency.

Mean = arithmetic average; Median = middle value when ordered; Mode = most frequent value.

Which designs are prospective by nature?

Randomised controlled trials and most cohort studies.

Give two examples of categorical data and two of continuous data.

Categorical: blood group, gender. Continuous: serum glucose (mmol/L), height (cm).

What p-value threshold is conventionally considered statistically significant?

p < 0.05.

Which graph best displays the distribution of a single continuous variable?

Histogram.

Define positive predictive value (PPV).

Probability that a person with a positive test truly has the disease.

In an ROC curve, what does the area under the curve (AUC) represent?

Overall test accuracy; the probability a randomly chosen case ranks higher than a control.

What is the primary purpose of blinding in an RCT?

To prevent performance and ascertainment bias by keeping group allocation concealed.

Name two advantages of electronic health records over paper charts.

Real-time data access & easier longitudinal tracking of outcomes.

Categorise BMI ≥ 30 kg/m² (obesity) – is it nominal, ordinal, or interval?

Ordinal categorical (ordered classes).

Explain the term intention-to-treat analysis.

Outcomes are analysed according to original group assignment regardless of protocol adherence, preserving randomisation benefits.

Which statistical test compares survival curves between two groups?

The log-rank (Mantel–Cox) test.

Research Module Flashcards

(60 cards)