Week 5 Flashcards
(36 cards)
What is biostatistics?
The application of statistical principles to biological, medical, and public health research to design studies and interpret data.
What is the difference between a population and a sample?
A population includes all individuals of interest, while a sample is a smaller group selected from the population to make inferences.
Define descriptive vs inferential statistics.
Descriptive: Summarise data (mean, median, SD); Inferential: Draw conclusions about a population using data from a sample.
What is a null hypothesis (H₀)?
A statement of no effect or difference (e.g., no difference in treatment outcomes).
What is an alternative hypothesis (H₁)?
A statement that suggests there is an effect or difference (e.g., treatment causes improvement).
What does a p-value represent?
The probability of observing the data (or more extreme) if the null hypothesis is true.
What does it mean when p < 0.05?
It indicates statistically significant evidence against the null hypothesis at the 5% significance level.
What is a t-test used for?
Comparing the means of two groups.
When is an independent t-test used?
When comparing means of two unrelated groups (e.g., treatment vs control).
When is a paired t-test used?
When comparing means from the same group at two time points (e.g., before and after).
What is a chi-square test used for?
To examine relationships between categorical variables.
What is ANOVA used for?
To compare the means of three or more groups.
What does correlation measure?
The strength and direction of a linear relationship between two numerical variables.
What is regression analysis used for?
Predicting a dependent variable from one or more independent variables.
What are the three sampling methods discussed?
Random, stratified, and cluster sampling.
What are confidence intervals?
Ranges that estimate a population parameter with a given level of confidence (e.g., 95%).
What affects the width of confidence intervals?
Sample size, variability, and confidence level.
What’s the difference between statistical and practical significance?
Statistical: based on p-values; Practical: whether the effect is meaningful in real life.
What are examples of categorical data?
Blood type, gender, disease status.
What are examples of numerical data?
Number of hospital visits, test scores.
What are examples of continuous data?
Height, weight, temperature.
What measures are used for skewed data?
Median and IQR (interquartile range).
What is standard deviation?
A measure of how spread out the data is around the mean.