Week 5 Flashcards by Dakota Wilckens

What is biostatistics?

The application of statistical principles to biological, medical, and public health research to design studies and interpret data.

How well did you know this?

Not at all

Perfectly

How well did you know this?

Not at all

Perfectly

What is the difference between a population and a sample?

A population includes all individuals of interest, while a sample is a smaller group selected from the population to make inferences.

How well did you know this?

Not at all

Perfectly

Define descriptive vs inferential statistics.

Descriptive: Summarise data (mean, median, SD); Inferential: Draw conclusions about a population using data from a sample.

How well did you know this?

Not at all

Perfectly

What is a null hypothesis (H₀)?

A statement of no effect or difference (e.g., no difference in treatment outcomes).

How well did you know this?

Not at all

Perfectly

What is an alternative hypothesis (H₁)?

A statement that suggests there is an effect or difference (e.g., treatment causes improvement).

How well did you know this?

Not at all

Perfectly

What does a p-value represent?

The probability of observing the data (or more extreme) if the null hypothesis is true.

How well did you know this?

Not at all

Perfectly

What does it mean when p < 0.05?

It indicates statistically significant evidence against the null hypothesis at the 5% significance level.

How well did you know this?

Not at all

Perfectly

What is a t-test used for?

Comparing the means of two groups.

How well did you know this?

Not at all

Perfectly

When is an independent t-test used?

When comparing means of two unrelated groups (e.g., treatment vs control).

How well did you know this?

Not at all

Perfectly

When is a paired t-test used?

When comparing means from the same group at two time points (e.g., before and after).

How well did you know this?

Not at all

Perfectly

What is a chi-square test used for?

To examine relationships between categorical variables.

How well did you know this?

Not at all

Perfectly

What is ANOVA used for?

To compare the means of three or more groups.

How well did you know this?

Not at all

Perfectly

What does correlation measure?

The strength and direction of a linear relationship between two numerical variables.

How well did you know this?

Not at all

Perfectly

What is regression analysis used for?

Predicting a dependent variable from one or more independent variables.

How well did you know this?

Not at all

Perfectly

What are the three sampling methods discussed?

Study These Flashcards

Random, stratified, and cluster sampling.

What are confidence intervals?

Study These Flashcards

Ranges that estimate a population parameter with a given level of confidence (e.g., 95%).

What affects the width of confidence intervals?

Study These Flashcards

Sample size, variability, and confidence level.

What’s the difference between statistical and practical significance?

Study These Flashcards

Statistical: based on p-values; Practical: whether the effect is meaningful in real life.

What are examples of categorical data?

Study These Flashcards

Blood type, gender, disease status.

What are examples of numerical data?

Study These Flashcards

Number of hospital visits, test scores.

What are examples of continuous data?

Study These Flashcards

Height, weight, temperature.

What measures are used for skewed data?

Study These Flashcards

Median and IQR (interquartile range).

What is standard deviation?

Study These Flashcards

A measure of how spread out the data is around the mean.

Why is standard deviation preferred over variance?

It is in the same units as the original data, making it easier to interpret.

When critically appraising a research article, what are some key questions you should ask about the study design, sample size, and statistical methods used?

Is the study design appropriate? Is the sample size large enough and representative? Are the statistical methods appropriate for the data type and study aim? Are p-values and confidence intervals reported and interpreted correctly?

Provide examples of categorical, numerical, and continuous data

Categorical: blood type (A/B/O), gender Numerical (discrete): number of children Continuous: height, cholesterol level

Briefly describe the three different sampling methods discussed in this lecture

Random: equal chance of selection Stratified: divide into subgroups (e.g., by age), sample from each Cluster: divide into clusters (e.g., schools), then sample whole clusters

* What type of data are the mean, mode, and median best used for?

Mean: for symmetrical, numerical data Median: for skewed data Mode: for categorical data

Interpret the following statement: "The results of the study were statistically significant (p < 0.05)." What does this statement mean in the context of hypothesis testing? What does it not mean?

There is less than a 5% chance the observed results are due to random variation alone if the null is true. It does not mean the effect is large or important, nor does it prove the hypothesis is true.

What is the purpose of a chi-square test? Describe a scenario where a chi-square test would be an appropriate statistical method to use.

Tests whether two categorical variables are related. Example: Is smoking status associated with disease outcome?

Explain the purpose of a t-test. Under what circumstances would you use an independent samples t-test versus a paired samples t-test?

Tests if means differ between two groups. Use independent for unrelated groups; paired for repeated measures (same group pre/post).

Explain the difference between variance and standard deviation. Why is the standard deviation often preferred over the variance when describing the spread of data?

Variance: average squared deviation SD: square root of variance (same units as data, easier to interpret)

* List and briefly describe some of the common statistical tests

t-test: compare two means ANOVA: compare 3+ means Chi-square: test categorical associations Correlation: assess relationship strength Regression: prediction and relationship modeling

* Briefly describe the 3 factors impacting confidence intervals

Sample size (larger = narrower CI) Variability (more = wider CI) Confidence level (higher = wider CI)

Explain the difference between statistical significance and practical significance. Why is it important to consider both when interpreting research findings?

Statistical: relies on p-values and chance Practical: real-world relevance Both are needed to fully interpret findings (e.g., small but significant change may be clinically irrelevant)

Week 5 Flashcards

(36 cards)