Week 5 Flashcards

(36 cards)

1
Q

What is biostatistics?

A

The application of statistical principles to biological, medical, and public health research to design studies and interpret data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between a population and a sample?

A

A population includes all individuals of interest, while a sample is a smaller group selected from the population to make inferences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define descriptive vs inferential statistics.

A

Descriptive: Summarise data (mean, median, SD); Inferential: Draw conclusions about a population using data from a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a null hypothesis (H₀)?

A

A statement of no effect or difference (e.g., no difference in treatment outcomes).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an alternative hypothesis (H₁)?

A

A statement that suggests there is an effect or difference (e.g., treatment causes improvement).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does a p-value represent?

A

The probability of observing the data (or more extreme) if the null hypothesis is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does it mean when p < 0.05?

A

It indicates statistically significant evidence against the null hypothesis at the 5% significance level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a t-test used for?

A

Comparing the means of two groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When is an independent t-test used?

A

When comparing means of two unrelated groups (e.g., treatment vs control).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When is a paired t-test used?

A

When comparing means from the same group at two time points (e.g., before and after).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a chi-square test used for?

A

To examine relationships between categorical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is ANOVA used for?

A

To compare the means of three or more groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does correlation measure?

A

The strength and direction of a linear relationship between two numerical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is regression analysis used for?

A

Predicting a dependent variable from one or more independent variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the three sampling methods discussed?

A

Random, stratified, and cluster sampling.

16
Q

What are confidence intervals?

A

Ranges that estimate a population parameter with a given level of confidence (e.g., 95%).

17
Q

What affects the width of confidence intervals?

A

Sample size, variability, and confidence level.

18
Q

What’s the difference between statistical and practical significance?

A

Statistical: based on p-values; Practical: whether the effect is meaningful in real life.

19
Q

What are examples of categorical data?

A

Blood type, gender, disease status.

20
Q

What are examples of numerical data?

A

Number of hospital visits, test scores.

21
Q

What are examples of continuous data?

A

Height, weight, temperature.

22
Q

What measures are used for skewed data?

A

Median and IQR (interquartile range).

23
Q

What is standard deviation?

A

A measure of how spread out the data is around the mean.

24
Why is standard deviation preferred over variance?
It is in the same units as the original data, making it easier to interpret.
25
When critically appraising a research article, what are some key questions you should ask about the study design, sample size, and statistical methods used?
Is the study design appropriate? Is the sample size large enough and representative? Are the statistical methods appropriate for the data type and study aim? Are p-values and confidence intervals reported and interpreted correctly?
26
Provide examples of categorical, numerical, and continuous data
Categorical: blood type (A/B/O), gender Numerical (discrete): number of children Continuous: height, cholesterol level
27
Briefly describe the three different sampling methods discussed in this lecture
Random: equal chance of selection Stratified: divide into subgroups (e.g., by age), sample from each Cluster: divide into clusters (e.g., schools), then sample whole clusters
28
* What type of data are the mean, mode, and median best used for?
Mean: for symmetrical, numerical data Median: for skewed data Mode: for categorical data
29
Interpret the following statement: "The results of the study were statistically significant (p < 0.05)." What does this statement mean in the context of hypothesis testing? What does it not mean?
There is less than a 5% chance the observed results are due to random variation alone if the null is true. It does not mean the effect is large or important, nor does it prove the hypothesis is true.
30
What is the purpose of a chi-square test? Describe a scenario where a chi-square test would be an appropriate statistical method to use.
Tests whether two categorical variables are related. Example: Is smoking status associated with disease outcome?
31
Explain the purpose of a t-test. Under what circumstances would you use an independent samples t-test versus a paired samples t-test?
Tests if means differ between two groups. Use independent for unrelated groups; paired for repeated measures (same group pre/post).
32
Explain the difference between variance and standard deviation. Why is the standard deviation often preferred over the variance when describing the spread of data?
Variance: average squared deviation SD: square root of variance (same units as data, easier to interpret)
33
* List and briefly describe some of the common statistical tests
t-test: compare two means ANOVA: compare 3+ means Chi-square: test categorical associations Correlation: assess relationship strength Regression: prediction and relationship modeling
34
* Briefly describe the 3 factors impacting confidence intervals
Sample size (larger = narrower CI) Variability (more = wider CI) Confidence level (higher = wider CI)
35
Explain the difference between statistical significance and practical significance. Why is it important to consider both when interpreting research findings?
Statistical: relies on p-values and chance Practical: real-world relevance Both are needed to fully interpret findings (e.g., small but significant change may be clinically irrelevant)