Correlation & regression Flashcards

(21 cards)

1
Q

What does the correlation coefficient r NOT tell you?

A

It does not describe slope, shape, or linearity of a relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When is correlation misleading?

A

When x and y are not independent (pseudoreplication).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is correlation inappropriate for non-linear data?

A

r only measures linear relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do outliers affect correlation?

A

Outliers can strongly distort r.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why can subgroups/clusters mislead correlation?

A

Groups can artificially inflate or mask the true association.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why shouldn’t you extrapolate correlation beyond your data range?

A

r is only valid within the observed x–y range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Does correlation imply causation?

A

No — correlation never proves causation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a statistical model?

A

A mathematical representation of a relationship: outcome = model + error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is assumed normally distributed in regression models?

A

The error, not necessarily the outcome variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are residuals?

A

Differences between each data point and the model prediction (e.g., mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the mean minimise in modelling?

A

The sum of squared residuals (SSR).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does simple linear regression model?

A

The linear relationship between y (dependent) and x (independent).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the slope coefficient b₁ represent?

A

Change in y for each 1-unit change in x.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does SST represent?

A

Total variability in the dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does SSM represent?

A

Variability explained by the regression model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does SSR represent?

A

Residual unexplained variability (error).

17
Q

What is R²?

A

The proportion of total variability in y explained by x: R² = SSM / SST.

18
Q

How is model fit tested?

A

Using an F-ratio: F = MS_model / MS_residual.

19
Q

How is the significance of the slope b₁ tested?

A

t = b₁ / SE(b₁), with df = N − 2.

20
Q

What does homoscedasticity mean?

A

Residual variance is similar across all x values.

21
Q

Name three ways to reduce bias in a study.

A

Double-blinding, coded treatments, coded analysis.