correlation Flashcards

(14 cards)

1
Q

What does Pearson’s correlation coefficient (r) measure?

A

Range:

−1 (perfect negative) to

+1 (perfect positive).

Interpretation:
r=0: No linear relationship.

r=0.868 (e.g., LLL vs. height): Strong positive relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is covariance different from correlation?

A

Covariance: Measures direction but not strength (units-dependent).

Correlation: Standardized covariance (unitless, −1≤r≤1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you test if a correlation is significant?

A

H0:ρ=0 (no correlation).

H1:ρ≠0

t stat = rsqrt(n-2) / sqrt(1-r^2)

cor.test(x, y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Does correlation imply causation?

A

No! Possible explanations:

Chance (spurious correlation).

Confounding

True causation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the simple linear regression equation?

A

Yi=β0+β1Xi+ϵi

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the least squares criterion?

A

Minimizes the sum of squared residuals:
SSRes = ∑(yi−y^i)^2

Residuals: Vertical distances from data points to the regression line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is model variance (σe2) estimated?

A


= 1/n−2 ∑(yi−y^i)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does R-squared (R^2) measure?

A

Proportion of variance in Y explained by X:

R^2 = SSmodel / SStotal = 1 - SSred/SStotal

Example: R^2=0.75 → 75% of variation in Y is explained by X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When is the intercept (β0) meaningless?

A

When X=0 is outside the data range (e.g., negative folate levels).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you predict values using regression?

A

Plug X into the fitted equation: Y^= β^0+β^1X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the assumptions of linear regression?

A

Linearity: Relationship between X and Y is linear.

Independence: Residuals are uncorrelated.

Homoscedasticity: Constant residual variance.

Normality: Residuals∼N(0,σ2)∼N(0,σ2).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do you check regression assumptions in R?

A

plot(model) # Check:
1. Residuals vs. Fitted (linearity).
2. Q-Q Plot (normality).
3. Scale-Location (homoscedasticity).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the F-test in regression used for?

A

Tests if the model explains significant variance:
H0: All slopes = 0.

H1 : At least one slope ≠0

R output: F-statistic and p-value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why avoid extrapolation in regression?

A

Predictions outside the observed
X
X range may be invalid (e.g., SST = 150°C).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly