prediction from the linear model Flashcards

(12 cards)

1
Q

What is the difference between interpolation and extrapolation in prediction?

A

Interpolation: Predicting
Y within the observed range of X (safe).

Extrapolation: Predicting Y outside the observed range of X (risky, assumes linearity holds).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you calculate a point estimate for a new observation in R?

A

Use predict():

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you compute a confidence interval (CI) for the mean response?

A

y^p±tα/2,df×se(y^p)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between a confidence interval and a prediction interval?

A

CI: Uncertainty in the mean response (narrower).

PI: Uncertainty in an individual response (wider, includes residual variance).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do you calculate a prediction interval in R?

A

predict(model, newdata, interval = “prediction”, level = 0.99)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is the prediction interval wider than the confidence interval?

A

It accounts for:

Uncertainty in the estimated coefficients (same as CI).

Variability of individual observations around the mean (extra term).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the risks of extrapolation?

A

Assumes linearity continues beyond observed data (often untrue).

No data to validate predictions → high uncertainty.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does multiple regression prediction differ from simple regression?

A

Requires values for all predictors in the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is MSE and where do you find it in R output?

A

Mean Square Error: Average squared residual.

Sources:

ANOVA table: Mean Sq for Residuals.

summary(model)$sigma^2.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you interpret a 99% prediction interval of (106.7, 107.6) for IQ?

A

We are 99% confident that an individual with height=1.72m will have an IQ between 106.7 and 107.6.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is the CI narrowest at the mean of X?

A

Because
∑(xi−xˉ)2∑(xi−xˉ)2is minimized at xˉxˉ, reducing se(y^p)se(y^p).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the t-multiplier for a 95% CI with 98 degrees of freedom?

A

qt(0.975, df = 98) # ≈ 1.984

How well did you know this?
1
Not at all
2
3
4
5
Perfectly