Model 2: Linear Regression Models Flashcards

(14 cards)

1
Q

Model Building Principles

A
  1. Consider all predictor variables that may be important in describing the outcome.
  2. Check for multicollinearity
  3. Consider possible interactions between variables.
  4. Think about expected value of predictors.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What model selection approaches are covered in STAT210?

A

Gelman and Hill
Backward/Forward/Sidewise-Step Selection
Akiake’s Information Criterion (AIC)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe the Gelmen & Hill Approach

A

Look at the statistical significance and sign of estimated effect.
- significant & expected = keep
- non-significant & expected = find to keep unless interaction
- significant & unexpected = possible lurking variables
- non-significant & unexpected = remove.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe Backwards Selection

A
  1. Start with all possible predictors
  2. Remove predictor with largest p-value
  3. Refit model
  4. Repeat unit all predictors included are significant.

remember if the interaction is keep, the interactions main effect must be keep

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe Forward Selection

A
  1. Start with the basic model (no predictors in model)
  2. Consider all possible predictor variables at one time. Include the predictor with the smallest p-value
  3. Consider all possible predictors, minus the previous. Include the predictor with the smallest p-value.
  4. Repeat until no significant predictors are left.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe Sidewise-Step Selection

A

A combination of backward and forward selection, ensure the model is refit with each change.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Describe AIC Selection

A

1) Narrow down selection to smallest number of possible models.
2) Specify the basic and full model, include models between the two.
3) Choose the model with the smallest AIC score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does a high/low/similar score in AIC mean?

A

High score means the model has a lot of parameters.
Low score means the model is a good fit.
AIC with similar scores mean the models are similarly a good fit. Discussing which model to choose may depend on whether you want to prioritise simplicity or covariates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Cons and Benefits of Gelmen & Hill Approach

A

This approach is a helpful guideline for model building and invites discussion about variables. However, this method can be easily abused.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Cons and Benefits of Backward/Forward/Sidewise-Step Approach

A

These approaches are easy and fast, however, each method can decide different models are best.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Cons and Benefits of AIC Approach

A

This approach thinks about the model upfront, but may lead to multiple models being a possible good fit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are additive models?

A

When explanatory/predictor variables are summed, which means the individual independent effects are summed. Visually there should only be one slope on the plot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly