Topic 13 Flashcards
(9 cards)
Multiple Regression
Linear regression with more than 2 variables (eg. book- weight, volume, paper vs hardback)
Factor
Categorical variable predictor
Reference Level
Levels are the different cateogorical values the factor can take, reference level coded as 0 other levels coded as 1 to compare
Collinear Explanatory Variables
Two predictor variables that are correlated, complicates the model estimation
Adjusted R^2
Accounts for the number of explanatory variables, applying a penalty for the number of preditors
Feature Seletction: Backwards Elimination- Adjusted R^2 Approach
- Start with the full model
- Drop one variable at a time and record adjusted R2 of each smaller model
- Pick the model with the highest increase in adjusted R2
- Repeat until none of the smaller models yield an increase in adjusted R2
Feature Selection: Backwards Elimination- p-value Approach
- Start with the full model
- Drop the variable with the highest p-value and refit a smaller model
- Repeat until all variables left in the model are significant
Feature Selection: AIC Approach
*
Similar to the adjusted R2 approach: penalises for adding more variables to the model.
*
AIC quantifies the amount of information loss due to simplification of the model (less information loss is better)