Week 9 Flashcards
Motivate ensemble methods
Idea of ensemble methods
What is Bagging
Bootstrap aggregating
What is boosting?
Bagging - process
Goal of boosting
Boosting procedure
AdaBoost
AdaBoost advantages/disadvantages
AdaBoost algo
Stacked generalisation
Parallel structure for ensemble classifiers
Serial structure for ensemble classifiers
Hierarchical structure for ensemble classifiers
Random forest
(Random forest is ensemble of BAGGED decision trees, with randomised feature selection)
Training decision tree
Notes on decision trees
Decisions at nodes could be more complex
Prediction performance can be poor (tends to overfit)
Unstable (small changes to training set cause large changes in classification accuracy. although bagging improves stability)
Constructing binary decision tree algorithm
Randomised tree learning procedures algo
Advantages of random forests
Fast (scalable)
Accurate
Simple to implement
Popular