Gradient Boosting Trees Flashcards
(12 cards)
What is Gradient Boosting?
An ensemble technique where trees are built sequentially. Each new tree corrects the mistakes of the previous trees.
How does Gradient Boosting work?
Gradient boosting optimizes a loss function; at each iteration, a new tree is added to reduce the residual errors of the current ensemble.
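A minimal sketch of this loop using scikit-learn's GradientBoostingRegressor (the dataset here is synthetic, purely for illustration):

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor

# Synthetic regression data, just for illustration
X, y = make_regression(n_samples=200, n_features=5, random_state=0)

# Each of the 100 trees is fit to the residual errors of the ensemble
# built so far; learning_rate scales each new tree's contribution
model = GradientBoostingRegressor(n_estimators=100, learning_rate=0.1)
model.fit(X, y)
print(model.predict(X[:3]))
```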
What are the steps in gradient boosting?
Initial prediction
Residual Calculation
Tree Construction
Update
Repeat
What is the initial prediction in gradient boosting?
Starting with a simple prediction, usually the most frequent class for classification or the mean value for regression
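A one-line sketch of this step for regression, assuming a small NumPy array of targets:

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 9.0])   # toy target values

# Initial prediction: every sample starts at the mean of the targets
F0 = np.full_like(y, y.mean())
print(F0)   # [6. 6. 6. 6.]
```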
What is the residual calculation in gradient boosting?
Computing the residuals (errors), the differences between the actual and predicted values
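Continuing the toy example from the previous card (the arrays are illustrative):

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 9.0])   # actual values
pred = np.full_like(y, y.mean())     # current predictions (all 6.0)

# Residuals: the part of the signal the next tree will try to explain
residuals = y - pred
print(residuals)   # [-3. -1.  1.  3.]
```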
What is tree construction in gradient boosting?
A new tree is trained to predict the residuals of the current ensemble (the previous trees combined)
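A sketch of this step with scikit-learn's DecisionTreeRegressor, reusing the toy residuals from above:

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

X = np.array([[1.0], [2.0], [3.0], [4.0]])     # toy feature matrix
residuals = np.array([-3.0, -1.0, 1.0, 3.0])   # from the previous step

# The tree is fit to the residuals, not to the raw targets
tree = DecisionTreeRegressor(max_depth=2)
tree.fit(X, residuals)
```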
What is the update in gradient boosting?
The predictions are updated by adding the predictions of the new tree, multiplied by a learning rate to control the contribution of each tree
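In code the update is a single line; the 0.1 learning rate below is a common default, not a rule:

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

X = np.array([[1.0], [2.0], [3.0], [4.0]])
y = np.array([3.0, 5.0, 7.0, 9.0])

pred = np.full_like(y, y.mean())    # current predictions
tree = DecisionTreeRegressor(max_depth=2).fit(X, y - pred)

# Update: add the new tree's predictions, shrunk by the learning rate
learning_rate = 0.1
pred = pred + learning_rate * tree.predict(X)
```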
What is repeat in gradient boosting?
Continuing the process and adding trees until the model reaches a specified number of trees or achieves the desired level of accuracy
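Putting the five steps together, a minimal from-scratch sketch for squared-error regression (the function name and defaults are illustrative assumptions):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def gradient_boost(X, y, n_trees=100, learning_rate=0.1, max_depth=2):
    pred = np.full(len(y), y.mean())             # 1. initial prediction
    trees = []
    for _ in range(n_trees):                     # 5. repeat
        residuals = y - pred                     # 2. residual calculation
        tree = DecisionTreeRegressor(max_depth=max_depth)
        tree.fit(X, residuals)                   # 3. tree construction
        pred += learning_rate * tree.predict(X)  # 4. update
        trees.append(tree)
    return trees, pred
```

To predict on new data, you would start from y.mean() and add learning_rate times each stored tree's prediction.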
What are the advantages of gradient boosting?
High predictive power; it often outperforms random forests
Flexibility; it supports a variety of differentiable loss functions
Handles imbalanced data by focusing on difficult-to-predict instances
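The flexibility point in practice: scikit-learn exposes the loss as a single parameter (option names per recent scikit-learn versions):

```python
from sklearn.ensemble import GradientBoostingRegressor

# Swap the loss function without changing anything else about the model;
# "huber" is more robust to outliers than squared error
model = GradientBoostingRegressor(loss="huber")
# Other built-in options include "squared_error", "absolute_error",
# and "quantile"
```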
What are the limitations of gradient boosting?
Prone to overfitting
Longer training time (trees must be built sequentially)
Sensitive to hyperparameters; requires careful tuning of the number of trees, learning rate, and max depth
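A common mitigation for the overfitting and training-time issues is early stopping; a sketch using scikit-learn's built-in support (the parameter values are illustrative):

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor

X, y = make_regression(n_samples=500, n_features=10, random_state=0)

# Hold out 10% of the training data and stop adding trees once the
# validation score has not improved for 10 consecutive iterations
model = GradientBoostingRegressor(
    n_estimators=1000,         # upper bound, rarely reached
    learning_rate=0.05,
    max_depth=3,
    validation_fraction=0.1,
    n_iter_no_change=10,
)
model.fit(X, y)
print(model.n_estimators_)     # trees actually fit before stopping
```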
When should you use Gradient Boosting?
When accuracy is critical
When you have imbalanced data
When you have time for hyperparameter tuning
When you have complex problem spaces
Examples - healthcare; reportedly also defense (DoD)
When should you use Random Forest?
When you need a fast and robust model
When you have a large data set
When interpretability is important
When you need to avoid overfitting
Examples - banking
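For contrast, a random forest builds its trees independently, so training parallelizes easily; a minimal scikit-learn sketch on synthetic data:

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

X, y = make_regression(n_samples=200, n_features=5, random_state=0)

# Trees are trained independently on bootstrap samples, so n_jobs=-1
# fits them in parallel; defaults give a solid baseline with little tuning
model = RandomForestRegressor(n_estimators=100, n_jobs=-1)
model.fit(X, y)
```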