Overfitting Flashcards

1
Q

Name the 3 parts we split the dataset into.

A
  1. Training
  2. Validation
  3. Test
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Overfitting?

A

When we have a model that scores well on training but poorly on validation and test datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Underfitting?

A

Where we have not captured the logic of the underlying data, leading to poor predictive power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When is N-fold Cross Validation used?

A

When you have smaller amounts of data to be used to train on

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Early Stopping?

A

A technique to prevent overfitting

stop training early before we overfit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we implement Early Stopping?

A
  1. Train for a preset number of epochs
  2. (better) stop when the loss function updates become too small
  3. (best) Validation set Strategy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Validation Set Strategy?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly