Topic 16 Flashcards

(3 cards)

1
Q

Cross-validation

A

Partition the data available for training into a training set and a validation set, so that overfitting can be diagnosed without touching the real test data.
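
A minimal sketch of that partition, using only the Python standard library. The function name `train_validation_split`, the 80/20 ratio, and the fixed seed are illustrative assumptions, not part of the card.

```python
import random


def train_validation_split(samples, validation_fraction=0.2, seed=0):
    """Shuffle samples and split them into (training, validation) lists."""
    if not 0.0 < validation_fraction < 1.0:
        raise ValueError("validation_fraction must be strictly between 0 and 1")
    shuffled = list(samples)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_validation = max(1, int(round(len(shuffled) * validation_fraction)))
    # The first n_validation shuffled items are held out; the rest are for fitting.
    return shuffled[n_validation:], shuffled[:n_validation]


# Hypothetical data: ten sample ids, two of them held out for validation.
train, validation = train_validation_split(range(10), validation_fraction=0.2)
```

A model would be fitted on `train` only, and a large gap between its training score and its score on `validation` would point to overfitting.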

2
Q

Why not use test data to choose K?

A

If K is tuned on the test data, the test score becomes optimistically biased, so it no longer tells us how the model will perform on truly unseen, unlabeled data.
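
A small sketch of the alternative: pick K on a validation set and touch the test set only once at the end. It assumes, purely for illustration, that K is the number of neighbors in a 1-D nearest-neighbor classifier; the card does not say which K is meant. The helper names and the toy data are hypothetical.

```python
from collections import Counter


def knn_predict(train, x, k):
    """Return the majority label among the k training points nearest to x."""
    nearest = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(train, data, k):
    """Fraction of (x, label) pairs in data that the k-NN rule gets right."""
    return sum(knn_predict(train, x, k) == y for x, y in data) / len(data)


def choose_k(train, validation, candidates):
    """Pick the candidate K with the best accuracy on the validation set."""
    return max(candidates, key=lambda k: accuracy(train, validation, k))


# Toy (feature, label) pairs, invented for this example.
train = [(0.0, "a"), (1.0, "a"), (2.0, "a"), (2.5, "b"),
         (7.0, "b"), (8.0, "b"), (9.0, "b")]
validation = [(1.5, "a"), (2.4, "a"), (7.5, "b")]
test = [(0.5, "a"), (8.5, "b")]
candidates = [1, 3, 5]

best_k = choose_k(train, validation, candidates)  # the test set is not consulted here
test_accuracy = accuracy(train, test, best_k)     # evaluated once, after K is fixed
```

Because `test` plays no part in choosing `best_k`, `test_accuracy` remains an honest estimate of performance on new data.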

3
Q

Bias and Variance Depending on the Chosen K Value for Validation

A

Low K: high bias, low variance. High K: low bias, high variance.
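
One way to see where the bias side comes from, assuming K here means the number of folds in K-fold cross-validation: the sketch below generates fold indices and shows how the training portion grows with K. The function name `k_fold_indices` and the sample count of 10 are illustrative assumptions.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for k contiguous folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, validation
        start += size


# Training-set size per fold for a few values of K on 10 samples.
train_sizes = {k: [len(tr) for tr, _ in k_fold_indices(10, k)] for k in (2, 5, 10)}
```

With a low K each model sees a smaller share of the data, so the error estimate tends to be pessimistic (higher bias). With a high K the training sets overlap almost entirely, which is the usual explanation for the higher variance of the estimate.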
