Topic 16 Flashcards by Unknown Unknown

Crossvalidation

Partition the data available for training into a training dataset and a validation dataset, allowing us to diagnose overfitting without seeing the real test data

How well did you know this?

Not at all

Perfectly

Why not use test data to choose K?

We won’t correctly evaluate the model’s performance when we have truly unlabeled data

How well did you know this?

Not at all

Perfectly

Bias and Variance Depending on the Chosen K Value for Validation

Low K: high bias, low variance, High K: low bias, high variance

How well did you know this?

Not at all

Perfectly

Topic 16 Flashcards

(3 cards)