Topic 16 Flashcards
(3 cards)
1
Q
Cross-validation
A
Partition the labeled data available for training into a training dataset and a validation dataset. Fitting on the training part and scoring on the validation part lets us diagnose overfitting without seeing the real test data.
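A minimal sketch of such a partition, using only the Python standard library. The function name `train_validation_split`, the 20% validation fraction, and the fixed seed are illustrative assumptions, not part of the card.

```python
import random


def train_validation_split(rows, validation_fraction=0.2, seed=0):
    """Shuffle a copy of `rows` and hold out a fraction as the validation set."""
    if not 0.0 < validation_fraction < 1.0:
        raise ValueError("validation_fraction must be strictly between 0 and 1")
    shuffled = list(rows)
    # A seeded generator keeps the split reproducible (assumed choice).
    random.Random(seed).shuffle(shuffled)
    n_val = max(1, round(len(shuffled) * validation_fraction))
    # The first n_val shuffled rows become validation, the rest training.
    return shuffled[n_val:], shuffled[:n_val]


train, val = train_validation_split(range(10), validation_fraction=0.2)
```

A model would then be fit on `train` only and scored on `val`; a large gap between the two scores is the usual sign of overfitting.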
2
Q
Why not use test data to choose K?
A
If K is tuned on the test data, the test score becomes an optimistic estimate, so we won't correctly evaluate how the model performs on truly unseen, unlabeled data
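A hedged sketch of the alternative: choose K on a validation set and touch the test set only once at the end. It assumes K is the number of neighbors in a k-nearest-neighbors classifier (the card does not say which K it means); the toy 1-D data and the helper names `knn_predict`, `accuracy`, and `choose_k` are made up for illustration.

```python
from collections import Counter


def knn_predict(train, x, k):
    """Majority label among the k training points nearest to x (1-D features)."""
    nearest = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(train, data, k):
    """Fraction of (x, label) pairs in `data` that are predicted correctly."""
    return sum(knn_predict(train, x, k) == y for x, y in data) / len(data)


def choose_k(train, validation, candidates):
    """Return the candidate k with the best validation accuracy (first on ties)."""
    return max(candidates, key=lambda k: accuracy(train, validation, k))


# Toy data: the point (2, "b") is a deliberately noisy label.
train = [(0, "a"), (1, "a"), (2, "b"), (3, "a"),
         (6, "b"), (7, "b"), (8, "b"), (9, "b")]
validation = [(1.8, "a"), (2.4, "a"), (7.5, "b")]
test = [(0.5, "a"), (8.5, "b")]

best_k = choose_k(train, validation, candidates=[1, 3, 5])
# The test set is used exactly once, after K has been fixed.
test_accuracy = accuracy(train, test, best_k)
```

Because `test` played no part in picking `best_k`, `test_accuracy` remains an honest estimate for new data.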
3
Q
Bias and Variance of the Validation Estimate Depending on the Chosen K (Number of Folds)
A
Low K: high bias, low variance. High K: low bias, high variance.
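A small sketch of where the trade-off comes from, assuming K here is the number of folds in K-fold cross-validation. Each model is trained on (K-1)/K of the data: with low K that fraction is small, so the estimate tends to be pessimistic (bias); with high K the training sets overlap almost entirely, which is the usual textbook explanation for the higher variance. The helper `k_fold_indices` is a hypothetical name.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for each of k contiguous folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size


# Fraction of the data each model is trained on, for a few values of K.
n = 10
train_fraction = {
    k: len(next(k_fold_indices(n, k))[0]) / n
    for k in (2, 5, 10)
}
```

With `n = 10`, K = 2 trains each model on half the data, while K = 10 (leave-one-out here) trains on nine tenths of it.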