8.1 Model Evaluation 2 Flashcards

1
Q

What is model bias?

A

Model bias in classification context:

  • Label predictions can’t be “too high” or “too low”
  • “biased towards the majority class” means our model predicts too many instances as the majority class
  • Typically compare the class distribution:
  • An unbiased classifier produces labels with the same distribution as the actual distribution
  • A biased classifier produces labels with a different distribution from the actual distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is model variance?

A

Model variance relates to the tendency of different training sets to produce different models or predictions with the same type of learner

  • A model has high variance if a different randomly sampled training set leads to very different predictions on the evaluation set
  • A model has low variance if a different randomly sampled training set leads to similar predictions, independent of whether the predictions are correct
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which of the following is more harmful for the performance on test set than training set?

A
  • Model variance

The model variance is high when different randomly sampled training sets lead to very different predictions on the test set. The high variance indicates that the model overfits to training set. In this case, the training error may decrease, but test error will increase.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

During training process, if your model shows significantly different performance across different training sets, which of the following is NOT a valid way to reduce this variance?

A
  • Improve optimisation algorithm used for error minimisation

Improving your optimisation algorithm would decrease the bias. To reduce variance, using the other three options would be helpful. To decrease model complexity, you can consider reducing the number of features or using regularisations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Given a model y=θ01x , after adding more basis functions to this model, it becomes y=θ01x+…+θnxn . Adding more basis functions can:

A

Adding more basis function would increase the model complexity, which would lead to decrease in model bias and increase in variance. The following figure illustrates the relationship of model complexity and model bias and variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the possible solutions to reduce evaluation variance?

A
  • Repeated random subsampling and run multiple evaluations
  • Increase the holdout partition size for test set
  • Use K-fold cross-validation

To reduce evaluation variance, we can increase the size of test set, or evaluate multiple times using repeated random subsampling or K-fold cross-validation, and get the average performance across different runs. The stratification generates training and test sets that contain approximately the same distribution of class labels as the overall set. The stratification can help to reduce the bias.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly