Model Evaluation, Hyperparameter Tuning, Classification & Regression Metrics Flashcards

1
Q

What is the solution to model evaluation problems?

A

Split the data into training, validation, and test sets.

2
Q

What is a Training Set?

A

Used to train the model.

3
Q

What is a Validation Set?

A

Used during training to tune hyperparameters.

4
Q

What is a Test Set?

A

Used after training to check final performance.

5
Q

What is the Holdout method in model validation?

A

One-time split: Train (e.g. 60%), Val (20%), Test (20%).

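A minimal sketch of the 60/20/20 holdout split using scikit-learn's train_test_split (the dataset and exact ratios are illustrative):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)

# Hold out 20% of the data as the test set...
X_trainval, X_test, y_trainval, y_test = train_test_split(
    X, y, test_size=0.20, random_state=0)

# ...then carve another 20% of the original data (25% of the remainder)
# out as the validation set.
X_train, X_val, y_train, y_val = train_test_split(
    X_trainval, y_trainval, test_size=0.25, random_state=0)

print(len(X_train), len(X_val), len(X_test))  # 600 200 200
```
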
6
Q

When is the Holdout method best used?

A

For large datasets.

7
Q

What is K-Fold Cross Validation (KCV)?

A

Split the data into k folds; each fold takes a turn as the test set while the rest train the model, and the k scores are averaged.

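A minimal sketch of 5-fold cross validation with scikit-learn's cross_val_score (the dataset and model are illustrative):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=200, random_state=0)

# 5 folds: each fold is held out once while the other 4 train the model;
# the 5 scores are then averaged for a more stable estimate.
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
print(scores.mean(), scores.std())
```
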
8
Q

When is K-Fold Cross Validation best used?

A

Best for small datasets; gives a more reliable performance estimate than a single split.

9
Q

What is Overfitting?

A

Too good on training, bad on new data.

10
Q

What is Underfitting?

A

Bad on both training and new data.

11
Q

What are the characteristics of Overfitting?

A

High variance; the model memorizes the training data instead of learning general patterns.

12
Q

What are the characteristics of Underfitting?

A

High bias; the model is too simple and effectively guesses.

13
Q

What is Early Stopping?

A

Stop training once validation loss stops improving (starts going up), to avoid overfitting.

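One way to see early stopping in practice is scikit-learn's MLPClassifier, which monitors a held-out validation score while training; a minimal sketch with illustrative parameter values:

```python
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=500, random_state=0)

# Reserve 10% of the training data as a validation set and stop when the
# validation score has not improved for 10 consecutive epochs.
clf = MLPClassifier(early_stopping=True, validation_fraction=0.1,
                    n_iter_no_change=10, max_iter=500, random_state=0)
clf.fit(X, y)
print(clf.n_iter_)  # epochs actually run before stopping
```
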
14
Q

What is L2 Regularization?

A

Penalizes large weights to keep the model simple.

15
Q

What is the L2 Regularization formula?

A

Penalty added to the loss: λ * Σ(weights²) → encourages smaller weights.

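A small illustration of the effect using scikit-learn's Ridge regression, where alpha plays the role of λ (the data and alpha values are made up):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge

X, y = make_regression(n_samples=100, n_features=10, noise=10, random_state=0)

# Ridge adds alpha * Σ(weights²) to the loss; larger alpha shrinks the weights.
for alpha in (0.01, 10.0, 1000.0):
    w = Ridge(alpha=alpha).fit(X, y).coef_
    print(alpha, np.sum(w ** 2))  # sum of squared weights drops as alpha grows
```
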
16
Q

What are Hyperparameters?

A

Settings you pick before training (not learned from data).

17
Q

Give examples of Hyperparameters.

A
  • Learning rate
  • Batch size
  • Number of layers
  • Activation functions
18
Q

What is Grid Search?

A

Try every combo of settings.

19
Q

When is Grid Search effective?

A

Good for small search spaces.

20
Q

What is a disadvantage of Grid Search?

A

Becomes very slow when there are many options – the number of combinations grows exponentially with the number of hyperparameters.
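
A minimal GridSearchCV sketch; the model and parameter grid are illustrative:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)

# Try every combination of the listed settings (3 x 2 = 6 candidates),
# each evaluated with 5-fold cross validation.
param_grid = {"C": [0.1, 1, 10], "kernel": ["linear", "rbf"]}
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```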

21
Q

What is Random Search?

A

Pick random combinations of settings instead of trying them all.

22
Q

When is Random Search better?

A

Better for large/continuous spaces.
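
A minimal RandomizedSearchCV sketch that samples from continuous ranges; the model and distributions are illustrative:

```python
from scipy.stats import loguniform
from sklearn.datasets import make_classification
from sklearn.model_selection import RandomizedSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)

# Sample 10 random combinations from continuous distributions instead of
# exhaustively trying a fixed grid.
param_dist = {"C": loguniform(1e-2, 1e2), "gamma": loguniform(1e-4, 1e0)}
search = RandomizedSearchCV(SVC(), param_dist, n_iter=10, cv=5, random_state=0)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```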

23
Q

What are Classification Models used for?

A

To assign a class (label) to data.

24
Q

What is an example of a Classification Model?

A

Is this email spam? Is the tumor benign or malignant?

25
Q

What is Accuracy in model evaluation?

A

% of correct predictions.

26
Q

What is a limitation of Accuracy?

A

Doesn't work well when classes are imbalanced.

27
Q

What is a Confusion Matrix?

A

A table used to describe the performance of a classification model.

28
Q

What does TP stand for in a Confusion Matrix?

A

True Positives.

29
Q

What does FN stand for in a Confusion Matrix?

A

False Negatives.

30
Q

What does FP stand for in a Confusion Matrix?

A

False Positives.

31
Q

What does TN stand for in a Confusion Matrix?

A

True Negatives.

32
Q

What is Precision?

A

TP / (TP + FP) → How many predicted positives were correct?

33
Q

What is Recall?

A

TP / (TP + FN) → How many actual positives were found?

34
Q

What is the F1 Score?

A

Harmonic mean of Precision & Recall → 2 * (P * R) / (P + R).

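A tiny worked example of these classification metrics with scikit-learn (the label vectors are made up for illustration):

```python
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_score, recall_score, f1_score)

y_true = [1, 0, 1, 1, 0, 1, 0, 0]  # actual labels (made up)
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]  # model predictions (made up)

print(confusion_matrix(y_true, y_pred))  # [[TN FP], [FN TP]] = [[3 1], [1 3]]
print(accuracy_score(y_true, y_pred))    # 6/8 correct = 0.75
print(precision_score(y_true, y_pred))   # TP / (TP + FP) = 3/4
print(recall_score(y_true, y_pred))      # TP / (TP + FN) = 3/4
print(f1_score(y_true, y_pred))          # 2PR / (P + R) = 0.75
```
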
35
Q

What is AUC?

A

Area Under the ROC Curve → measures the model’s ability to distinguish between classes.

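A minimal sketch with scikit-learn's roc_auc_score (the labels and predicted probabilities are made up):

```python
from sklearn.metrics import roc_auc_score

y_true = [0, 0, 1, 1, 0, 1]                # actual labels (made up)
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.9]  # predicted probabilities (made up)

# AUC is the chance that a random positive example gets a higher score than
# a random negative one: 1.0 is perfect separation, 0.5 is random guessing.
print(roc_auc_score(y_true, y_score))
```
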
36
Q

When should you use F1 or AUC?

A

When classes are imbalanced, or when missing a positive is worse than a few false alarms.

37
Q

What are Regression Models used for?

A

Predicting a number (not a category).

38
Q

Give examples of Regression Models.

A
  • House prices
  • Stock market trends
39
Q

What is MAE?

A

Mean Absolute Error – average of absolute errors; less sensitive to outliers.

40
Q

What is MSE?

A

Mean Squared Error – squares the errors; punishes big mistakes more.

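A tiny worked example of both regression metrics with scikit-learn (the values are made up):

```python
from sklearn.metrics import mean_absolute_error, mean_squared_error

y_true = [3.0, 5.0, 2.5, 7.0]  # actual target values (made up)
y_pred = [2.5, 5.0, 4.0, 8.0]  # model predictions (made up)

print(mean_absolute_error(y_true, y_pred))  # average |error| = 0.75
print(mean_squared_error(y_true, y_pred))   # average error² = 0.875; big misses weigh more
```
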
41
Q

What are Clustering Models used for?

A

When you don’t have labels – model tries to find natural groupings.

42
Q

Give an example of a Clustering Model use case.

A

Segmenting customers into behavior types.

43
Q

What metric is used for evaluating clustering?

A

Silhouette Coefficient.

44
Q

What does a higher Silhouette Coefficient indicate?

A

Better clustering.

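A minimal sketch scoring a k-means clustering with scikit-learn's silhouette_score (the data and number of clusters are illustrative):

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

# Fit a clustering model and score it: values near 1 mean tight, well-separated
# clusters; values near 0 mean overlapping clusters.
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
print(silhouette_score(X, labels))
```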