ML Part 1 Flashcards by James Clyne

What is supervised learning?

A machine learning task where the model learns from labeled data.

What is unsupervised learning?

A task where the model finds patterns or structure in unlabeled data.

What is the difference between regression and classification?

Regression predicts continuous values; classification predicts discrete labels.

What is reinforcement learning?

A type of learning where an agent learns by interacting with an environment and receiving rewards or penalties.

What is a model in machine learning?

A mathematical function or algorithm that maps inputs to outputs.

What is overfitting?

When a model learns noise in the training data and performs poorly on unseen data.

What is underfitting?

When a model is too simple to capture underlying patterns in the data.

What is the bias-variance tradeoff?

The balance between underfitting (high bias) and overfitting (high variance).
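
As a hedged illustration, the tradeoff is often written as a decomposition of expected squared error. The notation is an assumption added here, not part of the original card: $y = f(x) + \varepsilon$ with $\mathbb{E}[\varepsilon] = 0$ and $\mathrm{Var}(\varepsilon) = \sigma^2$, and $\hat{f}$ is the model fitted on a random training set.

```latex
\mathbb{E}\!\left[(y - \hat{f}(x))^2\right]
  = \underbrace{\left(\mathbb{E}[\hat{f}(x)] - f(x)\right)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\!\left[\left(\hat{f}(x) - \mathbb{E}[\hat{f}(x)]\right)^2\right]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible noise}}
```

Simpler models tend to raise the bias term and lower the variance term; more flexible models tend to do the reverse.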

How can you reduce overfitting?

Use regularization, more training data, or a simpler model; cross-validation helps detect overfitting rather than remove it.

How can you reduce underfitting?

Use more complex models or add relevant features.

What is accuracy?

The proportion of correct predictions out of all predictions made.

What is precision?

The proportion of true positives among all predicted positives.

What is recall?

The proportion of true positives among all actual positives.

What is the F1 score?

The harmonic mean of precision and recall.

What is a confusion matrix?

A table showing true vs predicted classifications (TP, FP, FN, TN).
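
The four confusion-matrix counts are enough to compute accuracy, precision, recall, and F1 as defined on the cards above. The following is a minimal sketch for the binary case in plain Python; the function names and the sample labels are hypothetical, chosen only for illustration.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (TP, FP, FN, TN) for a binary classification problem."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive and actual == positive:
            tp += 1
        elif predicted == positive:
            fp += 1
        elif actual == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def binary_metrics(y_true, y_pred, positive=1):
    """Derive accuracy, precision, recall, and F1 from the four counts."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    # Guard against division by zero when no positives are predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
counts = confusion_counts(y_true, y_pred)
metrics = binary_metrics(y_true, y_pred)
```

Returning 0.0 when a denominator is zero is one convention among several; some libraries warn or return NaN instead.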

What is a train-test split?

Dividing data into a training set and a test set to evaluate generalization.
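
A split like this can be sketched with the standard library alone. The helper name `split_train_test`, the 20% test fraction, and the seed are assumptions made for this example, not something the card specifies.

```python
import random


def split_train_test(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of rows and hold out a fraction as the test set."""
    rng = random.Random(seed)      # seeded so the split is reproducible
    shuffled = list(rows)          # copy, so the caller's data is untouched
    rng.shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


data = list(range(10))
train, test = split_train_test(data, test_fraction=0.2, seed=42)
```

Shuffling before splitting suits independent samples; for time-ordered data a chronological split is usually more appropriate.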

What is k-fold cross-validation?

Dividing data into k parts (folds), training on k-1 of them and testing on the remaining fold, repeated k times so that each fold serves as the test fold exactly once.
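
The fold rotation can be sketched as an index generator. This is a minimal version under stated assumptions: the name `k_fold_indices` is hypothetical, and the folds are contiguous and unshuffled, so in practice the data would typically be shuffled first.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) pairs, one per fold."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    indices = list(range(n_samples))
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


folds = list(k_fold_indices(10, 3))
```

Averaging a metric over the k test folds gives the cross-validated estimate of performance.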

Why use cross-validation?

To get a more reliable estimate of model performance on unseen data.

What is the purpose of a validation set?

To tune hyperparameters and compare candidate models before final evaluation on the test set.

What is data leakage?

When information that would not be available at prediction time (for example, test-set data or the target itself) is used in model training, leading to unrealistically optimistic performance estimates.
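
One common form of leakage is computing preprocessing statistics on the full dataset before splitting. The sketch below contrasts a leaky and a leak-free standardization; the helper names and the toy values are hypothetical, used only to illustrate the idea.

```python
from statistics import mean, pstdev


def fit_scaler(values):
    """Learn the mean and standard deviation used for standardization."""
    mu = mean(values)
    sigma = pstdev(values) or 1.0   # avoid dividing by zero on constant data
    return mu, sigma


def apply_scaler(values, mu, sigma):
    """Standardize values with previously learned statistics."""
    return [(v - mu) / sigma for v in values]


train = [1.0, 2.0, 3.0, 4.0]
test = [100.0]

# Leaky: the statistics are computed with the test value included.
leaky_mu, leaky_sigma = fit_scaler(train + test)

# Leak-free: statistics come from the training data only,
# and are then reused unchanged on the test data.
mu, sigma = fit_scaler(train)
train_scaled = apply_scaler(train, mu, sigma)
test_scaled = apply_scaler(test, mu, sigma)
```

In the leaky version the training features already reflect the test value, so any score measured on that test set is no longer an honest estimate.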

What is a machine learning pipeline?

A sequence of data preprocessing and modeling steps applied consistently.
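
To illustrate "applied consistently", here is a minimal sketch of a pipeline that fits its preprocessing steps once and reuses them at prediction time. The classes `MeanCenter`, `SignClassifier`, and `SimplePipeline` are hypothetical toy components written for this example, not a real library's API.

```python
class MeanCenter:
    """Toy transformer: subtracts the mean learned during fit."""

    def fit(self, xs):
        self.mean_ = sum(xs) / len(xs)
        return self

    def transform(self, xs):
        return [x - self.mean_ for x in xs]


class SignClassifier:
    """Toy model: predicts 1 for positive inputs, 0 otherwise."""

    def fit(self, xs, ys):
        return self                 # nothing to learn in this toy model

    def predict(self, xs):
        return [1 if x > 0 else 0 for x in xs]


class SimplePipeline:
    """Chains transformers and a final model behind one fit/predict pair."""

    def __init__(self, transformers, model):
        self.transformers = transformers
        self.model = model

    def fit(self, xs, ys):
        for t in self.transformers:
            xs = t.fit(xs).transform(xs)    # fit on training data only
        self.model.fit(xs, ys)
        return self

    def predict(self, xs):
        for t in self.transformers:
            xs = t.transform(xs)            # reuse the fitted state
        return self.model.predict(xs)


pipe = SimplePipeline([MeanCenter()], SignClassifier())
pipe.fit([1.0, 2.0, 3.0, 4.0], [0, 0, 1, 1])
predictions = pipe.predict([0.5, 3.5])
```

Because `predict` never refits the transformers, new data is processed with exactly the statistics learned during training.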

What are the stages in a basic ML workflow?

Preprocessing → training → validation → testing → deployment.

What is model deployment?

Making a trained model available for use in production environments.

What is model inference?

Using a trained model to make predictions on new data.

What is feature engineering?

Creating new input features from raw data to improve model performance.
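
As a small hedged example, a raw order record can be turned into features a model may find easier to use. The field names (`order_date`, `total_price`, `quantity`) and the function `engineer_features` are invented for this sketch.

```python
from datetime import date


def engineer_features(record):
    """Derive calendar and ratio features from one raw order record."""
    d = date.fromisoformat(record["order_date"])
    return {
        "day_of_week": d.weekday(),                 # Monday = 0, Sunday = 6
        "is_weekend": int(d.weekday() >= 5),
        "price_per_item": record["total_price"] / record["quantity"],
    }


# Made-up record for illustration only.
raw = {"order_date": "2024-03-09", "total_price": 30.0, "quantity": 4}
features = engineer_features(raw)
```

Whether such features help depends on the problem; they are usually checked against a validation set.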

ML Part 1 Flashcards

(25 cards)