{ "@context": "https://schema.org", "@type": "Organization", "name": "Brainscape", "url": "https://www.brainscape.com/", "logo": "https://www.brainscape.com/pks/images/cms/public-views/shared/Brainscape-logo-c4e172b280b4616f7fda.svg", "sameAs": [ "https://www.facebook.com/Brainscape", "https://x.com/brainscape", "https://www.linkedin.com/company/brainscape", "https://www.instagram.com/brainscape/", "https://www.tiktok.com/@brainscapeu", "https://www.pinterest.com/brainscape/", "https://www.youtube.com/@BrainscapeNY" ], "contactPoint": { "@type": "ContactPoint", "telephone": "(929) 334-4005", "contactType": "customer service", "availableLanguage": ["English"] }, "founder": { "@type": "Person", "name": "Andrew Cohen" }, "description": "Brainscape’s spaced repetition system is proven to DOUBLE learning results! Find, make, and study flashcards online or in our mobile app. Serious learners only.", "address": { "@type": "PostalAddress", "streetAddress": "159 W 25th St, Ste 517", "addressLocality": "New York", "addressRegion": "NY", "postalCode": "10001", "addressCountry": "USA" } }

ML Part 1 Flashcards

(25 cards)

1
Q

What is supervised learning?

A

A machine learning task where the model learns from labeled data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is unsupervised learning?

A

A task where the model finds patterns or structure in unlabeled data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between regression and classification?

A

Regression predicts continuous values; classification predicts discrete labels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is reinforcement learning?

A

A type of learning where an agent learns by interacting with an environment and receiving rewards or penalties.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a model in machine learning?

A

A mathematical function or algorithm that maps inputs to outputs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is overfitting?

A

When a model learns noise in the training data and performs poorly on unseen data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is underfitting?

A

When a model is too simple to capture underlying patterns in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the bias-variance tradeoff?

A

The balance between underfitting (high bias) and overfitting (high variance).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How can you reduce overfitting?

A

Use regularization, more data, cross-validation, or simpler models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can you reduce underfitting?

A

Use more complex models or add relevant features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is accuracy?

A

The proportion of correct predictions out of all predictions made.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is precision?

A

The proportion of true positives among all predicted positives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is recall?

A

The proportion of true positives among all actual positives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the F1 score?

A

The harmonic mean of precision and recall.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a confusion matrix?

A

A table showing true vs predicted classifications (TP, FP, FN, TN).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a train-test split?

A

Dividing data into a training set and a test set to evaluate generalization.

17
Q

What is k-fold cross-validation?

A

Dividing data into k parts, training on k-1 and testing on the remaining fold, repeated k times.

18
Q

Why use cross-validation?

A

To get a more reliable estimate of model performance on unseen data.

19
Q

What is the purpose of a validation set?

A

To tune model parameters before final evaluation on the test set.

20
Q

What is data leakage?

A

When information from outside the training set is used in model training, leading to unrealistic performance.

21
Q

What is a machine learning pipeline?

A

A sequence of data preprocessing and modeling steps applied consistently.

22
Q

What are the stages in a basic ML workflow?

A

Preprocessing → training → validation → testing → deployment.

23
Q

What is model deployment?

A

Making a trained model available for use in production environments.

24
Q

What is model inference?

A

Using a trained model to make predictions on new data.

25
What is feature engineering?
Creating new input features from raw data to improve model performance.