Lecture 1 - Recap and Neural Networks Flashcards

(31 cards)

1
Q

What best describes supervised learning?

A

Data points have a known, correct label.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What type of variable does regression predict?

A

A continuous variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of variable does classification predict?

A

A categorical variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What best describes unsupervised learning?

A

Data points have no label; patterns are discovered in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What best describes semi-supervised learning?

A

Some data points have labels and others do not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What best describes reinforcement learning?

A

An agent learns from its environment by receiving rewards for actions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the purpose of a training set in ML?

A

To adjust the weights of the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the purpose of a validation set in ML?

A

To tune hyperparameters and check for overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the purpose of a test set in ML?

A

To evaluate the final performance of the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is k-fold cross-validation used for?

A

To evaluate a model by splitting the dataset into multiple train/test folds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does a large gap between training and test error indicate?

A

High variance in the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What can help reduce test error?

A

More training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is bias in a model?

A

The error due to overly simplistic assumptions in the learning algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is variance in a model?

A

The error due to sensitivity to small fluctuations in the training set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is accuracy in classification?

A

The ratio of correct predictions to total predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is precision in classification?

A

The ratio of true positives to all predicted positives.

17
Q

What is recall in classification?

A

The ratio of true positives to all actual positives.

18
Q

What does F1-score balance?

A

Precision and recall.

19
Q

Why is recall more important than precision in disease diagnosis?

A

Because false negatives are riskier than false positives.

20
Q

What are examples of supervised learning algorithms?

A

Linear regression, logistic regression, SVM, decision trees, random forests, neural networks.

21
Q

What are examples of unsupervised learning algorithms?

A

K-means clustering, hierarchical clustering, PCA, autoencoders.

22
Q

Which method is commonly used for semi-supervised learning?

A

Self-learning: 1) Train on data with labels; 2) Predict labels on unlabeled data; 3) Train again on the newly labeled dataset.

23
Q

Is it correct that having a high number of correctly classified training samples means correctly classified test samples?

A

Yes, as long as the model does not overfit during training. In general, it is hard to find the ideal number of epochs.

24
Q

What is the formula for accuracy?

A

Accuracy = (TP + TN) / (TP + FP + TN + FN)

25
What is the formula for precision?
Precision = TP / (TP + FP)
26
What is the formula for recall?
Recall = TP / (TP + FN)
27
What is the formula for F1-score?
F1-score = 2 * (Precision * Recall) / (Precision + Recall) = 2 * TP / (2 * TP + FP + FN)
28
Why do we care about the F1-score?
Because it balances Precision and Recall, especially useful in imbalanced datasets.
29
Why do we have different performance metrics like Precision, Recall, and F1-score?
Because in some tasks like disease diagnosis, False Negatives are too risky and must be minimized.
30
What is a neural network?
A model architecture inspired by the brain's neural connections, used to approximate complex functions.
31
Why are neural networks powerful?
Because of their high expressivity, allowing them to resemble complex functions well.