19. Learning by Example (ML) Flashcards
(22 cards)
What is learning by example?
Learning by example involves inferring a function or decision boundary from input-output pairs (training data).
What is the goal of supervised learning?
To learn a function that maps inputs to outputs, minimizing error on unseen examples.
What is a hypothesis in machine learning?
A candidate function from a hypothesis space that approximates the target function.
What is the hypothesis space?
The set of all functions a learning algorithm can choose from to approximate the target function.
What does it mean to generalize?
To perform well on unseen data, not just the training set.
What is overfitting?
When a model fits training data too closely, capturing noise and failing to generalize.
What is underfitting?
When a model is too simple to capture the underlying pattern of the data.
What is the difference between training and test error?
Training error measures fit to seen data; test error measures performance on unseen data.
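The overfitting and train-vs-test-error cards above can be illustrated with a toy "memorizer" learner that stores the training pairs verbatim: it achieves zero training error but cannot generalize. The parity dataset and the constant fallback guess are assumptions for illustration only.

```python
# A "memorizer" fits training data perfectly (zero training error) but
# fails on unseen inputs: a minimal sketch of overfitting.
# The parity dataset and default guess below are illustrative assumptions.

def train_memorizer(examples):
    """Store each input -> label pair verbatim."""
    return dict(examples)

def predict(model, x, default=0):
    # Unseen inputs fall back to a constant guess.
    return model.get(x, default)

train = [(1, 1), (2, 0), (3, 1)]   # assumed target: parity of x
test = [(4, 0), (5, 1)]            # unseen inputs

model = train_memorizer(train)
train_err = sum(predict(model, x) != y for x, y in train) / len(train)
test_err = sum(predict(model, x) != y for x, y in test) / len(test)
print(train_err, test_err)
```

Training error is 0.0 because every training input is looked up exactly, while test error is 0.5: the model captured the training set, not the underlying pattern.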
What is the version space?
The set of hypotheses consistent with all training examples.
What is inductive bias?
Assumptions a learner uses to generalize beyond the training data.
What is the inductive learning hypothesis?
Any hypothesis that approximates the target function well over a sufficiently large set of training examples will also approximate it well over unobserved examples.
What is a consistent learner?
A learner that only outputs hypotheses consistent with all training examples.
What is the Find-S algorithm?
It finds the most specific hypothesis consistent with the training data.
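The Find-S card above can be sketched in a few lines for conjunctive hypotheses over attribute vectors, where '0' means "no value allowed" and '?' means "any value". The EnjoySport-style dataset is an assumed illustration, not part of the original cards.

```python
# A minimal Find-S sketch for conjunctive hypotheses.
# '0' = no value allowed (most specific), '?' = any value (most general).
# The EnjoySport-style dataset is an assumed illustrative example.

def find_s(examples, n_attrs):
    h = ['0'] * n_attrs                 # start with the most specific hypothesis
    for x, label in examples:
        if not label:                   # Find-S ignores negative examples
            continue
        for i, value in enumerate(x):
            if h[i] == '0':
                h[i] = value            # first positive: copy its values
            elif h[i] != value:
                h[i] = '?'              # conflict: minimally generalize
    return tuple(h)

data = [
    (('Sunny', 'Warm', 'Normal', 'Strong', 'Warm', 'Same'), True),
    (('Sunny', 'Warm', 'High',   'Strong', 'Warm', 'Same'), True),
    (('Rainy', 'Cold', 'High',   'Strong', 'Warm', 'Change'), False),
    (('Sunny', 'Warm', 'High',   'Strong', 'Cool', 'Change'), True),
]
print(find_s(data, 6))  # ('Sunny', 'Warm', '?', 'Strong', '?', '?')
```

Each positive example can only generalize the hypothesis, which is why Find-S converges to the most specific consistent hypothesis and why, as the next card notes, it cannot exploit negative examples or detect noise.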
What are limitations of Find-S?
It handles only conjunctive hypotheses, ignores negative examples, cannot detect noisy or inconsistent data, and gives no way to tell whether it has converged to the target concept.
What is the Candidate Elimination algorithm?
It maintains the version space by updating specific (S) and general (G) boundaries.
What happens to the version space with more data?
It shrinks, ideally converging toward the target concept.
What are the S and G sets in Candidate Elimination?
S contains the most specific hypotheses; G contains the most general ones consistent with data.
How are S and G updated in Candidate Elimination?
S is generalized on positive examples; G is specialized on negative examples.
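The S/G updates described above can be sketched for conjunctive hypotheses, with S kept as a single maximally specific hypothesis. This is a simplified sketch: a full version would also discard S when it matches a negative example and prune redundant members of G. The EnjoySport-style dataset is an assumed illustration.

```python
# A compact Candidate Elimination sketch for conjunctive hypotheses
# ('?' = any value). S is a single most-specific hypothesis; G is a set
# of most-general ones. Simplified: S-vs-negative checks and G pruning
# are omitted. The dataset is an assumed illustrative example.

def matches(h, x):
    return all(hv == '?' or hv == xv for hv, xv in zip(h, x))

def candidate_elimination(examples, n_attrs):
    S = ['0'] * n_attrs                          # most specific boundary
    G = [('?',) * n_attrs]                       # most general boundary
    for x, label in examples:
        if label:                                # positive example
            G = [g for g in G if matches(g, x)]  # drop inconsistent g's
            for i, v in enumerate(x):            # minimally generalize S
                if S[i] == '0':
                    S[i] = v
                elif S[i] != v:
                    S[i] = '?'
        else:                                    # negative example
            new_G = []
            for g in G:
                if not matches(g, x):            # already excludes x
                    new_G.append(g)
                    continue
                # Minimally specialize g so it excludes x, using the
                # attribute values S has already committed to.
                for i in range(n_attrs):
                    if g[i] == '?' and S[i] not in ('0', '?') and S[i] != x[i]:
                        new_G.append(g[:i] + (S[i],) + g[i + 1:])
            G = new_G
    return tuple(S), G

data = [
    (('Sunny', 'Warm', 'Normal', 'Strong', 'Warm', 'Same'), True),
    (('Sunny', 'Warm', 'High',   'Strong', 'Warm', 'Same'), True),
    (('Rainy', 'Cold', 'High',   'Strong', 'Warm', 'Change'), False),
    (('Sunny', 'Warm', 'High',   'Strong', 'Cool', 'Change'), True),
]
S, G = candidate_elimination(data, 6)
print(S)  # ('Sunny', 'Warm', '?', 'Strong', '?', '?')
print(G)
```

On this dataset the boundaries converge to S = ('Sunny', 'Warm', '?', 'Strong', '?', '?') and G = {('Sunny', '?', '?', '?', '?', '?'), ('?', 'Warm', '?', '?', '?', '?')}, showing the version space shrinking from both ends as the next cards describe.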
What causes noise to be problematic in version space learning?
A single mislabeled example can eliminate the correct hypothesis, and with enough noise the version space can become empty because no hypothesis is consistent with all examples.
What is inductive learning vulnerable to?
Noise, limited data, and incorrect inductive bias.
How can hypothesis space design affect learning?
A space that is too large admits hypotheses that fit noise (overfitting); one that is too small may not contain the target function (underfitting).
Why is inductive learning considered impossible without bias?
Because many hypotheses fit any finite training set equally well; a bias is needed to prefer one and thereby make predictions on unseen inputs.