End-Term Flashcards

(40 cards)

1
Q

What is Machine Learning?

A

A field of AI that enables systems to learn from experience without being explicitly programmed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Who defined ML as ‘computers learning from experience E with respect to tasks T and performance measure P’?

A

Tom Mitchell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Name the three types of Machine Learning.

A

Supervised learning, Unsupervised learning, Reinforcement learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is supervised learning?

A

Predicting outcomes based on labeled training data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are examples of supervised learning?

A

Handwriting recognition, stock prediction, disease diagnosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is unsupervised learning?

A

Finding patterns or groupings in unlabeled data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are examples of unsupervised learning?

A

Market basket analysis, customer segmentation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is reinforcement learning?

A

Learning to make sequences of decisions to maximize a reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a data set in ML?

A

A collection of information or records about a specific subject or entity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are attributes in a data set?

A

Features or variables that describe each record

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is qualitative data?

A

Categorical data describing qualities (e.g., gender, grade)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are types of qualitative data?

A

Nominal (no order) and Ordinal (with order)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is quantitative data?

A

Numerical data that can be measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is interval data?

A

Numeric data with known intervals but no true zero (e.g., temperature)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is ratio data?

A

Numeric data with a true zero (e.g., height, weight)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the main steps in building a learning system?

A

Task understanding, data collection, data preparation, modeling, evaluation, deployment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is classification?

A

Assigning a label to input data based on learned patterns

18
Q

What is regression?

A

Predicting a numerical value based on input data

19
Q

What programming language is most used in ML?

20
Q

What is a common library in Python for ML?

21
Q

What is the first step in problem-solving?

A

Problem formulation

22
Q

Name the main components in problem formulation.

A

Initial state, actions, transition model, goal test, path cost

23
Q

What is BFS (Breadth-First Search)?

A

Search algorithm that explores all nodes at the present depth before moving deeper

24
Q

What is DFS (Depth-First Search)?

A

Search algorithm that explores as far as possible along a branch before backtracking

25
What is A* Search?
Heuristic search algorithm that balances cost and heuristic estimates
26
What is the Vacuum World problem?
An AI agent cleans two squares using actions: Left, Right, Suck
27
What does k in k-NN stand for?
The number of nearest neighbors considered
28
Is k-NN a lazy learner?
Yes, it defers learning until a query is made
29
Steps in k-NN classification?
Choose k, calculate distances, select k nearest neighbors, vote
30
What distance metric is commonly used in k-NN?
Euclidean distance
31
What is the formula for Euclidean distance in 2D?
√((x2 - x1)^2 + (y2 - y1)^2)
32
Why choose an odd value for k in binary classification?
To avoid ties when voting
33
What is the impact of irrelevant attributes in k-NN?
They can distort distance calculations and reduce accuracy
34
What does normalization do in k-NN?
Rescales features to the same range, improving distance comparisons
35
How do you normalize an attribute?
(x - min) / (max - min)
36
In k-NN, what is the effect of different attribute scales?
Attributes with larger scales can dominate distance calculations
37
In k-NN, what is the best practice with continuous attributes?
Use a consistent and appropriate distance metric like Euclidean
38
Python code to create 3-NN classifier in sklearn?
knn = neighbors.KNeighborsClassifier(n_neighbors=3)
39
Python code to train k-NN classifier?
fit = knn.fit(iris_train_ftrs, iris_train_tgt)
40
Python code to evaluate accuracy?
metrics.accuracy_score(iris_test_tgt, preds)