Supervised Learning Flashcards

1
Q

What is Supervised Learning?

A

A type of ML where the model is trained on labeled data, learning from known answers.

Supervised Learning relies on a dataset that includes input-output pairs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are key features used in Supervised Learning?

A
  • Buying Price
  • Maintenance Cost
  • Number of Doors
  • Seating Capacity
  • Luggage Boot Size
  • Safety Rating

These features help the model learn and make predictions based on labeled data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Predictive Modeling?

A

When ML learns patterns from data to make predictions.

Predictive modeling is a core aspect of Supervised Learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between Regression and Classification?

A
  • Regression → Predicts continuous values (e.g., house prices)
  • Classification → Assigns data into categories (e.g., spam or not spam)

Understanding the difference is crucial for choosing the right model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Linear Regression?

A

A method to find the best-fit line Y = mx + c, where c is the intercept and m is the slope.

Linear regression is used to model the relationship between two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the limitations of Linear Regression?

A
  • Not good for non-linear relationships
  • Not good when there are too many outliers.

These limitations can affect the accuracy of predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Decision Tree?

A

A flowchart-like structure where each decision leads to an outcome.

Decision Trees are intuitive and easy to interpret.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the process of creating a Decision Tree?

A
  • Pick the best feature
  • Split the data into groups
  • Keep splitting until groups are pure.

This process helps in making decisions based on the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Random Forest?

A

A collection of multiple decision trees to improve accuracy and reduce overfitting.

Random Forest is an ensemble method that enhances model performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does Random Forest work?

A
  • Train many Decision Trees on random data subsets
  • Use different features at each split
  • Combine all tree predictions.

This method helps in averaging out errors from individual trees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is k-Nearest Neighbors (k-NN)?

A

A method that classifies new data points based on the ‘k’ closest points in the dataset.

k-NN is a simple yet effective classification algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the process for k-NN classification?

A
  • Store the data
  • Choose k
  • Measure who’s closest
  • Pick the k nearest
  • Count votes & classify based on majority

The choice of ‘k’ can significantly impact the classification result.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a limitation of k-NN?

A

It is slow for large datasets.

The computational cost increases with the size of the dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

List the main concepts of Supervised Learning.

A
  • Uses labeled data
  • Regression vs. Classification
  • Linear Regression
  • Decision Trees
  • Random Forest
  • k-NN

These concepts form the foundation of supervised learning techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the goal of Linear Regression?

A

To find the best-fit line that represents the relationship between variables.

This goal helps in making accurate predictions based on input data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

True or False: Decision Trees can overfit.

A

True

Overfitting occurs when the model learns noise in the training data.

17
Q

Fill in the blank: Random Forest is an army of _______.

A

[Decision Trees]

This metaphor highlights the ensemble nature of the Random Forest algorithm.