ML Part 2 Flashcards

Question 1

Q

What is linear regression?

Answer

A

A model that predicts a continuous outcome using a linear combination of input features.
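
As a minimal sketch of the "linear combination" in this answer, the snippet below computes one prediction by hand. The function name `predict` and the coefficient values are hypothetical, chosen only for illustration.

```python
def predict(features, weights, intercept):
    """Return intercept + sum(w_i * x_i) for a single example."""
    return intercept + sum(w * x for w, x in zip(weights, features))


# Hypothetical coefficients, not fitted to any real data.
prediction = predict([2.0, 3.0], weights=[0.5, -1.0], intercept=4.0)
print(prediction)
```

Setting every feature to zero returns the intercept alone, and raising one feature by one unit shifts the prediction by that feature's weight.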

Question 2

Q

What does the slope coefficient represent in linear regression?

Answer

A

The change in the predicted value for a one-unit increase in that input, holding the other inputs constant.

Question 3

Q

What is the intercept in linear regression?

Answer

A

The predicted value when all input features are zero.

Question 4

Q

What is the loss function used in linear regression?

Answer

A

Mean Squared Error (MSE).
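
A small sketch of how MSE could be computed in plain Python. The function name `mean_squared_error` and the sample values are assumptions made for this example.

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences between targets and predictions."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Illustrative values: the squared errors are 1, 0, and 4.
mse = mean_squared_error([3.0, 5.0, 2.0], [2.0, 5.0, 4.0])
print(mse)
```

Because errors are squared, one large miss contributes more to the loss than several small ones.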

Question 5

Q

What are assumptions of linear regression?

Answer

A

Linearity, homoscedasticity, independence, normality of residuals.

Question 6

Q

What is logistic regression used for?

Answer

A

Binary classification.

Question 7

Q

What is the output of logistic regression?

Answer

A

A probability between 0 and 1.

Question 8

Q

What is the sigmoid function?

Answer

A

A function, σ(z) = 1 / (1 + e^(−z)), that maps any real value into the open interval (0, 1), so the output can be read as a probability.
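
The sketch below implements the standard logistic form of the sigmoid. Branching on the sign of `z` is one common way to avoid overflow in `math.exp` for large negative inputs; it is an implementation choice for this example, not part of the definition.

```python
import math


def sigmoid(z):
    """Logistic sigmoid, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)  # safe: z is negative, so exp(z) is at most 1
    return e / (1.0 + e)


print(sigmoid(0.0))
```

The function is symmetric around zero in the sense that sigmoid(z) + sigmoid(-z) = 1.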

Question 9

Q

How do you convert probabilities to classes in logistic regression?

Answer

A

Using a decision threshold (often 0.5).
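
A minimal sketch of thresholding. The helper name `to_class` is hypothetical, and treating a probability exactly equal to the threshold as the positive class is an assumed convention.

```python
def to_class(probability, threshold=0.5):
    """Map a predicted probability to class 1 or class 0."""
    # Assumption: ties at the threshold go to the positive class.
    return 1 if probability >= threshold else 0


labels = [to_class(p) for p in [0.1, 0.5, 0.9]]
print(labels)
```

Raising the threshold makes the positive class harder to predict, which trades recall for precision.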

Question 10

Q

What is the loss function used in logistic regression?

Answer

A

Log loss or cross-entropy loss.
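
A sketch of binary log loss in plain Python. The function name `log_loss` and the clipping constant `eps` are assumptions for this example; clipping keeps `math.log` away from zero when a predicted probability is exactly 0 or 1.

```python
import math


def log_loss(y_true, y_prob, eps=1e-15):
    """Mean binary cross-entropy for labels in {0, 1}."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


# Illustrative comparison: confident-and-right vs. confident-and-wrong.
good = log_loss([1, 0], [0.9, 0.1])
bad = log_loss([1, 0], [0.1, 0.9])
print(good, bad)
```

Confident wrong predictions are penalized far more heavily than confident correct ones are rewarded.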

Question 11

Q

What is a decision tree?

Answer

A

A model that splits data using feature values to make decisions.

Question 12

Q

What is Gini impurity?

Answer

A

A measure of how often a randomly chosen element would be incorrectly labeled if it were labeled at random according to the class distribution in the node.
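
Gini impurity is commonly computed as 1 minus the sum of squared class proportions. A minimal sketch follows; the function name `gini_impurity` and the sample labels are chosen for illustration.

```python
from collections import Counter


def gini_impurity(labels):
    """1 - sum(p_c ** 2) over the classes present in `labels`."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


print(gini_impurity(["a", "a", "b", "b"]))
```

A pure node scores 0, and an evenly split two-class node scores 0.5, the maximum for two classes.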

Question 13

Q

What is information gain?

Answer

A

The reduction in impurity (typically entropy) achieved by a split in a decision tree: the parent's impurity minus the weighted average impurity of the child nodes.
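
The sketch below computes information gain for a two-way split, assuming entropy as the impurity measure. The function names and the toy labels are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of the class distribution in `labels`."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(parent, left, right):
    """Parent entropy minus the size-weighted entropy of the two children."""
    n = len(parent)
    weighted = (len(left) / n) * entropy(left) + (len(right) / n) * entropy(right)
    return entropy(parent) - weighted


# A split that separates the classes perfectly vs. one that changes nothing.
perfect = information_gain([0, 0, 1, 1], [0, 0], [1, 1])
useless = information_gain([0, 0, 1, 1], [0, 1], [0, 1])
print(perfect, useless)
```

A tree-building algorithm would typically evaluate candidate splits this way and keep the one with the highest gain.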

Question 14

Q

What is tree pruning?

Answer

A

Reducing the size of a tree to prevent overfitting.

Question 15

Q

What are advantages of decision trees?

Answer

A

Interpretability, handling non-linearities, and requiring little data preprocessing.

Question 16

Q

What is the k-nearest neighbors algorithm?

Answer

A

A model that classifies data based on the majority label of the k closest training examples.
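
A minimal brute-force sketch of k-NN classification, assuming Euclidean distance via `math.dist` (Python 3.8+). The function name `knn_predict` and the toy data are hypothetical; when votes tie, this version returns the tied label that appears first among the nearest neighbors.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Return the majority label among the k training points nearest to `query`."""
    order = sorted(
        range(len(train_points)),
        key=lambda i: math.dist(train_points[i], query),
    )
    nearest = [train_labels[i] for i in order[:k]]
    return Counter(nearest).most_common(1)[0][0]


# Toy data: two well-separated clusters.
points = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
labels = ["a", "a", "a", "b", "b", "b"]
print(knn_predict(points, labels, (0.5, 0.5), k=3))
```

There is no training step here: all the work happens at prediction time, which is why k-NN is often called a lazy learner.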

Question 17

Q

What is the key hyperparameter in k-NN?

Answer

A

The number of neighbors (k).

Question 18

Q

What distance metric is commonly used in k-NN?

Answer

A

Euclidean distance.
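
For reference, a short sketch of Euclidean distance written out explicitly. The function name `euclidean` is an assumption for this example; the standard library's `math.dist` computes the same quantity.

```python
import math


def euclidean(a, b):
    """Square root of the sum of squared coordinate differences."""
    if len(a) != len(b):
        raise ValueError("points must have the same number of dimensions")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


print(euclidean((0, 0), (3, 4)))
```

Because every coordinate contributes on its own scale, features are usually standardized before this metric is used in k-NN.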

Question 19

Q

What happens if k is too small in k-NN?

Answer

A

The model becomes sensitive to noise and overfits.

Question 20

Q

What happens if k is too large in k-NN?

Answer

A

The model may underfit and smooth over patterns.
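
The effect of k at both extremes can be seen on a small one-dimensional toy set. Everything below is hypothetical: the data, the single mislabeled point at 1.6, and the helper `knn_predict_1d`.

```python
from collections import Counter


def knn_predict_1d(xs, labels, query, k):
    """Majority label among the k values in `xs` closest to `query`."""
    order = sorted(range(len(xs)), key=lambda i: abs(xs[i] - query))
    return Counter(labels[i] for i in order[:k]).most_common(1)[0][0]


# Class "a" sits near 0-4, class "b" near 10-12; the point at 1.6 is label noise.
xs = [0.0, 1.0, 2.0, 3.0, 4.0, 1.6, 10.0, 11.0, 12.0]
labels = ["a", "a", "a", "a", "a", "b", "b", "b", "b"]

noisy = knn_predict_1d(xs, labels, 1.5, k=1)      # follows the noisy point
smoothed = knn_predict_1d(xs, labels, 1.5, k=3)   # outvoted by true neighbors
local = knn_predict_1d(xs, labels, 11.0, k=3)     # respects the "b" cluster
washed_out = knn_predict_1d(xs, labels, 11.0, k=len(xs))  # global majority only
print(noisy, smoothed, local, washed_out)
```

With k=1 the single noisy neighbor decides the prediction, while with k equal to the full training set every query receives the overall majority label regardless of where it lies. In practice k is usually chosen by cross-validation.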

(20 cards)