KNN Flashcards

(36 cards)

1
Q

What type of model is k-NN?

A

Non-parametric.

2
Q

Does k-NN train model parameters like θ?

A

No, it stores the training data instead.

3
Q

How does k-NN make predictions for classification?

A

By majority vote of the k closest training points.

4
Q

How does k-NN make predictions for regression?

A

By averaging the outputs (y) of the k nearest points.
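
Both prediction rules fit in a few lines. A minimal sketch using NumPy; the tiny dataset and the helper name knn_predict are made up for illustration:

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3, task="classification"):
    # Euclidean distance from x_new to every training point.
    dists = np.sqrt(((X_train - x_new) ** 2).sum(axis=1))
    # Indices of the k closest training points.
    nearest = np.argsort(dists)[:k]
    if task == "classification":
        # Majority vote among the k nearest labels.
        return Counter(y_train[nearest]).most_common(1)[0][0]
    # Regression: average the k nearest outputs.
    return y_train[nearest].mean()

X_train = np.array([[1.0, 2.0], [2.0, 1.0], [8.0, 9.0], [9.0, 8.0]])
y_class = np.array([0, 0, 1, 1])
y_reg = np.array([1.5, 2.0, 8.5, 9.0])

print(knn_predict(X_train, y_class, np.array([1.5, 1.5]), k=3))                   # -> 0
print(knn_predict(X_train, y_reg, np.array([8.5, 8.5]), k=2, task="regression"))  # -> 8.75
```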

5
Q

What does a small k value do in k-NN?

A

Makes the model highly flexible and sensitive to noise.

6
Q

What happens when k is too small in k-NN?

A

The model overfits (low bias, high variance).

7
Q

What happens when k is too large in k-NN?

A

The model underfits (high bias, low variance).
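
The overfitting/underfitting pattern of the last three cards can be seen by comparing train and test accuracy as k varies. A quick sketch, assuming scikit-learn is available; the dataset and the k values are arbitrary choices:

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = make_moons(n_samples=400, noise=0.3, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for k in (1, 15, 101):
    model = KNeighborsClassifier(n_neighbors=k).fit(X_tr, y_tr)
    # k=1 tends to score ~1.0 on train but worse on test (overfitting);
    # a very large k flattens both scores (underfitting).
    print(k, model.score(X_tr, y_tr), model.score(X_te, y_te))
```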

8
Q

What is the most common strategy for handling ties in k-NN classification?

A

Use odd values of k to reduce tie probability.

9
Q

What is the Euclidean distance formula?

A

d(x, y) = √(Σᵢ(xᵢ - yᵢ)²)

10
Q

What is the Manhattan distance formula?

A

d(x, y) = Σᵢ|xᵢ - yᵢ|
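
Both formulas in code, on a made-up pair of vectors:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([4.0, 0.0, 3.0])

euclidean = np.sqrt(((x - y) ** 2).sum())  # √(9 + 4 + 0) = √13 ≈ 3.61
manhattan = np.abs(x - y).sum()            # 3 + 2 + 0 = 5
```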

11
Q

When is cosine similarity useful in k-NN?

A

When direction matters more than magnitude (e.g. text data).
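
A minimal sketch in NumPy (for k-NN, similarity is typically converted to a distance as 1 − similarity):

```python
import numpy as np

def cosine_similarity(a, b):
    # Dot product divided by the product of the norms: 1.0 means the
    # vectors point in the same direction, regardless of their lengths.
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

a = np.array([1.0, 2.0, 0.0])
print(cosine_similarity(a, 10 * a))  # -> 1.0: scaling changes magnitude, not direction
```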

12
Q

What is one-hot encoding used for in k-NN?

A

To handle categorical variables.

13
Q

What distance metric can be used with one-hot encoded features?

A

Hamming distance.
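
A tiny illustration with a hypothetical three-colour feature; Hamming distance just counts the positions where two encodings differ:

```python
# Hypothetical 3-category colour feature, one-hot encoded.
red   = [1, 0, 0]
green = [0, 1, 0]

# Hamming distance: number of positions where the encodings differ.
hamming = sum(r != g for r, g in zip(red, green))  # -> 2
```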

14
Q

What is Jaccard similarity used for?

A

Comparing overlap between sets.
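
A minimal sketch with made-up sets:

```python
def jaccard(a: set, b: set) -> float:
    # |intersection| / |union|
    return len(a & b) / len(a | b)

print(jaccard({"cat", "dog", "fish"}, {"dog", "fish", "bird"}))  # 2/4 = 0.5
```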

15
Q

What is the main drawback of k-NN?

A

Slow prediction and high memory usage.

16
Q

What do k-NN prediction curves show in classification tasks?

A

How predicted probabilities change across the input space.

17
Q

What does a sharp transition in a k-NN prediction curve indicate?

A

The model is highly sensitive to local data; likely overfitting.

18
Q

What does a smooth transition in a k-NN prediction curve indicate?

A

The model is generalizing better; less variance.

19
Q

What does a very flat k-NN prediction curve suggest?

A

The model is underfitting; it’s not responsive to class boundaries.

20
Q

How does increasing k affect the shape of the k-NN decision boundary?

A

It makes the boundary smoother and less sensitive to individual points.

21
Q

How does decreasing k affect the shape of the k-NN decision boundary?

A

It makes the boundary more complex and sensitive to noise.

22
Q

What is the tradeoff shown in k-NN decision boundary plots?

A

The bias-variance tradeoff.

23
Q

What kind of bias and variance do small k values typically produce?

A

Low bias, high variance.

24
Q

What kind of bias and variance do large k values typically produce?

A

High bias, low variance.

25

Q

Why are odd values of k often preferred in classification?

A

To avoid tie votes in binary classification, where an odd k cannot split the vote evenly.

26

Q

How is the best k usually chosen?

A

Using cross-validation.
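
One common recipe, assuming scikit-learn is available (the iris dataset and the range of k are stand-ins): score each candidate k by cross-validation and keep the best.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Mean 5-fold accuracy for each candidate k; keep the best-scoring one.
scores = {
    k: cross_val_score(KNeighborsClassifier(n_neighbors=k), X, y, cv=5).mean()
    for k in range(1, 21)
}
best_k = max(scores, key=scores.get)
```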

27

Q

What is a key advantage of k-NN?

A

It is simple and easy to implement.

28

Q

Does k-NN require training?

A

No, it is a lazy learner that defers computation to prediction time.

29

Q

Can k-NN adapt to complex decision boundaries?

A

Yes, especially with small k values.

30

Q

Does k-NN work with both classification and regression?

A

Yes, it supports both.

31

Q

Why is k-NN considered non-parametric?

A

Because it makes no fixed assumptions about the form of the data distribution; its complexity grows with the training data rather than being set by a fixed number of parameters.

32

Q

What is a major disadvantage of k-NN?

A

Slow predictions, especially on large datasets.

33

Q

Why is k-NN memory-intensive?

A

It stores the entire training dataset.

34

Q

How does k-NN handle irrelevant features?

A

Poorly: irrelevant or unscaled features can mislead distance calculations.

35

Q

Is k-NN sensitive to feature scaling?

A

Yes, features with larger ranges can dominate distance calculations.
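
A common remedy, assuming scikit-learn: standardize features before computing distances, for example in a pipeline (the scaler and k below are illustrative choices):

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# StandardScaler gives every feature mean 0 and unit variance, so no
# single feature dominates the distance computation.
model = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))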

36

Q

What happens to k-NN performance in high-dimensional spaces?

A

It degrades due to the curse of dimensionality.
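
A tiny demo of why: in high dimensions the nearest and farthest neighbours of a point become nearly equidistant, so "nearest" carries little information (random data, arbitrary sizes):

```python
import numpy as np

rng = np.random.default_rng(0)
for d in (2, 10, 100, 1000):
    X = rng.random((500, d))                      # 500 random points in d dimensions
    dists = np.linalg.norm(X - X[0], axis=1)[1:]  # distances from the first point
    print(d, dists.min() / dists.max())           # ratio creeps toward 1 as d grows
```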