Final (3) Flashcards

(17 cards)

1
Q

What does NB assume about features?

A

They are independent given the class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does BernoulliNB do?

A

Checks if features are 0 or non-zero for each class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does MultinomialNB use?

A

Average values of features per class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does a large alpha in NB do?

A

Adds smoothing and reduces model complexity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

NB model strengths?

A

Fast to train and predict, similar pros/cons to linear models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What do hidden layers allow neural nets to do?

A

Learn complex, non-linear decision boundaries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do activation functions do?

A

Add non-linearity to hidden layers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Neural net strengths?

A

Can capture patterns in large datasets and beat other models with tuning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Neural net weaknesses?

A

Long training time, needs preprocessing, works best with homogenous data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is GridSearchCV used for?

A

Finding the best model parameters through exhaustive search.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does a Pipeline do?

A

Combines preprocessing and modeling steps into one object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why do we use cross-validation?

A

To get a reliable average accuracy score and avoid information leakage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the R² score measure?

A

Goodness of fit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What makes a good model in terms of bias and variance?

A

Low bias (train accuracy close to 1), low variance (train ≈ test).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the goal of k-means clustering?

A

Minimize within-cluster sum of squares (inertia).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does Agglomerative Clustering work?

A

Starts with each point as its own cluster and merges the most similar ones.

17
Q

What does DBSCAN look for?

A

Dense regions in feature space.