naive bayes Flashcards

1
Q

The “naive” in Naive Bayes refers to:
A) The simplicity of its mathematical formula
B) The assumption that all predictors are conditionally independent
C) Its use only with numeric data
D) The use of small datasets

A

The assumption that all predictors are conditionally independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the Naive Bayes classifier estimate?
A) The linear relationship between predictors and outcome
B) The mode of the numeric predictors
C) The probability of each class given the predictor values
D) The average of categorical variables

A

The probability of each class given the predictor values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the key disadvantage of the full (exact) Bayes classifier?
A) It cannot handle categorical predictors
B) It requires calculating too many conditional probabilities
C) It requires the use of logistic regression
D) It overfits the training data by default

A

It requires calculating too many conditional probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why might a Naive Bayes classifier assign a class probability of zero?
A) There are missing values
B) The predictor value is continuous
C) The class does not exist in the dataset
D) The predictor value was not observed in the training set

A

The predictor value was not observed in the training set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What technique is commonly used to avoid zero probabilities in Naive Bayes?
A) Normalization
B) Data partitioning
C) Laplace smoothing
D) Standardization

A

Laplace smoothing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which of the following is not a benefit of Naive Bayes?
A) Fast computation
B) Works well with small datasets
C) Performs well even when independence assumption is violated
D) Handles categorical variables well

A

Works well with small datasets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Suppose 20% of flights are delayed. A Delta flight from DCA to LGA, Sunday morning, has a 0.3 probability of delay and 0.7 on-time. According to Naive Bayes, how is it classified?
A) Delayed
B) On-time
C) Uncertain
D) Cannot be classified

A

On-time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the effect of violating the conditional independence assumption?
A) Naive Bayes fails to run
B) Naive Bayes provides better probabilities
C) Rank ordering is still preserved for classification
D) The model becomes a logistic regression

A

Rank ordering is still preserved for classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which R function is used to train a Naive Bayes model?
A) glm()
B) naiveBayes()
C) bayes.train()
D) predictNB()

A

naiveBayes()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What kind of predictors can the standard Naive Bayes algorithm handle directly?
A) Only numeric
B) Only binary
C) Only text-based
D) Only categorical

A

Only categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly