Topic 17 Flashcards
(11 cards)
Bayes' Theorem
P(Y|X) = P(X|Y)P(Y) / P(X), where P(Y|X) is the posterior probability (the description given all the available data), P(X|Y) is the likelihood (how well Y predicts the data), P(Y) is the prior probability (based on prior information) and P(X) is the normalising constant
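A quick numeric sketch of the formula, using hypothetical numbers (a 95% likelihood, a 1% prior, and an evidence term of 0.058):

```python
# Worked Bayes' theorem example with hypothetical values.
# posterior = likelihood * prior / normalising constant
p_x_given_y = 0.95   # likelihood P(X|Y)
p_y = 0.01           # prior P(Y)
p_x = 0.058          # normalising constant P(X)

posterior = p_x_given_y * p_y / p_x
print(round(posterior, 3))  # ≈ 0.164
```

Note how a strong likelihood still yields a modest posterior when the prior is small.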
Product Rule for Independent Events
P(X|Y) = P(x1, x2, … , xn|Y) = P(x1|Y)P(x2|Y)…P(xn|Y)
Conditional Probability
The probability that A and B are both true equals the probability that A is true times the probability that B is true given that A is true: P(A and B) = P(A) x P(B|A)
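A tiny numeric check of this rule, using the standard two-aces card draw as a hypothetical example:

```python
# P(A and B) = P(A) * P(B|A): drawing two aces from a 52-card deck
# without replacement.
p_a = 4 / 52          # P(first card is an ace)
p_b_given_a = 3 / 51  # P(second is an ace | first was an ace)

p_a_and_b = p_a * p_b_given_a
print(round(p_a_and_b, 4))  # 12/2652 ≈ 0.0045
```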
Naive Bayes General Formula
P(Y|x1, … , xn) ∝ P(Y) Π i=1,n P(xi|Y)
Y With Maximum Probability
Ypred = argmax_Y P(Y) Π i=1,n P(xi|Y)
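A minimal sketch of the argmax rule, assuming hypothetical pre-computed priors and per-feature likelihood tables for a spam/ham task. Log probabilities turn the product into a sum and avoid underflow:

```python
import math

# Hypothetical class priors and per-word likelihoods (assumed values).
priors = {"spam": 0.4, "ham": 0.6}
likelihoods = {
    "spam": {"offer": 0.8, "meeting": 0.1},
    "ham":  {"offer": 0.2, "meeting": 0.7},
}

def predict(features):
    scores = {}
    for y, prior in priors.items():
        score = math.log(prior)          # log P(Y)
        for x in features:
            score += math.log(likelihoods[y][x])  # + log P(xi|Y)
        scores[y] = score
    return max(scores, key=scores.get)   # argmax over classes

print(predict(["offer"]))    # spam wins: 0.4*0.8 > 0.6*0.2
print(predict(["meeting"]))  # ham wins:  0.6*0.7 > 0.4*0.1
```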
Laplace/ Plus One Smoothing
Adding 1 to all occurrences / counts so that no probability is 0, since a single zero factor would zero out the whole Naive Bayes product
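A short sketch with hypothetical word counts, showing how add-one smoothing rescues a word never seen in a class:

```python
# Laplace (add-one) smoothing: without it, an unseen word gets
# probability 0 and zeroes out the whole product.
counts = {"offer": 3, "meeting": 0}  # hypothetical counts in one class
vocab_size = len(counts)
total = sum(counts.values())

def smoothed(word):
    # (count + 1) / (total + vocabulary size)
    return (counts[word] + 1) / (total + vocab_size)

print(smoothed("meeting"))  # (0+1)/(3+2) = 0.2, no longer zero
```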
Multinomial Naive Bayes
Document classification (eg. assigning an article to sports, politics etc or spam vs not spam)
Bernoulli Naive Bayes
Similar to Multinomial, but the features / predictors are Boolean variables (eg. the tennis weather example)
Gaussian Naive Bayes
When features are not discrete but take continuous values (assumed to be sampled from a Gaussian distribution); estimate a normal distribution with some mean and SD for each feature
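A sketch of the per-feature estimation step, using hypothetical training values for one feature in one class:

```python
import math

# Estimate mean and variance for one feature/class, then evaluate
# the normal density as the likelihood P(x|Y).
train = [4.9, 5.1, 5.0, 5.2]  # hypothetical training values
mean = sum(train) / len(train)
var = sum((x - mean) ** 2 for x in train) / len(train)

def gaussian_pdf(x, mean, var):
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

# Density (not probability) at a test value; densities can exceed 1.
print(gaussian_pdf(5.0, mean, var))
```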
Naive Bayes Advantages
* It is easy and fast to predict the class of a test dataset
* When the independence assumption holds, a Naive Bayes classifier performs well compared to other models
* It performs well with categorical input variables compared to numerical variables
Naive Bayes Disadvantage
Independent predictor assumption (in practice the variables are rarely fully independent)