Machine Learning NanoDegree Flashcards
(23 cards)
apply bias and variance to ( underfitting || overfitting )
bias - underfitting, variance - overfitting
define the harmonic mean for (x, y)
2xy / (x + y)
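A minimal sketch of the formula above; the harmonic mean is dominated by the smaller of the two values:

```python
def harmonic_mean(x, y):
    # 2xy / (x + y); equal inputs return themselves,
    # unequal inputs are pulled toward the smaller value
    return 2 * x * y / (x + y)

print(harmonic_mean(4, 4))  # 4.0
print(harmonic_mean(1, 9))  # 1.8, much closer to 1 than the arithmetic mean (5)
```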
what is the f1 score
the harmonic mean of precision and recall - raises a flag if either value is small
what is precision
the percent of labeled positives that are actually positive
what is recall
the percent of actual positives that are labeled positive
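The three cards above can be sketched from confusion-matrix counts (the counts here are illustrative):

```python
def precision(tp, fp):
    # fraction of predicted positives that are actually positive
    return tp / (tp + fp)

def recall(tp, fn):
    # fraction of actual positives that are labeled positive
    return tp / (tp + fn)

def f1(p, r):
    # harmonic mean of precision and recall
    return 2 * p * r / (p + r)

p = precision(tp=8, fp=2)   # 0.8
r = recall(tp=8, fn=8)      # 0.5
print(f1(p, r))             # ≈ 0.615, pulled toward the smaller value
```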
What is fbeta score?
an F score that allows weighting towards either precision or recall: beta = 1 gives the harmonic mean (F1), beta > 1 weights towards recall, 0 < beta < 1 weights towards precision
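The standard F-beta formula, sketched to show how beta shifts the score between precision and recall:

```python
def fbeta(p, r, beta):
    # F_beta = (1 + beta^2) * p * r / (beta^2 * p + r)
    return (1 + beta ** 2) * p * r / (beta ** 2 * p + r)

p, r = 0.9, 0.3
print(fbeta(p, r, 1))    # 0.45, the plain harmonic mean (F1)
print(fbeta(p, r, 2))    # ≈ 0.346, pulled toward recall (0.3)
print(fbeta(p, r, 0.5))  # ≈ 0.643, pulled toward precision (0.9)
```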
what is an ROC curve and how do you interpret it?
a plot of the true positive rate against the false positive rate across classification thresholds. The area under the curve close to 1 is good, 0.5 is random guessing
What is r2 score and how do you interpret it?
measures how much better a regression model fits the data than simply averaging all the points: r2 = 1 - SSE(model) / SSE(mean). close to 1 is good, close to 0 means no better than the mean
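A minimal sketch of the R² computation (data values are illustrative):

```python
def r2_score(y_true, y_pred):
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # model error
    ss_tot = sum((t - mean) ** 2 for t in y_true)               # error of just predicting the mean
    return 1 - ss_res / ss_tot

y_true = [1, 2, 3, 4]
y_pred = [1.1, 1.9, 3.2, 3.8]
print(r2_score(y_true, y_pred))  # 0.98, much better than the mean
```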
What is the point of having a bias node in a NN layer?
To provide the constant or intercept.
What is the ‘perceptron trick’ to get a line to move closer to a point?
if a negative point is labeled positive, subtract the point vector (with a 1 appended for the bias) times the learning rate from the line's coefficients; if a positive point is labeled negative, add it instead
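A sketch of one step of the trick (names and numbers are illustrative): nudge the line w·x + b toward a misclassified point by the learning rate:

```python
def perceptron_step(w, b, point, label, lr=0.1):
    # predict 1 if w·x + b >= 0 else 0
    pred = 1 if sum(wi * xi for wi, xi in zip(w, point)) + b >= 0 else 0
    if pred == 1 and label == 0:
        # negative point labeled positive: subtract point (and 1 for bias) times lr
        w = [wi - lr * xi for wi, xi in zip(w, point)]
        b -= lr
    elif pred == 0 and label == 1:
        # positive point labeled negative: add instead
        w = [wi + lr * xi for wi, xi in zip(w, point)]
        b += lr
    return w, b

w, b = perceptron_step([1.0, 1.0], 0.0, point=[2.0, 1.0], label=0)
print(w, b)  # [0.8, 0.9] -0.1 — the line moved toward the misclassified point
```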
What is the formula for multi-class entropy?
-sum(p[i] * log2(p[i]) for each class i)
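The formula above as a runnable sketch over a list of class probabilities:

```python
import math

def entropy(probs):
    # -sum(p_i * log2(p_i)); skip zero probabilities, whose contribution is 0
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(entropy([0.5, 0.5]))   # 1.0 — two equally likely classes, maximal uncertainty
print(entropy([1.0]))        # 0.0 — a certain outcome carries no entropy
print(entropy([0.25] * 4))   # 2.0 — four equally likely classes
```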
what does ‘naive’ refer to in naive bayes?
assuming that all features are conditionally independent of each other given the class.
a function must be ___ not ___ in order to be optimized
continuous, discrete
describe l2 regularization, including its alternate name
also called ridge regression, l2 regularization adds the sum of the squared coefficients to the cost function, scaled by a hyperparameter lambda. This penalizes the model for being too complex and reduces overfitting.
describe l1 regularization, including its alternate name
also called lasso regression, l1 regularization adds the sum of the absolute values of the coefficients to the cost function, scaled by a hyperparameter lambda. This penalizes the model for being too complex and reduces overfitting. It drives less important coefficients to 0 and is thus suitable for feature selection.
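The two penalties above, sketched as additions to a base cost (the coefficients, lambda, and base cost are illustrative):

```python
def l1_penalty(coefs, lam):
    # lasso: lambda * sum of absolute coefficient values
    return lam * sum(abs(c) for c in coefs)

def l2_penalty(coefs, lam):
    # ridge: lambda * sum of squared coefficient values
    return lam * sum(c ** 2 for c in coefs)

coefs = [3.0, -0.5, 0.0]
base_cost = 1.2  # hypothetical unregularized cost (e.g. MSE)
print(base_cost + l1_penalty(coefs, lam=0.1))  # 1.55
print(base_cost + l2_penalty(coefs, lam=0.1))  # 2.125 — large coefficients hurt more under l2
```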
What is a polynomial kernel and what does degree refer to?
A polynomial kernel of degree 2 projects two-dimensional data into 5 dimensions by adding the terms x^2, xy, and y^2. Higher-degree polynomials add more exponents and combinations, and therefore more dimensions.
is softmax for n=2 the same as sigmoid activation ?
yes
how is softmax defined?
for a vector of scores z, softmax(z[i]) = e^z[i] / sum(e^z[j] for all j)
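A sketch of the definition, also checking the earlier card that softmax for n=2 matches sigmoid (for logits (z, 0), the first softmax output is exactly sigmoid(z)):

```python
import math

def softmax(z):
    # e^z_i / sum of e^z_j; outputs are positive and sum to 1
    exps = [math.exp(zi) for zi in z]
    total = sum(exps)
    return [e / total for e in exps]

def sigmoid(x):
    return 1 / (1 + math.exp(-x))

z = 1.3
print(softmax([z, 0.0])[0])  # e^z / (e^z + 1) ...
print(sigmoid(z))            # ... which is the same value as 1 / (1 + e^-z)
```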
What is cross entropy ?
-sum(y[i] * log(p[i])) over classes; with one-hot labels this reduces to -log(P) for the true class
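A minimal sketch with a one-hot label vector (the probabilities are illustrative):

```python
import math

def cross_entropy(y, p):
    # -sum(y_i * log(p_i)); with one-hot y, only the true class term survives
    return -sum(yi * math.log(pi) for yi, pi in zip(y, p))

y = [0, 1, 0]          # true class is index 1
p = [0.2, 0.7, 0.1]    # model's predicted probabilities
print(cross_entropy(y, p))  # same as -log(0.7)
print(-math.log(0.7))
```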
What is the chain rule?
the derivative of a composition of functions is equal to the product of the derivatives of each of the functions
What is a monotonic function?
A function which is either entirely non-decreasing or non-increasing.
What is early stopping?
Stop training when the cross-validation error starts to increase
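A minimal early-stopping sketch (names, patience value, and error curve are illustrative): stop once validation error stops improving for a few epochs, and keep the best epoch seen:

```python
def train_with_early_stopping(val_errors, patience=2):
    # val_errors stands in for the per-epoch validation error of a real training loop
    best, best_epoch, waited = float("inf"), 0, 0
    for epoch, err in enumerate(val_errors):
        if err < best:
            best, best_epoch, waited = err, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                break  # error has stopped improving: stop training
    return best_epoch, best

# validation error dips, then rises: training stops and epoch 2 is kept
print(train_with_early_stopping([0.9, 0.5, 0.3, 0.4, 0.6]))  # (2, 0.3)
```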
how do you use a nn for regression instead of classification?
remove the final activation function and let it return the result of the last layer.