SLP Flashcards
(107 cards)
What is logistic regression called for more than two classes?
multinomial logistic regression
What is the most important difference between naive Bayes and logistic regression?
logistic regression is a discriminative classifier, naive Bayes is a generative classifier
Generative model
A model that should generate what its trying to learn.
Discriminative model
A model that only tries to learn to distinguish the classes.
Naive Bayes assigns a class c to a document d. How does it do this?
It computes a likelihood : P(d|c) and a prior: P(c).
What are the four components of a machine learning system for classification?
- feature representation of the input
- classification function that computes estimated class
- objective function that we want to optimize
- algorithm for optimizing the objective function
What are the two phases of logistic regression?
- training
- test
What is the bias term?
A real number that is added to the weighted inputs
how does a logisitc classifier make a decision on a test instance after learning the weights?
it multiplies each xi<\sub> by its weight wi<\sub>, sums the weighted features and adds the bias term.
What is the formula for z in logistic regression?
z = w*x + b
What does z express in logistic regression?
the weighted sum of the evidence for the class it is computed for
What do we to do z in logistic regression to create a probability?
Pass it through the sigmoid function (logistic function)
exp(x) ==
e^x
What is the sigmoid function?
1 / (1+ exp(-z) )
What does the sigmoid function do to the real number z?
it maps it into the range (0,1)
logit
the input to the sigmoid function:
z
Why is the input to the sigmoid function often called logit?
it’s the inverse of the sigmoid.
What is the logit function logit(p)?
sigma^(-1) (p)
ln(p/(1-p))
Period disambiguation
Deciding if a period is the end of a sentence or part of a word.
Representation learning
Ways to learn features automatically in an unsupervised way from the input
What does it mean to standardize input values?
Rescale them so they have zero mean and a standard deviation of one.
What is the formula to normalize input features values to lie between 0 and 1?
(xi<\sub> - min(xi<\sub>) / (max(xi<\sub>) - min(xi<\sub>))
What is softmax regression another name for?
multinomial logistic regression
hard classification
when an observation can not be in multiple classes