Supervised Learning Flashcards
(10 cards)
What is supervised learning?
Learns a mapping from inputs x = (x1,…,xd)^T ∈ X to outputs y ∈ Y given a training set of input-output pairs T = {(x1, y1),…,(xn, yn)} where d = dimensions and n = size of training set.
Can dimensions be different types?
Yes, dimensions can be of different times. Many assume numeric inputs but some accept ordinal, categorical or non-structured data types.
What is the hypothesis set?
The hypothesis set is the set of all possible functions that a machine learning approach can learn.
What is the aim of the learning algorithm?
The learning algorithm searches for the hypothesis g ∈ H that best appropriates f given T
What is the hypothesis formula?
h(x) = wTx + b
How do we account for noise?
By considering a distribution over the target function, input, final hypothesis and target.
What is the unknown target function distribution?
f : p(x|y)
What is the unknown input distribution?
p(x)
What is the final hypothesis distribution?
p(y|x)
What is the unknown target distribution?
p(y|x)