Intro Flashcards
(86 cards)
Why is machine learning popular?
-Lots of data available
-current control theory methods struggle to solve large scale complex problems
What are the types of supervised learning?
regression and classification
what are the types of unsupervised learning?
clustering and dimensionality reduction
what are the types of reinforcement learning?
Value iteration and policy iteration
What is supervised learning?
a function that maps an input to an output based on labelled example input output pairs
what is unsupervised learning?
an algorithm that learns patterns from un labelled data
What is the key difference between regression and classification?
in regression the data is continuous whereas discrete data is used for classification
How does regression work?
find a function that minimises a cost function (most often mean squared error)
Describe a nearest neighbour model?
individual data point is grouped depending on proximity
Describe a piecewise linear model
data follows different linear trends over different regions of the data
What are some model types?
Linear, low order polynomial, high order polynomial, piecewise linear, nearest neighbour
When does overfitting occur?
- when a model fits the data set too well and is unable to generalise
- low density of data
What is a characteristic of overfitting?
oversensitivity to measurement noise
How can overfitting be avoided?
do not use a model that is more complicated than required (Occam’s razor)
What is a white box model?
-increased system information
-low model uncertainty
What is a black box model?
-decreased system information
-high model uncertainty
what is inference?
the process in which prediction is made
What does the expected mean square error of the prediction depend on?
bias and variance
what is meant by high bias?
model fails to capture the underlying structure of the data (underfitting)
what is meant by high variance?
model is sensitive to small fluctuations in the data (overfitting)
when is variance high?
in complex models
what is the bias-variance trade-off?
If biased is increased then variance decreases and vice versa. Therefore need to minimise both bias and variance.
what is meant by error?
the error between the true value and the predicted value
what happens in simple linear regression?
identify a line of best fit y=a0+a1x+err, where a0 and a1 need to be determined