Classifiers Flashcards by Matthew Gilbert

What is the difference between classification and regression?

Classification aims to split discrete data into categories, whereas regression aims to model continuous data.

How well did you know this?

Not at all

Perfectly

What type of learning is classification?

Supervised

How well did you know this?

Not at all

Perfectly

What technique was used in the first classifiers?

Logistics Regression with x as continuous data and y as binary data

How well did you know this?

Not at all

Perfectly

How do you fit a logistics regression using sklearn?

log_reg.fit(x_train, y_train)

How well did you know this?

Not at all

Perfectly

How do you find the score of a logistics regression using sklearn?

log_reg.score(x_test, y_test)

How well did you know this?

Not at all

Perfectly

How is score calculated for logistics regression?

mean accuracy on given test data and labels

How well did you know this?

Not at all

Perfectly

What is a perceptron?

A perceptron is a function that aims to draw a line or plane to separate two categories of data.

How well did you know this?

Not at all

Perfectly

What does a perceptron aim to learn?

It aims to learn the weights that allow it to best classify data. This is the same as learning the coefficient of the line.

How well did you know this?

Not at all

Perfectly

How is a perceptron trained?

Weights initialised as 0
Cycle through the data
for each x try classifying it: y = f(wx + b)
update w: w = w + α(y - y)x
If the prediction is correct w is not updated

How well did you know this?

Not at all

Perfectly

What is a perceptrons main weakness?

They rely on data having linear separability. If a straight line can’t be drawn to separate the data, then a perceptron won’t work.

How well did you know this?

Not at all

Perfectly

What is a Multi-Layer Perceptron?

The simplest type of Neural Network that contains hidden layers made up of a number of perceptrons.

How well did you know this?

Not at all

Perfectly

What is the advantage of an MLP?

MLP’s don’t require the data to be linearly separable so are more powerful.

How well did you know this?

Not at all

Perfectly

What technique is used to fit an ML model?

Gradient Descent

How well did you know this?

Not at all

Perfectly

What are the two types of parameter fitting method?

Deterministic and Stochastic

How well did you know this?

Not at all

Perfectly

2 answers

How is error claculated in a classifier?

Error is calculated using either L1 or L2 norm.

How well did you know this?

Not at all

Perfectly

Sum

What is L1 Norm?

Study These Flashcards

The sum of the absolute errors

What is L2 Norm?

Study These Flashcards

The root of the sum of squared errors

What are other names for the error?

Study These Flashcards

The cost function or loss function

How does gradient descent work?

Study These Flashcards

Start at random point p1
calculate loss at p1
take a step in the direction where the loss has the steepest gradient
calculate loss at new point
repeat until the loss gradient is less than a threshold or until N steps

What is a confusion matrix?

Study These Flashcards

A representation of the predicated values and if they are true/false positive or true/false negatives.

How is precision calculated?

Study These Flashcards

True Positive/ (True Positive + False Positive)

How does high precision present?

Study These Flashcards

an example labelled as positive is likely to be positive (small number of false positives)

How is recall calculated?

Study These Flashcards

TP/(TP+FN)

How does recall present?

Study These Flashcards

a class is correctly recognised so there are a small number of false negatives

What does high recall but low precision mean?

Most of the positive samples are correctly recognised but there are lots of false positives

What does High precision and low recall mean?

It misses a lot of positive examples but those predicted as positive are indeed positive

How is an F1 score calculated?

F1 = 2 x (precision x recall)/(precision + recall)

What is a ROC Curve?

A graph that visualises the performance of a binary classifier at different classification thresholds

What is the ideal shape of a ROC curve?

Upper case Gamma (Γ)

How are different ROC curves compared?

By calculating and comapringg there area under curves (AUC)

What does the AUC measure?

The AUC measures the quality of a classifier's predictions over multiple thresholds and will work for any classifier.

Classifiers Flashcards

(31 cards)