Classifier Models Flashcards

(23 cards)

1
Q

What are classification models?

A

Classification models are used to classify observations by some discrete label (i.e. gender, revenue bounds)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do classification models create predictions?

A

Separating observations using a plane or cluster in order to group them by label

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Is classification an unsupervised or supervised learning task?

A

Supervised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Logistic regression is useful when we have [continuous/discrete/binary] observations and [continuous/discrete/binary] labels.

A

Continuous, binary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Logistic regression graphs continuous observations onto a graph with only […] label values.

A

Two/binary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

By adjusting the parameters of a logistic regression, we adjust…

A

The steepness of the ‘middle curve’ in our sigmoid-like function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The accuracy of a classifier is calculated as…

A

The number of true positives and negatives divided by the total number of predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do perceptrons classify data?

A

By drawing a hyperplane between the binary data, separating them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a superplane?

A

A hyperplane is a mathematical line that attempts to separate two classes of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do perceptrons learn how to properly classify data?

A

By taking input signals and passing them through a layer that applies a weight to each feature of the input, finishing with a function that converts that output into a binary value like softmax

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does a perceptron’s training phase look like?

A

Initialise each weight at 0, and cycle through the data. For each x, try classifying it, then update each weight according to some update rule ONLY if it is correct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the problem with a basic perceptron?

A

They have linear separability, meaning they can only separate the data with one line, making more complex shapes difficult

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a multilayer perceptron?

A

An adaptation of a basic perceptron that uses multiple weighted layers to simulate multiple layers of neurons, called hidden layers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the layers between the input and output of a multilayer perceptron called?

A

Hidden layers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is K-Nearest Neighbours, and what type of problem is it used for?

A

A supervised learning algorithm that models similarity via distance, and it is used for classification problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does K-Nearest Neighbours facilitate predictions?

A

Splitting the dataset into clusters, each of which represent a different class cluster, predicting new values as the cluster it is closest to

17
Q

How does K-Nearest Neighbours make a prediction given one new data point?

A

Calculate the distance to the other points, then sort that list and select the K nearest points. Find the majority class among those 3 neighbours to find the new point’s class

18
Q

How can outliers affect K-Nearest Neighbours?

A

Since we predict using a distance metric, outliers can skew how often we predict a certain class given its position on the graph

19
Q

How can class imbalance affect K-Nearest Neighbours?

A

Our predictions are based on a number of nearest neighbours, so if we simply have more neighbours than the other class, we are more likely to select the other, even if the nearest one is more accurate

20
Q

How can we select the optimal parameters for K-Nearest Neighbours?

A

Starting with one and increasing k by 1 each time, or setting k to the square root of the number of data points in the training dataset

21
Q

What is Weighted KNN?

A

A variant of KNN where we make the assumption that the impact of nearer neighbours should be greater than the further neighbours, using distance to consider impact

22
Q

Why does complexity increase with the size of the training data?

A

KNN is a form of instance-based learning, meaning they construct hypotheses directly from training instances, therefore as the data gets larger, we encounter slower training - O(n)

23
Q

KNN is most suited to [higher/lower] dimensional data.