L18 - Support Vector Machine Flashcards

1
Q

What type of model is a SVM?

A

Non-probabilistic Supervised Classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the goal of SVM?

A

Find the optimal plane through a set of data points. That being, the one with the largest margin from the plane to the support vectors.

This enables greater classification capability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In a feature space, what do the dimensions represent?

A

Features. I.e if there are 5 features, there are 5 dimensions in the feature space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does SVM classify data?

A

The feature space is partitioned, each partition represents a class. The SVM places the data about the partition planes to classify it.

For example, 2 dimensions, data would be place above or below the plane for classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does it mean the SVM is non-probabilistic?

A

The classification is based purely on features of the data point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the Kernel Trick enable SVM to do?

A

Make classification of non-linear data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are Support Vectors?

A

Data points closest to the separation plane.

They are called this since the support the classification decision boundaries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define the Maximum Margin Classifier…

A

When we classify based on the threshold that gives us the largest margin between the place and the support vectors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an issue with using Max. Margin Classifier?

A

Prone to outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the solution to the outlier problem?

A

Enable the SVM to make misclassifications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Soft Margin of the SVM?

A

If misclassifications are allowed, the soft margin is the margin between the support vectors and the separation plane.

But the soft margin enables misclassifications, meaning that outliers on the wrong side of the separation plane will be classified to prevent outlier impacting classification.

Thus, misclassification is allowed to enhance the general classification capability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we know if we have non-linear classification?

A

If data overlaps in the features space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

If we have a non-linear problem, what transformation do we perform on the data?

A

A mapping transformation in which overlapping features are bought into higher dimensions. This means if the data is not linearly separable in one dimension, it should be separable in a higher dimensional space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the name of the function that performs higher dimensionality mapping? Given an example of one

A

Kernel Function

Example : X mod 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the issue with non-linear SVM?

A

Computational cost of operations increase greatly when they have to be performed in higher dimensions.

This is infeasible for large data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the solution to the high computational cost of kernel functions in high dimensional space?

A

Kernel Trick - Enables the data transformation to be applies without moving to higher dimensional feature space.

17
Q

How does the Kernel Trick work?

A

Create a Kernel Matrix M = NxN where N is the number of data points in the feature space.

Kernel Function performs a dot product on each matrix cell M_i,j

18
Q

What is a Kernel Function?

A

A function that takes 2 feature vectors from the features space, and returns the dot-product to the feature space.

19
Q

What type of value does a Kernel Function return?

A

A real number.

20
Q

What are some Advantages and Disadvantages of SVM?

A

Advantages:
- Effective in higher dimensions
- Capability of non-linear classifications
- Memory efficient

Disadvantages:
- If feature count is greater than the number of data points, the feature space will be greater than the sample count. This will lead to poor classification capability.