VC Dimension Flashcards

(9 cards)

1
Q

What are dichotomies?

A

Dichotomies are hypothesis limited to the eyes of inputs. We look at the maximum kind of lines w.r.t N inputs, which is <= 2^N. When the change in Eout and Ein is small, the two hypothesis are similar.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the growth function to calculate the maximum number of dichotomies.

A

mH(n).

The larger mH(n), the more expensive or more complicated. Dependent on H and N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When does H shatter x1,..,xn?

A

When the hypothesis set is able to generate all 2^N dichotomies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the break point for dichotomies?

A

If no k inputs can be shattered by H, k is a break point for H.

e.g. for 2D perceptron’s, the break point is N = 4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is mH(n) not a good replacement for M?

A

Because it is exponential. However, when N is large we can change the upper bound such that its polynomial

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the Ein Eout inequality using the growth function?

A

P[|Ein(g) - Eout(g) > ε|] <= 4 mH(2N) exp(-1/8 ε^2 N)

If k exists, we can replace the bound with 4(2N)^k-1 exp(-1/8 ε^2 N)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the VC dimension of a hypothesis?

A

dvc(H) is the largest value for N for which H can shatter all N training examples. It is the maximum non-breaking point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does a high dvc(H) mean?

A

A high dvc means a higher model complexity but smaller in-sample error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly