nearest neighbor Flashcards

don't forget kd trees (13 cards)

1
Q

what do you do in nearest neighbors classification?

A

identify what training neighbor is most similar to the target and take the class label from it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what are some advantages of nearest neighbor classification?

A

simplicity, interprtability, non-linearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is the issue with nearest neighbor classification?

A

devising the right distance function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are the mathematical properties of distance measures?

A

positive, if the distance is 0 the points must be equal, the distance from x to y must be the distance from y to x bc symmetry, triangle inequality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the generalized idea of nearest neighbor?

A

function interpolation by averaging the values of the k nearest points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how can you adjust nearest neighbors classification for more robust classification or interpolation?

A

using the k closet neighbors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what happens to the boundaries as k gets larger in k nearest neighbors?

A

smoother boundaries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is the time complexity for to find NN given n points in d dimensions?

A

O(nd)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are voronoi diagrams?

A

diagrams that partition space into regions sharing nearest neighbors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are kd trees?

A

efficient algorithm that helps to find nearest neighbors in low dimensions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is locally sensitive hashing?

A

could speed nearest neighbor search if nearby points got hashed to the same bucket. loally sensitive hashing works on hashing points so that points/ vectors that are close to each other are hashed intot he same buckets. (do this by picking planes through the origin and cut through them)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what are some examples of network data?

A

social networks, www, product/ customer networks, genetic networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is page rank?

A

algorithm to find the probability that a random walk on G ends up at vertex v. looks at hyperlinks to different sites. allows for random jumps (damping factor)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly