Week 6 Flashcards

1
Q

K means algo

A

If 2 mean points are equidistant from point, choose one with lower index

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

K median algo

A

As image except 2.1.2 is not abs value squared

Eg for (3,4) and (4,2) manhattan dist is 3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

K means ++

A

Same as K means but to initialise mean points use

When selecting centroids 2,…, k, you are again choosing a random number each time to pick a point to use as a centroid. The probability distribution can be thought of as a number line and you are picking the number immediately to the right of the randomly generated P

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to choose k for k means etc

A

K means score is sum of square distances of points from associated centroids

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

K means and K median are examples of what kind of clustering

A

Flat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

2 types of hierarchical clustering

A

Agglomerative (bottom up)

Divisive (top down)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Agglomerative clustering on a graph

A

Assuming each edge represents the similarity of the 2 nodes it connects

Begin with each node is a cluster

Find the single edge with largest value

Let those 2 nodes be one cluster

Repeat until k = # of clusters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to visualise the stages of clustering in Agglomerative

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Bisection K means

A

(Divisive) begin at one cluster and split using sparsest cut

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sparsest cut

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Single agglomerative clustering

A

Single: edge with highest similarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Average linkage agglomerative clustering

A

Average: sum all edges between 2 clusters divided by # of edges between 2clusters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Complete linkage agglomerative clustering

A

Complete linkage: minimax , combine clusters with highest least similar edge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly