lecture 11 Clustering Flashcards

1
Q

k means

A

Convex in space
Cluster boundaries are In the middle of centers
Can’t model covariants
Only simple cluster shapes
——
Naive implementation
Fast
Not good for large dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Agglomeratie clustering

A

Start with all point in their own-> 2-> 3 -> hierarchical
Dendograms
Merging criteria: complete/ average / single/ward( smallest increase)

——————
Restrict to input topology
Fase with sparse connectivity
May link to imbalanced
Give more holistic view

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Dbscan

A

Core
Allows complex cluster shapes
Can detect outliers
Two parameters to adjust
Learn arbitrary cluster shapes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Mixture models

A

Data is mixture of small number of known distributions
Find p(x)
Guasisian
Non-convex
Parametric density
How likely a new point is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bayesian infinite mixture

A

Add priors on mixture coefficients and Gaussian
Can I unselect components if they don’t contribute
Possibly more robust
Replace mixture coefficients by dirichelet process
Automatically finds number of components

How well did you know this?
1
Not at all
2
3
4
5
Perfectly