Unsupervised Learning Flashcards
(10 cards)
What is an unsupervised learning algorithm?
An algorithm that doesn’t require the true labels of the data.
What are some real uses of unsupervised learning?
- Clustering
- Topic Modelling
- Community Detection
What are the main steps to clustering?
- Go over all datapoints
- Measure the distance or similarity between points
- Identify clusters of points that are closer to each other than they are to points outside the cluster.
What are two examples of clustering algorithms?
- K-means
- DBSCAN
How does K-mean clustering work?
It identifies points close to k centroids, where K is given by the user
How does DBSCAN work?
It finds core regions of high density and expands clusters form them
What are two types of clustering?
Hard Clustering and Soft Clustering
What is hard clustering?
Each object belongs in one cluster, similar to how a perceptron performs classification
What is soft clustering?
Objects are assigned to multiple clusters with corresponding probabilities.
What is Hierarchical Clustering?
A clustering algorithm that assigns a data point to multiple clusters, assigning them to all the clusters of their parent nodes.