GMM Flashcards

(28 cards)

1
Q

What is the key limitation of K-means that GMMs address?

A

K-means makes hard assignments and can’t handle overlapping clusters.

2
Q

What type of clustering does a GMM perform?

A

Soft clustering using probabilistic assignments.

3
Q

What is the basic assumption of a Gaussian Mixture Model?

A

Data is generated from a mixture of several Gaussian distributions.

4
Q

What does the mixture weight πₖ represent in GMMs?

A

The prior probability of cluster k.

5
Q

What does the term p(x | c, θ) represent in a GMM?

A

The likelihood of x given that it came from cluster c.

6
Q

What is the full expression for the probability of x in a GMM?

A

p(x) = Σₖ πₖ · N(x | μₖ, Σₖ)

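A minimal Python sketch of this formula for a toy one-dimensional, two-component mixture; the weights, means, and standard deviations below are made-up illustrative values:

```python
import numpy as np
from scipy.stats import norm

# Toy 1-D mixture with two components (illustrative, made-up parameters).
pi = np.array([0.3, 0.7])      # mixing weights pi_k (sum to 1)
mu = np.array([-2.0, 1.5])     # component means mu_k
sigma = np.array([0.5, 1.0])   # component standard deviations

def mixture_density(x):
    # p(x) = sum_k pi_k * N(x | mu_k, sigma_k^2)
    return np.sum(pi * norm.pdf(x, loc=mu, scale=sigma))

print(mixture_density(0.0))    # mixture density evaluated at x = 0
```
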
7
Q

What kind of distribution does each component in a GMM represent?

A

A multivariate Gaussian distribution.

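For reference, each component density is the standard d-dimensional Gaussian:

```latex
\[
\mathcal{N}(x \mid \mu_k, \Sigma_k)
  = \frac{1}{(2\pi)^{d/2}\,\lvert\Sigma_k\rvert^{1/2}}
    \exp\!\Big( -\tfrac{1}{2}\,(x-\mu_k)^{\top}\Sigma_k^{-1}(x-\mu_k) \Big)
\]
```
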
8
Q

What is the role of the covariance matrix in a GMM?

A

It controls the shape and orientation of each Gaussian component.

9
Q

What is the posterior probability p(c | x, θ) used for in GMMs?

A

It represents the responsibility or soft assignment of x to cluster c.

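Concretely, the responsibility follows from Bayes' rule applied to the mixture; writing qₙₖ for the responsibility of component k for point xₙ (the same quantity used in the M-step formulas below):

```latex
\[
q_{nk} \;=\; p(c = k \mid x_n, \theta)
       \;=\; \frac{\pi_k\,\mathcal{N}(x_n \mid \mu_k, \Sigma_k)}
                  {\sum_{j=1}^{K} \pi_j\,\mathcal{N}(x_n \mid \mu_j, \Sigma_j)}
\]
```
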
10
Q

What does the EM algorithm optimize in GMMs?

A

The log-likelihood of the observed data under the model.

11
Q

What is computed in the E-step of EM for GMMs?

A

Responsibilities: the posterior probabilities of each cluster for each point.

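A minimal NumPy/SciPy sketch of the E-step, assuming data X of shape (N, d) and current parameters pi (K,), mu (K, d), Sigma (K, d, d); the function and variable names are illustrative, not from the cards:

```python
import numpy as np
from scipy.stats import multivariate_normal

def e_step(X, pi, mu, Sigma):
    """Compute responsibilities q[n, k] = p(c = k | x_n, theta) for every point."""
    N, K = X.shape[0], len(pi)
    q = np.zeros((N, K))
    for k in range(K):
        # unnormalised posterior: pi_k * N(x_n | mu_k, Sigma_k)
        q[:, k] = pi[k] * multivariate_normal.pdf(X, mean=mu[k], cov=Sigma[k])
    q /= q.sum(axis=1, keepdims=True)   # normalise over components (Bayes' rule)
    return q
```
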
12
Q

What is updated in the M-step of EM?

A

The means, covariances, and mixing proportions of each Gaussian component.

13
Q

What is the formula for updating μₖ in the M-step?

A

μₖ = (Σₙ qₙₖ xₙ) / (Σₙ qₙₖ)

14
Q

What is the formula for updating Σₖ in the M-step?

A

Σₖ = (Σₙ qₙₖ (xₙ - μₖ)(xₙ - μₖ)ᵀ) / (Σₙ qₙₖ)

15
Q

What is the formula for updating πₖ in the M-step?

A

πₖ = (1/N) Σₙ qₙₖ

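Putting the update formulas from cards 13–15 together, a minimal NumPy sketch of the M-step, assuming responsibilities q of shape (N, K) produced by an E-step like the sketch above (names and shapes are illustrative):

```python
import numpy as np

def m_step(X, q):
    """Update (pi, mu, Sigma) from data X (N, d) and responsibilities q (N, K)."""
    N, d = X.shape
    Nk = q.sum(axis=0)                    # effective number of points per component
    pi = Nk / N                           # pi_k = (1/N) * sum_n q_nk
    mu = (q.T @ X) / Nk[:, None]          # mu_k = sum_n q_nk x_n / sum_n q_nk
    Sigma = np.zeros((len(Nk), d, d))
    for k in range(len(Nk)):
        diff = X - mu[k]                  # rows are x_n - mu_k
        # Sigma_k = sum_n q_nk (x_n - mu_k)(x_n - mu_k)^T / sum_n q_nk
        Sigma[k] = (q[:, k, None] * diff).T @ diff / Nk[k]
    return pi, mu, Sigma
```
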
16
Q

What does the EM algorithm guarantee?

A

That the data log-likelihood never decreases from one iteration to the next (it is monotonically non-decreasing).

17
Q

Does EM always find the global maximum of the log-likelihood?

A

No, EM finds a local maximum and is sensitive to initialization.
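A sketch of the outer EM loop built on the e_step and m_step sketches above; the tolerance, iteration cap, and convergence check are illustrative choices, and in practice several random initialisations are run because of this sensitivity:

```python
import numpy as np
from scipy.stats import multivariate_normal

def fit_gmm(X, pi, mu, Sigma, max_iter=100, tol=1e-6):
    """Alternate E- and M-steps until the log-likelihood stops improving."""
    prev_ll = -np.inf
    for _ in range(max_iter):
        q = e_step(X, pi, mu, Sigma)          # soft assignments (E-step)
        pi, mu, Sigma = m_step(X, q)          # parameter updates (M-step)
        # log-likelihood: sum_n log sum_k pi_k * N(x_n | mu_k, Sigma_k)
        dens = np.stack([pi[k] * multivariate_normal.pdf(X, mean=mu[k], cov=Sigma[k])
                         for k in range(len(pi))], axis=1)
        ll = np.log(dens.sum(axis=1)).sum()
        if ll - prev_ll < tol:                # non-decreasing, so this detects convergence
            break
        prev_ll = ll
    return pi, mu, Sigma
```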

18
Q

What does a GMM reduce to when covariances become isotropic and identical?

A

K-means clustering: when all components share the same isotropic covariance and the variance shrinks toward zero, the soft assignments become hard and EM reduces to the K-means updates.

19
Q

How does K-means relate to GMMs conceptually?

A

K-means is a special case of GMM with hard assignments and fixed variances.

20
Q

What does GMM use instead of distance for assignment?

A

Probability density functions based on Gaussian distributions.

21
Q

What type of clustering method is a GMM?

A

A generative, probabilistic clustering model.

22
Q

What kind of outputs does GMM provide for each data point?

A

Probabilities of membership in each cluster.

23
Q

What is the main advantage of GMM over K-means?

A

It can model elliptical clusters and overlapping data regions.

24
Q

Why is GMM considered more flexible than K-means?

A

Because it learns full covariance matrices and uses soft assignments.

25
Q

In which step are cluster labels assigned in GMM?

A

After computing responsibilities in the E-step.

26
Q

What does the log-likelihood function in GMM involve?

A

A log of a sum over weighted Gaussians for each data point.

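Written out, this is the objective that EM maximises:

```latex
\[
\log p(X \mid \theta) \;=\; \sum_{n=1}^{N} \log \sum_{k=1}^{K} \pi_k\,\mathcal{N}(x_n \mid \mu_k, \Sigma_k)
\]
```
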
27
Q

What does the E-step of EM depend on?

A

The current parameter estimates of the Gaussians and mixing proportions.

28
Q

What makes GMMs better suited for overlapping clusters?

A

They assign probabilities to multiple clusters instead of picking just one.