11.1 Mixture models Flashcards

1
Q

Which of these produces strictly non-overlapping clusters?

A
  • K-means

K-means produces non-overlapping clusters. Soft k-means and Gaussian mixtures are probabilistic clustering methods. In hierarchical clustering, some clusters are contained within other clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Suppose you input a pair of values [a b] to softmax, b > a. Which of these is a possible output (after rounding to the nearest float)?

A
  • [0.0 1.0]

Softmax outputs a probability distribution – values can only be in the range 0-1 and they sum to 1. The rank order of values in the output is the same as in the input, so if b>a, the second value of the output must be higher than the first.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Model likelihood is guaranteed to increase after each iteration of the EM algorithm.

A
  • Likelihood of the instances given the current model parameters

The expectation step computes the likelihood of the instances given the current model parameters. In the maximization step, the model parameters are updated to maximize the likelihood.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which of these is computed during the expectation step of the EM algorithm?

A
  • Likelihood of the instances given the current model parameters

The expectation step computes the likelihood of the instances given the current model parameters. In the maximization step, the model parameters are updated to maximize the likelihood.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When using the supervised method to evaluate a clustering result, a good cluster model should have ___ purity and ___ entropy.

A
  • high, low

Both measures reflect the consistency of labels within clusters. If the labels are very consistent within clusters, the purity will be high and the entropy will be low.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly