EM Flashcards
(27 cards)
What problem does Expectation Maximization (EM) solve?
It optimizes likelihood in models with hidden or latent variables.
What is the basic idea behind EM?
Alternating between estimating hidden variables and optimizing model parameters.
What are the two main steps in EM?
E-step (Expectation) and M-step (Maximization).
What happens in the E-step of EM?
Compute the posterior distribution of the latent variables given current parameters.
What happens in the M-step of EM?
Update parameters to maximize the expected complete-data log-likelihood.
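Note: a compact way to state this M-step objective (notation assumed here, with q the E-step posterior):

```latex
Q(\theta, \theta^{\text{old}})
  = \mathbb{E}_{z \sim q}\!\left[\log p(x, z \mid \theta)\right]
  = \sum_z q(z)\,\log p(x, z \mid \theta),
\qquad
\theta^{\text{new}} = \arg\max_\theta Q(\theta, \theta^{\text{old}})
```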
Why can’t we directly maximize the likelihood in latent variable models?
Because the log of a sum over hidden variables doesn’t simplify easily.
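Note: as a sketch of why this is hard (general latent-variable notation assumed), the marginal log-likelihood is

```latex
\log p(x \mid \theta) = \log \sum_z p(x, z \mid \theta)
```

The log sits outside the sum over the hidden variable z, so it cannot be split into simple per-component terms the way the complete-data log-likelihood can.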
What is the name of the function that EM maximizes as a lower bound?
The Evidence Lower Bound (ELBO) or free energy.
Why does EM increase the log-likelihood at every iteration?
Because the E-step makes the lower bound tight at the current parameters and the M-step then maximizes that bound, so the true log-likelihood can never decrease.
When does the ELBO become tight (equal to the log-likelihood)?
When the approximate posterior matches the true posterior.
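Note: one way to see the previous two answers at once is the standard decomposition (q denotes the approximate posterior; notation assumed):

```latex
\log p(x \mid \theta)
  = \underbrace{\mathbb{E}_{q(z)}\!\left[\log \frac{p(x, z \mid \theta)}{q(z)}\right]}_{\text{ELBO}(q,\theta)}
  + \underbrace{\operatorname{KL}\!\left(q(z)\,\middle\|\,p(z \mid x, \theta)\right)}_{\ge 0}
```

The KL term vanishes exactly when q equals the true posterior, which is when the bound touches the log-likelihood.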
What distribution is commonly used in the E-step for GMMs?
The posterior responsibilities: p(c | x, θ).
What does qₙc represent in GMM EM?
The probability that data point xₙ belongs to cluster c.
What is the formula for qₙc in the E-step of GMMs?
qₙc = πc · N(xₙ | μc, Σc) / Σₖ πₖ · N(xₙ | μₖ, Σₖ)
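Note: a minimal E-step sketch of this formula, assuming NumPy/SciPy and hypothetical names (X for the data, pis, mus, Sigmas for the parameters):

```python
import numpy as np
from scipy.stats import multivariate_normal

def e_step(X, pis, mus, Sigmas):
    """Compute responsibilities q[n, c] = p(c | x_n, theta) for a GMM."""
    N, C = X.shape[0], len(pis)
    q = np.zeros((N, C))
    for c in range(C):
        # Unnormalized responsibility: pi_c * N(x_n | mu_c, Sigma_c)
        q[:, c] = pis[c] * multivariate_normal.pdf(X, mean=mus[c], cov=Sigmas[c])
    # Normalize over components so each row sums to 1
    return q / q.sum(axis=1, keepdims=True)
```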
What is updated during the M-step of GMM EM?
The means, covariances, and mixing coefficients of the Gaussians.
What is the formula for updating μc in GMM EM?
μc = Σₙ qₙc xₙ / Σₙ qₙc
What is the formula for updating Σc in GMM EM?
Σc = Σₙ qₙc (xₙ - μc)(xₙ - μc)ᵀ / Σₙ qₙc
What is the formula for updating πc in GMM EM?
πc = (1/N) Σₙ qₙc
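Note: a matching M-step sketch implementing the three update formulas above (same assumed names as the E-step sketch):

```python
import numpy as np

def m_step(X, q):
    """Update GMM parameters from data X (N x D) and responsibilities q (N x C)."""
    N, D = X.shape
    Nc = q.sum(axis=0)                      # effective number of points per cluster
    pis = Nc / N                            # mixing coefficients: pi_c = (1/N) sum_n q_nc
    mus = (q.T @ X) / Nc[:, None]           # weighted means: mu_c = sum_n q_nc x_n / sum_n q_nc
    Sigmas = []
    for c in range(q.shape[1]):
        diff = X - mus[c]                   # (N, D)
        # Weighted scatter: sum_n q_nc (x_n - mu_c)(x_n - mu_c)^T / sum_n q_nc
        Sigmas.append((q[:, c, None] * diff).T @ diff / Nc[c])
    return pis, mus, np.array(Sigmas)
```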
What type of model is a Gaussian Mixture Model (GMM)?
A probabilistic generative model with latent variables.
How is EM related to K-means?
K-means is a limiting case of GMM with hard assignments and small variance.
When does GMM reduce to K-means?
When covariances are isotropic and approach zero.
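Note: a brief worked limit showing this, assuming isotropic covariances Σc = σ²I with positive mixing weights:

```latex
q_{nc} = \frac{\pi_c \exp\!\left(-\lVert x_n-\mu_c\rVert^2 / 2\sigma^2\right)}
              {\sum_k \pi_k \exp\!\left(-\lVert x_n-\mu_k\rVert^2 / 2\sigma^2\right)}
\;\longrightarrow\;
\begin{cases} 1, & c = \arg\min_k \lVert x_n-\mu_k\rVert^2 \\ 0, & \text{otherwise} \end{cases}
\quad \text{as } \sigma^2 \to 0
```

So the soft responsibilities collapse to K-means' hard nearest-mean assignments.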
What kind of assignment does K-means make?
Hard assignments (each point to one cluster).
What kind of assignment does EM in GMM make?
Soft assignments using probabilities.
What type of optimization method is EM?
An iterative coordinate ascent on a lower bound of the log-likelihood.
Why is the log-likelihood hard to compute directly in GMMs?
Because it involves the log of a sum over components.
What is a key requirement for EM to work?
That we can compute the posterior and maximize the expected log-likelihood.
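Note: putting the cards together, a minimal EM loop sketch for a GMM, reusing the hypothetical e_step and m_step helpers sketched earlier (the convergence check on the log-likelihood is an assumed choice, not prescribed by the cards):

```python
import numpy as np
from scipy.stats import multivariate_normal

def log_likelihood(X, pis, mus, Sigmas):
    """log p(X | theta) = sum_n log sum_c pi_c N(x_n | mu_c, Sigma_c)."""
    dens = np.column_stack([
        pis[c] * multivariate_normal.pdf(X, mean=mus[c], cov=Sigmas[c])
        for c in range(len(pis))
    ])
    return np.log(dens.sum(axis=1)).sum()

def fit_gmm(X, pis, mus, Sigmas, max_iter=100, tol=1e-6):
    """Alternate E- and M-steps until the log-likelihood stops improving."""
    prev = -np.inf
    for _ in range(max_iter):
        q = e_step(X, pis, mus, Sigmas)          # E-step: posterior responsibilities
        pis, mus, Sigmas = m_step(X, q)          # M-step: re-estimate parameters
        curr = log_likelihood(X, pis, mus, Sigmas)
        if curr - prev < tol:                    # monotone increase of the likelihood
            break
        prev = curr
    return pis, mus, Sigmas
```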