PCA Flashcards

(30 cards)

1
Q

What is the primary purpose of PCA?

A

To reduce dimensionality while preserving as much variance as possible.

2
Q

What does PCA transform data into?

A

A new coordinate system aligned with directions of maximum variance.

3
Q

What is the name of the directions found by PCA?

A

Principal Components.

4
Q

Why do we use dimensionality reduction?

A

To compress data, remove redundancy, visualize, and denoise.

5
Q

What shape does the covariance matrix describe?

A

The geometric shape of the data cloud in feature space.

6
Q

What does the diagonal of a covariance matrix represent?

A

The variance of each individual feature.

7
Q

What do off-diagonal entries in a covariance matrix represent?

A

The covariance between pairs of features.

8
Q

What does it mean for data to be ‘white noise’?

A

It is uncorrelated, has zero mean, and unit variance.

9
Q

What is the goal of whitening?

A

To transform data so its covariance matrix becomes the identity matrix.

10
Q

What is the formula for the multivariate Gaussian distribution?

A

P(x) = (1 / sqrt((2π)^D |Σ|)) * exp(-0.5 * (x - μ)^T Σ⁻¹ (x - μ))
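A quick way to sanity-check this formula is to evaluate it directly and compare against SciPy's built-in density; the mean μ, covariance Σ, and query point x below are arbitrary example values, and D is the data dimension.

```python
# Sketch: numerically checking the multivariate Gaussian density formula
# against scipy.stats.multivariate_normal. mu, Sigma and x are made-up values.
import numpy as np
from scipy.stats import multivariate_normal

mu = np.array([1.0, -2.0])
Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
x = np.array([0.5, -1.0])

D = len(mu)
diff = x - mu
# P(x) = (1 / sqrt((2*pi)^D * |Sigma|)) * exp(-0.5 * diff^T Sigma^-1 diff)
norm_const = 1.0 / np.sqrt((2 * np.pi) ** D * np.linalg.det(Sigma))
p_manual = norm_const * np.exp(-0.5 * diff @ np.linalg.inv(Sigma) @ diff)

p_scipy = multivariate_normal(mean=mu, cov=Sigma).pdf(x)
print(p_manual, p_scipy)  # the two values should agree
```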

11
Q

What does an eigenvector of Σ represent in PCA?

A

A principal direction of variance in the data.

12
Q

What does the corresponding eigenvalue represent?

A

The amount of variance captured in that principal direction.

13
Q

What is the first step of PCA?

A

Center the data by subtracting the mean.

14
Q

How is the covariance matrix computed after centering?

A

Σ = (1 / (n - 1)) * BᵀB

15
Q

What is B in the PCA algorithm?

A

The mean-centered data matrix (X - mean).

16
Q

What does projecting data onto eigenvectors achieve?

A

Transforms data into a decorrelated space with ranked variance.
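The steps in cards 13 through 16 can be sketched in a few lines of NumPy; the data matrix X below is made up, with samples in rows and features in columns.

```python
# Sketch of the PCA pipeline from cards 13-16: center, compute the covariance,
# eigendecompose, and project. X is an arbitrary example data matrix.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))              # n = 100 samples, d = 3 features

B = X - X.mean(axis=0)                     # step 1: mean-center the data
Sigma = (B.T @ B) / (len(X) - 1)           # step 2: covariance, (1/(n-1)) * B^T B

eigvals, eigvecs = np.linalg.eigh(Sigma)   # step 3: eigenvectors/eigenvalues of Sigma
order = np.argsort(eigvals)[::-1]          # sort by descending variance
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

Z = B @ eigvecs                            # step 4: project onto the eigenvectors

# The projected data is decorrelated: its covariance is (approximately) diagonal,
# with the eigenvalues on the diagonal.
print(np.round(np.cov(Z, rowvar=False), 6))
```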

17
Q

What does whitening do in PCA?

A

Removes correlations and scales components to unit variance.

18
Q

What matrix operation is used to whiten data?

A

Multiply the projected data by D^{-1/2}, where D is the diagonal matrix of eigenvalues.
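A minimal sketch of that operation, assuming the same eigendecomposition-based setup as the earlier snippet (the data X is again made up):

```python
# Sketch of PCA whitening: project onto the eigenvectors, then scale each
# component by 1/sqrt(eigenvalue), i.e. multiply by D^{-1/2}.
import numpy as np

rng = np.random.default_rng(0)
A = np.array([[2.0, 0.0, 0.0],
              [0.5, 1.0, 0.0],
              [0.0, 0.3, 0.5]])
X = rng.normal(size=(100, 3)) @ A          # correlated example data

B = X - X.mean(axis=0)
Sigma = (B.T @ B) / (len(X) - 1)
eigvals, eigvecs = np.linalg.eigh(Sigma)

W = eigvecs @ np.diag(1.0 / np.sqrt(eigvals))  # whitening matrix V D^{-1/2}
Z_white = B @ W

# After whitening, the covariance is (approximately) the identity matrix.
print(np.round(np.cov(Z_white, rowvar=False), 6))
```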

19
Q

What do principal component ‘loadings’ mean?

A

They are eigenvectors scaled by the square roots of their eigenvalues (i.e., by each component's standard deviation).

20
Q

What does PCA seek to maximize when choosing projection directions?

A

The variance of the projected data.

21
Q

What is the rank of the PCA-transformed dataset if we keep k components?

A

At most k; it equals k whenever the data has at least k nonzero eigenvalues.

22
Q

What shape is the projection matrix if we reduce to k dimensions?

A

V_k ∈ ℝ^{d × k}, where d is the original dimension.

23
Q

What is an advantage of using PCA before a classifier?

A

It can reduce noise and remove multicollinearity.

24
Q

What happens if you remove PCs with low variance in images?

A

You may remove noise while preserving structure.

25

Q

Why does PCA work well for image compression?

A

Most variance (information) is captured in a few components.
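As a rough illustration, fitting PCA on scikit-learn's small 8x8 digit images shows that a handful of components already captures most of the variance; the choice of 16 components here is arbitrary.

```python
# Sketch: PCA compression of small images (sklearn's 8x8 digits), keeping
# an arbitrary 16 of the 64 pixel dimensions.
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X = load_digits().data                       # shape (1797, 64); each row is an 8x8 image
pca = PCA(n_components=16).fit(X)

print(pca.explained_variance_ratio_.sum())   # fraction of variance kept by 16 components
X_compressed = pca.transform(X)              # (1797, 16) compressed representation
X_restored = pca.inverse_transform(X_compressed)  # approximate (1797, 64) reconstruction
```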
26

Q

What is a geometric interpretation of PCA?

A

Finding the plane or hyperplane that best fits the data.
27

Q

Why is PCA an unsupervised method?

A

Because it does not use class labels or outputs, only feature structure.
28

Q

What is the role of SVD in PCA?

A

SVD can be used to compute PCA efficiently and stably.
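A brief sketch of that route, using made-up data: the right singular vectors of the mean-centered matrix are the principal directions, and the squared singular values divided by (n - 1) are the eigenvalues of the covariance matrix.

```python
# Sketch: computing PCA via the SVD of the centered data matrix B = U S V^T.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))             # arbitrary example data
B = X - X.mean(axis=0)                    # mean-centered data matrix

U, s, Vt = np.linalg.svd(B, full_matrices=False)

components = Vt                           # rows are the principal directions
eigvals = s**2 / (len(X) - 1)             # variances along each principal direction
Z = B @ Vt.T                              # projected (decorrelated) data

# These eigenvalues match those of the covariance matrix (1/(n-1)) * B^T B.
print(np.round(eigvals, 4))
```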
29

Q

What does PCA assume about feature relationships?

A

That directions with high variance are the most informative.
30

Q

What happens when you project data onto the top k principal components?

A

You get a compressed version with minimal information loss.