L11 Flashcards

(20 cards)

1
Q

What are the three types of learning in machine learning?

A
  • Supervised Learning
  • Unsupervised Learning
  • Reinforcement Learning

Supervised learning involves labeled data, unsupervised learning does not, and reinforcement learning involves learning through interaction with the environment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the objective of supervised learning?

A

Learn function F: Xk→YkX_k
ightarrow Y_k

The aim is to map inputs to outputs based on labeled data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the objective of unsupervised learning?

A

Discover patterns, cluster/group similar items

Examples include topic clustering on Twitter or detecting fraud through outlier detection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the curse of dimensionality?

A

Exponential increase in feature space volume with more features

This leads to difficulties in generalization, increased risk of overfitting, and sparsity of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is dimensionality reduction?

A

Process of reducing the number of features in a dataset

It speeds up computation, improves generalization, enhances visualization, and eliminates redundant features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the two main approaches to dimensionality reduction?

A
  • Feature Selection
  • Dimensionality Reduction

Feature selection keeps original variables while dimensionality reduction creates new features as combinations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does Principal Component Analysis (PCA) do?

A

Finds new axes (principal components) by rotating the coordinate system

The first component captures the direction of maximum variance, while the second is orthogonal to the first.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the first step in performing PCA?

A

Standardize data (mean 0, variance 1)

Standardization ensures that each feature contributes equally to the analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the properties of PCA?

A
  • Orthogonal components
  • Linear combinations of original variables
  • Each additional component explains less variance

PCA is commonly used for data visualization and feature extraction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How many principal components should be kept in PCA?

A

Enough to reach ≥90% explained variance

The number of components depends on training data size and classifier complexity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

True or False: PCA is color-blind and ignores class labels.

A

True

This means PCA does not consider class information when reducing dimensions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Non-negative Matrix Factorization (NMF)?

A

Unsupervised method for non-negative data that factorizes a matrix into non-negative latent components

NMF is useful for text mining and image analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the strengths of NMF?

A
  • Interpretability
  • Parts-based representations

NMF is particularly useful for analyzing topics in text and features in images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a downside of NMF?

A

Requires positive data and is sensitive to initialization

NMF may also struggle with non-convex optimization.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does t-SNE do?

A

Projects high-dimensional data into 2D or 3D while keeping local neighborhoods close

It is particularly effective for visualizing clusters in data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some hyperparameters for t-SNE?

A
  • n_components (usually 2)
  • perplexity (typically 5–50)
  • early_exaggeration
  • learning_rate (10–1000)
  • n_iter (≥250, typically 1,000)

These hyperparameters control various aspects of how t-SNE processes data.

17
Q

What is a key limitation of t-SNE?

A

Emphasizes local over global structure

It is non-parametric and cannot easily project new points.

18
Q

What are the strengths and weaknesses of PCA?

A
  • Strengths: Fast, interpretable
  • Weaknesses: May lose class info

PCA is a linear method that is efficient but may overlook important class distinctions.

19
Q

What are the strengths and weaknesses of NMF?

A
  • Strengths: Interpretability, text/image analysis
  • Weaknesses: Non-convex, needs positive data

NMF is useful but has specific data requirements and optimization challenges.

20
Q

What are the strengths and weaknesses of t-SNE?

A
  • Strengths: Great for visualization
  • Weaknesses: Not useful for modeling

t-SNE excels at visualizing data but is not suitable for predictive modeling tasks.