{ "@context": "https://schema.org", "@type": "Organization", "name": "Brainscape", "url": "https://www.brainscape.com/", "logo": "https://www.brainscape.com/pks/images/cms/public-views/shared/Brainscape-logo-c4e172b280b4616f7fda.svg", "sameAs": [ "https://www.facebook.com/Brainscape", "https://x.com/brainscape", "https://www.linkedin.com/company/brainscape", "https://www.instagram.com/brainscape/", "https://www.tiktok.com/@brainscapeu", "https://www.pinterest.com/brainscape/", "https://www.youtube.com/@BrainscapeNY" ], "contactPoint": { "@type": "ContactPoint", "telephone": "(929) 334-4005", "contactType": "customer service", "availableLanguage": ["English"] }, "founder": { "@type": "Person", "name": "Andrew Cohen" }, "description": "Brainscape’s spaced repetition system is proven to DOUBLE learning results! Find, make, and study flashcards online or in our mobile app. Serious learners only.", "address": { "@type": "PostalAddress", "streetAddress": "159 W 25th St, Ste 517", "addressLocality": "New York", "addressRegion": "NY", "postalCode": "10001", "addressCountry": "USA" } }

ML Part 3 Flashcards

(19 cards)

1
Q

What is a random forest?

A

An ensemble of decision trees trained on random subsets of data and features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is bagging?

A

Bootstrap aggregating: training models on random samples and averaging their predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why do random forests reduce overfitting?

A

By averaging predictions from many trees trained on different data subsets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is feature importance in random forests?

A

A metric that shows how useful each feature was for making predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the out-of-bag (OOB) score?

A

A validation score using samples not included in the bootstrap sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is boosting?

A

An ensemble method that combines weak learners sequentially to improve performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is gradient boosting?

A

A boosting method that minimizes loss by adding trees that correct previous errors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the learning rate in boosting?

A

A hyperparameter that controls the contribution of each tree to the ensemble.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is early stopping in boosting?

A

Halting training when validation performance no longer improves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Name two popular gradient boosting libraries.

A

XGBoost and LightGBM.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Naive Bayes algorithm?

A

A probabilistic classifier based on Bayes’ theorem assuming feature independence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What assumption makes Naive Bayes ‘naive’?

A

It assumes that all features are conditionally independent given the class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why is Naive Bayes effective for text classification?

A

Because it handles high-dimensional sparse data well.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Laplace smoothing?

A

A technique to handle zero probabilities by adding a small constant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Principal Component Analysis (PCA)?

A

A method for reducing dimensionality by projecting data onto directions of maximum variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are principal components?

A

New orthogonal axes capturing the most variance in the data.

17
Q

Why use PCA?

A

To reduce complexity, noise, and improve efficiency.

18
Q

What is an eigenvector in PCA?

A

A direction in the feature space along which variance is measured.

19
Q

What is an eigenvalue in PCA?

A

The amount of variance captured by its corresponding eigenvector.