{ "@context": "https://schema.org", "@type": "Organization", "name": "Brainscape", "url": "https://www.brainscape.com/", "logo": "https://www.brainscape.com/pks/images/cms/public-views/shared/Brainscape-logo-c4e172b280b4616f7fda.svg", "sameAs": [ "https://www.facebook.com/Brainscape", "https://x.com/brainscape", "https://www.linkedin.com/company/brainscape", "https://www.instagram.com/brainscape/", "https://www.tiktok.com/@brainscapeu", "https://www.pinterest.com/brainscape/", "https://www.youtube.com/@BrainscapeNY" ], "contactPoint": { "@type": "ContactPoint", "telephone": "(929) 334-4005", "contactType": "customer service", "availableLanguage": ["English"] }, "founder": { "@type": "Person", "name": "Andrew Cohen" }, "description": "Brainscape’s spaced repetition system is proven to DOUBLE learning results! Find, make, and study flashcards online or in our mobile app. Serious learners only.", "address": { "@type": "PostalAddress", "streetAddress": "159 W 25th St, Ste 517", "addressLocality": "New York", "addressRegion": "NY", "postalCode": "10001", "addressCountry": "USA" } }

ML Part 5 Flashcards

(20 cards)

1
Q

What is feature engineering?

A

The process of creating, selecting, or transforming variables to improve model performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is one-hot encoding?

A

Converting categorical variables into binary indicator columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is label encoding?

A

Assigning each category a unique integer value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is feature scaling?

A

Rescaling input features to a standard range (e.g., 0-1 or standard score).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When is feature scaling important?

A

For distance-based models like k-NN or SVM.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is normalization?

A

Scaling data to a range (typically 0 to 1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is standardization?

A

Rescaling data to have mean 0 and standard deviation 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is missing value imputation?

A

Filling in missing values using a rule (e.g., mean, median, model).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is data leakage in preprocessing?

A

Using information in training that would not be available at prediction time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why should scaling be fit only on training data?

A

To avoid data leakage from test data influencing the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is feature selection?

A

Choosing a subset of input variables to use in a model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is univariate feature selection?

A

Selecting features based on statistical tests between inputs and target.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is recursive feature elimination (RFE)?

A

A method that fits models and recursively removes least important features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the curse of dimensionality?

A

Model performance degrades as the number of features increases without more data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is multicollinearity?

A

When two or more features are highly correlated, causing instability in coefficients.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is hyperparameter tuning?

A

The process of finding the best values for model parameters not learned from data.

17
Q

What is grid search?

A

An exhaustive search over a manually specified set of hyperparameters.

18
Q

What is random search?

A

Sampling hyperparameters randomly over specified ranges.

19
Q

What is Bayesian optimization for tuning?

A

A probabilistic method that models the objective function to find good hyperparameters.

20
Q

What is cross-validation used for during tuning?

A

To evaluate model performance for each hyperparameter configuration.