Class One Flashcards

1
Q

What is data pre-processing in machine learning?

A

Data pre-processing refers to the techniques and methods used to transform raw data into a clean and meaningful format suitable for machine learning algorithms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the advantages of data pre-processing?

A

Advantages of data pre-processing include improved data quality, reduced noise and outliers, enhanced model performance, and better interpretability of results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the steps involved in data pre-processing?

A

The steps in data pre-processing typically include data cleaning, handling missing values, handling outliers, feature scaling, and feature encoding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is exploratory data analysis (EDA)?

A

Exploratory data analysis is the process of analyzing and visualizing data to gain insights, understand the underlying patterns, detect outliers, and make informed decisions about further analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the goals of exploratory data analysis?

A

The goals of exploratory data analysis are to understand the distribution of variables, identify relationships between variables, detect anomalies or outliers, and uncover hidden patterns in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are some common techniques used in exploratory data analysis?

A

Some common techniques used in exploratory data analysis include summary statistics, data visualization (e.g., histograms, scatter plots), correlation analysis, and dimensionality reduction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why is exploratory data analysis important in machine learning?

A

Exploratory data analysis helps in understanding the characteristics and structure of the data, identifying data quality issues, selecting appropriate features, and guiding the choice of machine learning models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When should you use data pre-processing techniques?

A

Data pre-processing techniques should be used when dealing with raw, noisy, or incomplete data, or when preparing data for machine learning algorithms that have specific requirements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are some common challenges in data pre-processing?

A

Common challenges in data pre-processing include handling missing data, dealing with outliers, selecting appropriate feature scaling methods, and determining the best strategy for feature encoding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does data pre-processing impact machine learning model performance?

A

Proper data pre-processing can significantly improve machine learning model performance by reducing noise, removing bias, handling missing values, and ensuring that the data is in a suitable format for the chosen algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Data Science?

A

The ability to take data - to be able
to understand it, to process it, to
extract value from it, to visualize it,
to communicate it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Precision?

A

TP/(TP+FP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Recall?

A

TP/(TP+FN)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Accuracy?

A

(TN+TP) / (TN+FP+FN+TP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is F1 Score?

A

2 ( Precision* Recall)/(Precision+ Recall)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly