Data science terms and Data handling terms Flashcards

Learn the groups of terms related to data science, and one of those groups in detail: data handling terms.

1
Q

List the 5 groups of terms related to data science

A

-Data handling
-Data features terms
-Artificial intelligence
-Model development terms
-Model performance terms

2
Q

Which terms are data handling terms?

A

Training set, testing set, outlier, data cleansing

3
Q

Define training set

A

The dataset from which the machine learning model learns the desired task

4
Q

Define testing set

A

Dataset that is used to measure the performance of the developed machine learning model
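
A minimal sketch of how a training set and a testing set might be carved out of one dataset. The helper name `train_test_split`, the 80/20 ratio, and the fixed seed are illustrative assumptions, not part of the definitions above.

```python
import random


def train_test_split(records, test_fraction=0.2, seed=0):
    """Shuffle a copy of the records, then split it into (training set, testing set)."""
    if not 0 < test_fraction < 1:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(records)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    # The first n_test shuffled records are held out for testing.
    return shuffled[n_test:], shuffled[:n_test]


records = list(range(10))  # stand-in for real data records
train_set, test_set = train_test_split(records, test_fraction=0.2, seed=42)
```

The model would learn only from `train_set`; `test_set` is kept aside so that performance is measured on records the model has not seen.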

5
Q

Define outlier

A

A data record that is seen as exceptional and lies outside the distribution of the normal input data
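
One common way to flag such records is the interquartile range (IQR) rule. The sketch below assumes that rule, the conventional multiplier of 1.5, and made-up sample values; the definition above does not prescribe any particular method.

```python
from statistics import quantiles


def find_outliers(values, k=1.5):
    """Return the values lying more than k * IQR below Q1 or above Q3."""
    q1, _, q3 = quantiles(values, n=4)  # the three quartile cut points
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


readings = [10, 12, 11, 13, 12, 11, 95]  # illustrative data; 95 sits far from the rest
outliers = find_outliers(readings)
```

Whether a flagged record is an error to remove or a genuine rare case to keep is a judgment call that depends on the data.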

6
Q

Define data cleansing

A

Process of removing redundant data, handling missing data entries, and removing or at least alleviating other data quality issues
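
A small sketch of two of these steps: dropping redundant (duplicate) records and filling a missing entry. The function name `cleanse`, the `age` field, and the choice to fill gaps with the mean are assumptions made for illustration; other strategies, such as dropping incomplete records, are equally valid.

```python
from statistics import mean


def cleanse(records, numeric_field):
    """Drop exact duplicate records, then fill missing numeric values with the mean."""
    seen = set()
    unique = []
    for rec in records:
        key = tuple(sorted(rec.items()))  # hashable fingerprint of the record
        if key not in seen:
            seen.add(key)
            unique.append(rec)

    known = [r[numeric_field] for r in unique if r.get(numeric_field) is not None]
    if not known:
        return unique  # nothing to impute from
    fill = mean(known)
    return [
        {**r, numeric_field: fill if r.get(numeric_field) is None else r[numeric_field]}
        for r in unique
    ]


raw = [
    {"id": 1, "age": 30},
    {"id": 1, "age": 30},    # redundant duplicate
    {"id": 2, "age": None},  # missing entry
    {"id": 3, "age": 40},
]
clean = cleanse(raw, "age")
```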
