data preparation Flashcards

1
Q

process of preparing data for analysis by
removing or modifying incorrect, incomplete,
irrelevant, duplicated, or improperly formatted data

A

data cleaning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

types of data: many different string value

A

polynomial

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

types of data: exactly 2 values

A

binomial

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

types of data: a fractional number

A

real

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

types of data: a whole number

A

integer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

types of data: both date and time

A

data_time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

rapid miner interface

A

repository/source tabs
operators/analysis tabs
description tabs
parameter tabs
canvas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

data will appear in the __ tab

A

results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

“read excel” is found in what tab

A

operator tab

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

to find the basic statistics of each attributes, click __

A

statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

connect the __ node of the read excel operator and __ of the result knob

A

out, res

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

filter examples may be found in what tab

A

operator tab

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

add filter may be found in what tab

A

parameter tab

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

instead of filtering, you may remove all cases with missing values, using the __ class, instead of add filters

A

condition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

replace missing values may be found in what tab

A

operator tab

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

t/f:in dealing with miscoded data, the out node of the retrieve customer operator must also be connected to the first res of the result knob

A

f (second knob)

17
Q

__ __ operator may be used to tag the attribute that will be used as the label (target variabls)

A

set role

18
Q

if two data sets are needed to be merged in order to make an analysis, use the __ operator

A

join

19
Q

connect the first data set or its result in the _- node of the join operator and the other data set at the right node

A

left