Exam 1 Flashcards

(7 cards)

1
Q

Column Filter Node

A

Used to select columns for analysis.

o Right-click -> Configure -> choose the columns you want

2
Q

Statistics Node

A

Shows univariate statistics and some limited diagrams.

3
Q

Missing Value Node

A

Handles missing values, for example by removing the rows that contain them.

Missing Value -> Configure -> Column Settings -> Remove Row
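
For comparison, the "Remove Row" behavior can be sketched in plain Python. This is only an illustration of the idea, not how the node is implemented; the sample rows, the column names, and the helper `remove_rows_with_missing` are hypothetical.

```python
# Hypothetical sample data; None stands in for a missing value.
rows = [
    {"player": "A", "position": "Catcher", "age": 28},
    {"player": "B", "position": None, "age": 31},
    {"player": "C", "position": "Pitcher", "age": None},
]

def remove_rows_with_missing(rows):
    """Keep only the rows in which every column has a value."""
    return [row for row in rows
            if all(value is not None for value in row.values())]

complete_rows = remove_rows_with_missing(rows)
print(complete_rows)
```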

4
Q

Row Filter Node

A

o Filter to see only one type (Positions -> Catcher)
o Filter to see anything that contains "Pitch": * is the wildcard character; you must check the "contains wild cards" box
o Filter to retrieve a range
- Catchers between 25 and 35: need 2 separate nodes (can't combine the two conditions in one node)
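
The three kinds of filter above can be sketched in plain Python. This is a rough analogy, not KNIME code: the sample rows are made up, and the `age` column is an assumption about what the 25-35 range refers to. The range filter is chained onto the exact-match filter, mirroring the two separate nodes.

```python
from fnmatch import fnmatchcase

# Hypothetical sample data.
rows = [
    {"player": "A", "position": "Catcher", "age": 28},
    {"player": "B", "position": "Starting Pitcher", "age": 31},
    {"player": "C", "position": "Relief Pitcher", "age": 24},
    {"player": "D", "position": "Catcher", "age": 40},
]

# 1) Exact match: only one type.
catchers = [r for r in rows if r["position"] == "Catcher"]

# 2) Wildcard match: * stands for any run of characters.
pitchers = [r for r in rows if fnmatchcase(r["position"], "*Pitch*")]

# 3) Range, applied to the output of the first filter (a second step).
catchers_25_35 = [r for r in catchers if 25 <= r["age"] <= 35]

print([r["player"] for r in catchers])
print([r["player"] for r in pitchers])
print([r["player"] for r in catchers_25_35])
```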

5
Q

Nominal Value Row Filter Node

A

Lets you pick one nominal column and filter by selected categories in that variable.
-ex: (Positions-> Catcher, Outfielder, Shortstop)
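
In plain Python, this kind of filter amounts to a membership test against the set of selected categories. A minimal sketch with made-up sample rows:

```python
# Hypothetical sample data.
rows = [
    {"player": "A", "position": "Catcher"},
    {"player": "B", "position": "Pitcher"},
    {"player": "C", "position": "Outfielder"},
    {"player": "D", "position": "Shortstop"},
]

# Categories picked in the one nominal column being filtered.
selected = {"Catcher", "Outfielder", "Shortstop"}

kept = [r for r in rows if r["position"] in selected]
print([r["player"] for r in kept])
```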

6
Q

Types of Missing Values

Nominal Categorical

A

1) Missing completely at random (MCAR)

2) Missing not at random (MNAR)

7
Q

How do you handle missing data?

A

1) Study the reason the data is missing
The data could be missing not at random (MNAR).
2) Ignore it (still keep the record in the data set)
This may not be a wise decision because some DM techniques are very sensitive to missing data.
3) Pairwise deletion of rows: an observation with a missing value for variable X is removed from statistics (such as a correlation matrix) that involve X. It is not removed from statistics that do not involve X.
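
Pairwise deletion can be illustrated with a small correlation example. This is a sketch under assumptions: the data, the column names `x`, `y`, `z`, and the helpers `pearson` and `pairwise_corr` are all hypothetical. The row with a missing `x` is dropped only from the statistic that involves `x`.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

def pairwise_corr(rows, a, b):
    """Correlate columns a and b, using only rows where both are present."""
    pairs = [(r[a], r[b]) for r in rows
             if r[a] is not None and r[b] is not None]
    xs, ys = zip(*pairs)
    return pearson(xs, ys), len(pairs)

# Hypothetical data; the third row is missing x.
rows = [
    {"x": 1.0, "y": 2.0, "z": 1.0},
    {"x": 2.0, "y": 4.0, "z": 3.0},
    {"x": None, "y": 6.0, "z": 2.0},
    {"x": 4.0, "y": 8.0, "z": 5.0},
]

r_xy, n_xy = pairwise_corr(rows, "x", "y")  # third row excluded
r_yz, n_yz = pairwise_corr(rows, "y", "z")  # third row still used
print(n_xy, n_yz)
```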
