exam1 Flashcards

(16 cards)

1
Q

KDD

A

knowledge discovery in databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

business intelligence main definition

A

extract INFORMATION for large amount of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

corelation vs causation

A

just because the curves fit doesnt mean they are related

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

patterns

A

probabilities associated with a given fact. summarization of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

association rules

A

identify what goes with what

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

classification

A

like clustering… separate people into groups. difference is groups PREDICTIVE not known

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

clustering

A

separate data instances into groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

collaborative filtering

A

netflix reccommender. look for similar users and predict based on that

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

ensemble

A

use many different models fused together to get the best results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

reports

A

summarization or visualization of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

KDD process steps

A
DATA
selection
preprocessing
data mining
interpretation
KNOWLEDGE
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

data preprocessing

A

cleaning, exploration, data reduction/transformation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

data cleaning

A

detect and fix/remove bad data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

data transformation

A

convert data format

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

data mining

A

searching for patterns of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

data mining cycle

A

data -> information -> action -> VALUE