DAV CHAP 6 Flashcards

1
Q

Data mining definition

A

Process of Discovering new and meaningful correlations patterns trends by “mining” large amount of stored data using pattern recognition tech and statistical and mathematical techniques

known as knowledge discovery, data surfing, data harvesting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is driving data mining

A
  1. Change in technology - increase use of internet and computing power + data warehouses + better modelling approaches
  2. Change in customer behavior - more informed, demanding, willing to switch to competitor, harder to satisfy needs as it gets more complex
  3. Change in competition - Evolution of strategy like one to one marketing and mass marketing, more competition, faster pace, niche players.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Crisp DM model

A
  1. Business understanding - understand business objective
  2. Data understanding - collect, identify, describe data
  3. Data preparation - select, clean data
  4. Modelling - select modelling technique, build model and assess
  5. Evaluation - Evaluate results and review
  6. Deployment - plan deployment + presentation + review
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Decision tree

A

Supervised learning method - classification method that uses value of input variable to predict value of categorical variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Decision tree benefits

A
  1. Make readily understandable rules to predict customer behavior
  2. Evaluate values of different outcomes and probability to reach them
  3. Produce graphical representation of how different factors affect the outcome
  4. Make segmentation scheme based on decision tree results
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Clustering

A

Creates groups of records that are similar to each other within a particular group and very different across different groups
Association between members determined by characteristics specified in the analysis
Explore large amount of data and organize it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define data understanding

A

identify source of data, collect, describe explore and verify data quality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data preparation

A

select data to clean, integrate the data and format it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Modelling

A

Select modelling technique, build, fine tune and assess

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Business understanding

A

understand business objective, and goal of data mining, work with stakeholders to produce project plan

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Decision tree practical applications

A
  1. Reduce customer fraud
  2. Who is likely to stop buying from us
  3. Who’s likely to be a credit risk
How well did you know this?
1
Not at all
2
3
4
5
Perfectly