Lecture 1: Data Mining Concepts Flashcards
(5 cards)
What is data mining?
DM is technique used to __ valuable information from __ __ by discovering __, __ and __.
DM is technique used to extract valuable information from large dataset by discovering patterns, relationships and anamolies.
KDD
Knowledge Discovery in Databases (KDD) process consists of:
- Data C__ & I__
- Data S__ & T__
- Data M__
- E__ & P__
- Knowledge
- Data Cleaning & Integration
- Data Selection & Transformation
- Data Mining
- Evaluation & Presentation
- Knowledge
What is the concept of DM functionalities?
Function that __ and __ classes for __ predictions.
Function that describes and distinguish classes for future predictions.
Association Analysis
What is confidence and support?
Confidence: The percentage that ___ __ occurs ___ of __.
Support: The __ percentage that ____ & __ occus __.
Confidence: The percentage that event B occurs because of event A.
Support: The total percentage that event A & B occus together.
Discovering ‘interesting’ patterns.
Objective vs Subjective measures:
Objective: Based on s__ and s__ of p__.
Subjective: Based on u__ b__ in data.
Objective: Based on statistics and structures of patterns. E.g. support and confidence for association rules.
Subjective: Based on user’s belief in data. E.g. unexpectedness, novelty.