Key Words Flashcards
Data Mining
identifying implicit, previously unknown, potentially useful patterns in data
Data Mining can be done…
interactively or automatically
Data Mining can be…
descriptive or predictive
Machine Learning (for data mining)
programs that induce structural descriptions from observations
Supervised learning
is based on labeled examples and used for predicting labels of new observations
Unsupervised learning
is based on unlabeled data
Classification Rule
predicts value of given attribute
Association Rule
predicts value of arbitrary attribute (or combination)
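The difference between the two rule types can be sketched in a few lines of Python. This is only an illustration: the weather-style instances, the attribute names, and the helper functions `matches` and `rule_accuracy` are all made up for the example. The classification rule always concludes about the same given attribute (`play`), while the association rule concludes about an arbitrary combination of attributes.

```python
# Toy instances; attribute names and values are illustrative assumptions.
instances = [
    {"outlook": "sunny", "humidity": "high", "windy": False, "play": "no"},
    {"outlook": "sunny", "humidity": "normal", "windy": False, "play": "yes"},
    {"outlook": "rainy", "humidity": "high", "windy": True, "play": "no"},
    {"outlook": "overcast", "humidity": "normal", "windy": False, "play": "yes"},
]


def matches(conditions, instance):
    """True if the instance has every attribute value listed in conditions."""
    return all(instance.get(attr) == value for attr, value in conditions.items())


def rule_accuracy(antecedent, consequent, data):
    """Fraction of instances covered by the antecedent that also satisfy the consequent."""
    covered = [x for x in data if matches(antecedent, x)]
    if not covered:
        return None  # the rule covers no instances
    return sum(1 for x in covered if matches(consequent, x)) / len(covered)


# Classification rule: the consequent is always the given class attribute "play".
classification_acc = rule_accuracy({"humidity": "high"}, {"play": "no"}, instances)

# Association rule: the consequent may be any attribute or combination of attributes.
association_acc = rule_accuracy(
    {"play": "yes"}, {"humidity": "normal", "windy": False}, instances
)
```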
Concepts
the thing to be learned; the kinds of structures a scheme can learn
Concept Description
Output of learning scheme
Important types of learning problems (four)
Classification Learning
Numeric Prediction
Association Learning
Clustering
Clustering
Grouping similar instances into clusters
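One way to make "grouping similar instances" concrete is a minimal one-dimensional k-means sketch. k-means is just one clustering technique chosen for illustration; it is not named in these cards. The function `kmeans_1d`, the sample points, and the starting centroids are all assumptions.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Minimal 1-D k-means: assign each point to its nearest centroid, then re-average."""
    centroids = list(centroids)
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [
            sum(c) / len(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
    return clusters, centroids


# Unlabeled data: no class attribute is supplied to the scheme.
points = [1.0, 1.2, 0.8, 8.0, 8.3, 7.9]
clusters, centroids = kmeans_1d(points, centroids=[0.8, 8.3])
```

No labels are supplied here; the two groups emerge only from how close the values are to each other.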
Is classification learning supervised or unsupervised?
Supervised: the scheme is provided with the actual outcome for each training example
Explain classification learning
Predicting a nominal class. Success can be measured on fresh data for which the class labels are known (test data); in practice, success is often measured subjectively.
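A minimal sketch of measuring success on test data, assuming a very simple learner that predicts the majority class for each value of a single attribute. The data, the attribute names, and the functions `train_one_attribute` and `accuracy` are hypothetical and only illustrate the idea.

```python
from collections import Counter, defaultdict


def train_one_attribute(train, attribute, class_attr):
    """Learn a mapping: attribute value -> most frequent class among training rows."""
    counts = defaultdict(Counter)
    for row in train:
        counts[row[attribute]][row[class_attr]] += 1
    return {value: c.most_common(1)[0][0] for value, c in counts.items()}


def accuracy(model, test, attribute, class_attr, default):
    """Fraction of test rows whose predicted class equals the known class label."""
    correct = sum(
        1 for row in test if model.get(row[attribute], default) == row[class_attr]
    )
    return correct / len(test)


# Labeled training examples (the scheme sees the actual outcome "play").
train = [
    {"outlook": "sunny", "play": "no"},
    {"outlook": "sunny", "play": "no"},
    {"outlook": "overcast", "play": "yes"},
    {"outlook": "rainy", "play": "yes"},
    {"outlook": "rainy", "play": "yes"},
]

# Fresh data with known class labels, held out from training.
test = [
    {"outlook": "sunny", "play": "no"},
    {"outlook": "overcast", "play": "yes"},
    {"outlook": "rainy", "play": "no"},
    {"outlook": "sunny", "play": "yes"},
]

model = train_one_attribute(train, "outlook", "play")
test_accuracy = accuracy(model, test, "outlook", "play", default="yes")
```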
What do we call the outcome?
The class of the example
Regression/Numeric Prediction
predicting a numeric quantity
Variant of classification learning where “class” is numeric
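Numeric prediction can be sketched with a least-squares line fit, where the "class" is a number rather than a nominal label. Linear regression is only one possible scheme, and the function `fit_line` and the sample values are assumptions made for the example.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept for one input attribute."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Training examples: the outcome to predict is a numeric quantity.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 5.0 + intercept  # predicted numeric "class" for a new observation
```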