Decision Trees Flashcards
(130 cards)
What is a Decision Tree?
Decision Tree is a tree shaped diagram used to
determine a course of action.
What does each branch of the Decision Tree typically represent?
Each branch of the tree represents a possible decision,
occurrence, or reaction.
What type of learning algorithm is a Decision Tree, and for what tasks is it commonly used?
A decision tree is a non-parametric
supervised learning algorithm, which is utilized for
both classification and regression tasks.
How is the training dataset used in building models with the Decision Tree Algorithm?
The training dataset is induced and then used the learning model phase by tree induction algorithm. The learned model is an outcome of the tree induction algorithm processing the training set.
In terms of tasks, what are the two main applications of a decision tree?
classification and regression tasks.
Why is a decision tree considered a popular data mining technique?
A decision tree visualization helps outline the decisions in a way that is easy to understand
What is the primary goal of creating a model using the Decision Tree Algorithm?
The goal is to create a model that predicts the value
of a target variable based on several input variables.
How is an internal node represented in a decision tree, and what does it signify?
an internal node represents a feature (or attribute)
What does each branch in a decision tree represent?
represents a decision rule
What role does a leaf node play in a decision tree, and what does it represent?
Each leaf node represents the outcome.
What is the significance of the root node in a decision tree?
It learns to partition on the basis of the
attribute value
How does the root node contribute to the partitioning of a decision tree?
It partitions the tree in a recursive manner called recursive partitioning
What is the process known as when the decision tree partitions in a recursive manner?
recursive partitioning
Why is a decision tree often compared to a flowchart diagram?
It’s visualization like a flowchart diagram which easily mimics the human level thinking.
What are the three main components of a decision tree, and what do they represent?
- Node
test for the value of a certain attribute - Edges
correspond to the outcome of a test - Leaves
terminal nodes that predict the outcome
In a classification tree, what is the purpose of determining a set of logical if-then conditions?
To classify problems.
When is a regression tree used, and how does it differ from a classification tree in terms of the target variable?
A regression tree is used when the target variable is numerical or continuous. We fit a regression model
to the target variable using each of the independent variables
In the context of Decision trees:
Define Gain
Gain is a measure of decrease in entropy after splitting
Decision Trees
How to split the data?
We hace to frame the conditions that split the data in such a way that the information gain is highest
How does the decision tree algortithm work?
- Select the best attribute using Attribute selection measures to split Records
- Make that attribute a decision node
- Start Tree building by recursively reapting this process for each child
Generally, when does the process of Tree building stop?
- When there are no more attributes
- There are no more instances
- All the tuples belong to the same attribute
List three advantages of Decision Trees in machine learning.
Simple to understand, interpret and visualize
What makes Decision Trees simple to understand and interpret for humans?
They look like simple if-else statements, and therefore can
be easily interpreted by humans
What are the advantages of Decision Trees regarding data preparation?
- No Scaling needed
- Can work without extensive handling of missing data
- no need for dummy variable