Book - Chapter 7 Analytical Theory Classification Flashcards by Jodie Collins

What applications does classification appear in

Data mining

How well did you know this?

Not at all

Perfectly

What is the primary task of a classifier

To assign class labels to new observations

How well did you know this?

Not at all

Perfectly

Are classification method supervised or unsupervised

Supervised

How well did you know this?

Not at all

Perfectly

What is another name for a decision tree

Prediction tree

How well did you know this?

Not at all

Perfectly

What is the input variable of a decision tree

Categorical or continuous

How well did you know this?

Not at all

Perfectly

In a decision tree structure what is a test point

A node

How well did you know this?

Not at all

Perfectly

What is a node without further branches called

A leaf node

How well did you know this?

Not at all

Perfectly

What do leaf nodes return

They return class labels and, in some implementations, they return the probability scores

How well did you know this?

Not at all

Perfectly

What are the two varieties of decision trees

Classification trees and regression trees

How well did you know this?

Not at all

Perfectly

What are classification trees

They usually apply to output variables that are categorical for example often binary yes or no

How well did you know this?

Not at all

Perfectly

What are regression trees

They can apply to output variables that are numerical continuous, such as the predicted price of a consumer good or the likely heard a subscription will be purchased

How well did you know this?

Not at all

Perfectly

What does the term branch mean in decision trees

Refers to the outcome of a decision and is visualised as a line connecting two Nodes

How well did you know this?

Not at all

Perfectly

What happens if the decision is numerical

The greater than branch is usually placed on the right

How well did you know this?

Not at all

Perfectly

What is an internal node

Are the dissertation or test points. Each internal note refers to an input variable or an attribute

How well did you know this?

Not at all

Perfectly

What is the top internal node called

The root

How well did you know this?

Not at all

Perfectly

What is the depth of a node

Study These Flashcards

Is the minimum number of steps required to reach the node from the root

What are short trees also known as

Study These Flashcards

Weak learners or base learners

What’s on in ensemble Mefford

Study These Flashcards

They use multiple predictive models to vote, and decisions can be made based on the combination of the votes

Gave examples of ensemble methods

Study These Flashcards

Random forest, bagging, and boasting

What is the simplest short tree called

Study These Flashcards

Decision stump

At each split what does the decision tree algorithm do

Study These Flashcards

It picks the most informative attribute out of the remaining attributes

How is the most informative attribute determined

Study These Flashcards

By measures such as entropy and information gain

What does entropy measure

Study These Flashcards

The impurity of an attribute

What does information gain measure

Study These Flashcards

The purity of an attribute

When do you achieve maximum entropy

When all class labels are equally probable

What is conditional entropy always

Less than or equal to the base Entropy

What is information gain defined as

The difference between base Entropy and conditional entropy

What is Bayes theorem

Gives a relationship between the probabilities of two events and their conditional probabilities

What is a naive Bayes classifier

Assumes that the presence or absence of a particular feature of a class is unrelated to the presence or absence of other features

What are the input variables of naive Bayes

Categorical and I’ll discreet

What is the output of naive Bayes

Class label and its corresponding probability score. The probability score is not the true probability of the class label, but it’s proportional to the true probability

What is naive Bayes most commonly used for

Spam filtering

What is Bayes theorem

The conditional probability of event C occurring, given that event A has already occurred, is to noted as P (C|A)

What should a good classifier have

A large true positive and true negative and a small (ideally zero) numbers for false positives and false negatives

What does accuracy mean

Defining the rate at which a model has classified the records correctly

What is recall

The percentage of positive instances that were correctly identified

Book - Chapter 7 Analytical Theory Classification Flashcards

(36 cards)