Chapter 7 - Decision Analytic Thinking 1: What is a good model? Flashcards

1
Q

What the deal w/ classification accuracy?

A

It is an easy to measure metric to asses the models perfomance. However, it is to simplistic to use it in real lfie.

Accuracy = N of correct decisions made/ Total N decisions made = 1 - error rate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the deal w/ the Confusion Matrix

A

Used to decompose and count the different types of correct and incorrect devisions made by a classiffier. Used for True Positive and False Positives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a skewed class distribution? What is the problem w/ it?

A
This is an unbalanced class distribution in which one class different than the other. The problem is:
1. More skewness leads to a break down in evaluation based on accuracy.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the problem w/ accuracy?

A
  1. Accuracy is misleading when there is skewness.

2. It makes no distinction between False Positive and False Negatives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the deal w/ the Expected Value Framework?

A

Calculates the weighted average of the values of the different possible outcomes.
It decomposes data-analytic thinking into:
1. The Structure of the problem
2. The elements of the analysis that can be extracted from the data.
3. The elements of the analysis that need to be acquired from other sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is the Expected Value Framework used?

A

It is used to calculate the Expected payoff of a model. To know whether the model should be used, the EV of profits > EV costs. This all depends on the Vr and Vnr.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the benefit of using the Expected Value Framework instead of using Accuracies?

A

The EFV can be used to compare two different models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain the majority classifier?

A

This is a naive classifier that always chooses the majority class of the training data as baseline.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a decision stump and what is it used for?

A

This is a decision tree w/ only one internal node, the root node. This means that only the single most informative feature is selected to make a decision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly