Analysis Flashcards

1
Q

In ROC (Receiver Operating Characteristic) analysis, what 2 measurements do we take for each threshold level?

A
  1. Specificity and Sensitivity

OR

  1. False Positive Rate and True Positive Rate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the ROC curve plot?

A
  1. Specificity(x-axis) and Sensitivity (y-axis)

OR

  1. False Positive Rate (x-axis) and True Positive Rate (y-axis)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Sensitivity?

A

TP/ (number of real positives)

classifies how good the model is at picking out positive values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Specificity?

A

TN/ (number of real negatives)

classifies how good the model is at picking out negative values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a good ROC curve look like?

A

Like a top-left corner

It should show a sharp rise in the True Positive Rate, without much increase in the False Positive Rate

This means it can classify a lot of positive samples correctly, without misclassifying negative samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What metric do we use to show how good an ROC curve is?

A

We look at the area underneath the ROC curve,

the ideal case is an area of 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Are KNNs good with large scale data?

A

No

There is a high computational complexity of neighbour search and distance calculation with lots of dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why do KNNs have a high memory cost?

A

They need to store all the training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Are KNNs good with dealing with imbalanced data?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Are KNNs sensitive to outliers?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the no free-lunch theorem?

A

This is more of a philosophy which states that:

Given no prior information to the learning task or data distribution

We can never say that any particular algorithm has a guaranteed advantage over any other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do we need to decide when using KNN?

A

The neighbour number K

The distance measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Can a KNN handle both linear and non-linear data patterns?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Can we use Regularised Linear Least Squares with a small dataset?

A

Yes, good results can still be achieved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Does Linear Regression have a low computational cost?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In Regularised Linear Least Squares, what hyper parameters are there to set?

A

The regularisation parameter, lambda

and the form of Regularisation e.g. L2 or L1

17
Q

Is Linear Regression sensitive to outliers?

A

Yes