Anomaly Detection Flashcards

1
Q

DTF never results in distance strictly greater than Euclidean distance

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

DTF cannot be applied to sequences of diff length

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

DTF can only be applied to single-variate (one-dimensional/ one feature) sequence

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

DTF normalization is useful

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a low local reachability density mean? (lrd)

A

It means large average distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

LOF(q) < 1 means what?

A

Inlier, higher density

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

LOF(q) > 1 means what?

A

Outlier, lower density

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Advantages of NN?

A
  • used in unsupervised setting
  • no assumptions about data distribution
  • intuitively appealing, uses distances
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Disadvantages of NN?

A
  • computationally expensive when testing
  • requires distances, so all disadvantages of distances apply
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Advantages of PCA?

A
  • Useful for modeling feature interaction
  • Computationally efficient
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Disadvantages of PCA?

A
  • Based on assumption that normal/ anomaly are distinguishable in the reduced space
  • Context not taken into account
  • PCA sensitive to outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the three types of anomalies?

A
  • Point (point x is strange)
  • Contextual (point x strange given set S)
  • Collective (set S is strange)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Outliers have no effect on PCA?

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

PCA assumes relationship between variables is linear?

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

LOF uses reachability distance instead of actual distance to lower effect of outliers?

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

LOF does not require distance metric to work properly and return sensible results?

A

True

17
Q

If p, q have same distances to nearest neighbours, it is possible that LOF returns p as anomaly and q as normal?

A

True

18
Q

Main use of LOF is to find collective anomalies?

A

False (I think its point anomalies)

19
Q

What is NN not suitable for?

A

Datasets that have modes with varying density

20
Q

PCA assumptions?

A

-relationship between variables/ features are linear
- principle components are orthogonal (linearly independent)
- direction with largest variance is the most informative

21
Q

Is DTW scale invariant?

A

No