Safety and Reliability Flashcards

1
Q

Erroneous Behaviour of a Classifier

A

Given a trained classifier f : R_n -> R_k (from n features to k values) and a target function h : R_n -> R_k, an erroneous behavior of the classifier f is demonstrated by a legitimate input x which exists in R_n such that
arg max_j f(x) != arg max_j h(x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Loss function

A

L(y, f(x)) ; loss between prediction f(x) and ground truth y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Empirical Loss

A

Average loss over a set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Expected Loss

A

The estimated loss (loss of accuracy) before being tested.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Generalisation Loss

A

Empirical loss - Expected loss. Too big of this value is a result of overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Overfitting

A

A machine learning model is overfitted if it performs well on training data but not on test data samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Adversarial Examples

A

Represent erroneous behaviours which introduce safety implications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Measurements of adversarial examples

A
  • Magnitude of perturbation -> ||x-x’||
  • Probability gap between and after the perturbation -> |f_y(x) - f_y(x’)|
    (With f(x) being the regular example and f(x’) being the adjusted example.)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Data Poisoning

A

The injection of malicious data into a training process, making the algorithm perform something it should not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Model Stealing

A

Given model f, a model stealing agent reconstructs another model f’ (etc querying model f).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Membership Inference

A

Identifies the training data via shadow models for the training model by observing the models behaviour and the outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly