Questions #2 Flashcards

1
Q

True or false : Putting too many parameters into a model results in overfitting the model

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or false : The best model should provide the most information

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Can you tell me the definition of Information

A

The information of an outcome is defined as the decrease in uncertainty from observing the outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 3 properties for a measure of uncertainty?

A
  1. Continuity : Should be a continuous function of the parameters of the distribution
  2. Additivity
  3. Monotonicity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the unique measure of uncertainty that satisfies the 3 properties ?

A

Information entropy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the definition of cross-entropy?

A

Cross-entropy is a measure of uncertainty of using a different distribution with event probabilities q, to estimate a distribution with the same events with probabilities p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

True or false : Cross-entropy is symmetric

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or false : Using a low-entropy distribution to predict a high entropy distribution is worst than the opposite

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or false : The Kullback-Leibler Divergence grows as the esitmate moves away from the true distribution

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

True or false : The Kullback-Leibler Divergence is symmetric

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define the Deviance formula

A

-2 times the loglikelihood

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or false : The lower the deviance, the better

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Tell me the steps to calculate LPPD

A
  1. At each point, take the average of the sample
  2. Log the average
  3. Sum all the logs over all points
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

True or false : Deviance is measure of predictive accurary, not of truth

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

True or false : When doing cross-validation, to truly use deviance or lppd as measure of accuracy, it should be calculated on the test data

A

True, because the deviance will be lower on the training data when we add parameters, even if they are not relevant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Complete : One way to avoid overfitting is to use ..

A

regularizing prior

17
Q

What is the regularizing prior?

A

A regularizing prior is one that contains information. The more informationit has, the stronger the regularization

18
Q

True or false : A regularizing prior is skeptical of the information; the stronger the regularization, the more data is needed to overwhelm it

19
Q

True or false : LOOCV does not require lots of computer runs

A

False. It requires a lot of computer runs

20
Q

Can you tell me an alternative to LOOCV

A

Pareto-Smoothed importance sampling cross-validation (PSIS)

21
Q

Can you tell me an alternative to LOOCV which is not PSIS

A

Information criteria

22
Q

Define the formula of the AIC with the deviance

A

Use the D of the training data

23
Q

When we are calculating AIC, we are using the deviance of the training data. AIC can be calculated with the deviance of the test if : (3)

A
  1. The prior is flat
  2. The posterior is approximately multivariate normal
  3. The size of the same is much greater than the number of parameters
24
Q

True or false : WAIC and PSIS are similar for ordinary linear models

25
Cross-validation and PSIS have higher variance as estimators of divergence, but WAIC has higher bias
True
26
Large differences between WAIC and PSIS imply one of them is unreliable
True
27
WAIC identifies highly influential observations, unlike PSIS
False, c'est le contraire
28
True or false : When doing the information criteria, it is important that each model has the same number of observations
True
29
True or false : Normal distribution has a heavier tail than the student distribution
False. Student has a heavier tail