Improving The Model Flashcards

1
Q

In what order should we address problems when improving a model?

A
  1. Underfitting
  2. Overfitting
  3. Distribution shift
  4. Re-balance the dataset
2
Q

Addressing underfitting should be done in the following order:

A
  1. Make your model bigger (see the sketch after this list)
  2. Reduce regularization
  3. Error analysis
  4. Choose a different model architecture (closer to SOTA)
  5. Tune hyper-parameters
  6. Add features
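A minimal PyTorch sketch of steps 1 and 2, assuming a small MLP baseline; the layer sizes, dropout rates, and weight-decay value are illustrative, not prescriptive:

```python
import torch
import torch.nn as nn

# Hypothetical baseline that underfits: small capacity, heavy regularization.
small_model = nn.Sequential(
    nn.Linear(64, 32), nn.ReLU(), nn.Dropout(p=0.5),
    nn.Linear(32, 10),
)

# Step 1: make the model bigger (wider and deeper).
# Step 2: reduce regularization (lower dropout here, lower weight decay in the optimizer).
bigger_model = nn.Sequential(
    nn.Linear(64, 256), nn.ReLU(), nn.Dropout(p=0.1),
    nn.Linear(256, 256), nn.ReLU(), nn.Dropout(p=0.1),
    nn.Linear(256, 10),
)
optimizer = torch.optim.AdamW(bigger_model.parameters(), lr=1e-3, weight_decay=1e-5)
```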
3
Q

Addressing overfitting should be in this order:

A
  1. Add more training data
  2. Add normalization (batch norm, layer norm)
  3. Add data augmentation
  4. Increase regularization (dropout, L2, weight decay; see the sketch after this list)
  5. Error analysis
  6. Choose a different model
  7. Tune hyper-parameters
  8. Early stopping
  9. Remove features
  10. Reduce model size
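A minimal PyTorch sketch of a few of these levers: dropout, weight decay (roughly L2), and early stopping. The model, the training and validation callbacks, and all hyperparameter values are assumptions for illustration:

```python
import copy
import torch
import torch.nn as nn

# Regularized model: dropout between layers is a stochastic regularizer.
model = nn.Sequential(
    nn.Linear(64, 128), nn.ReLU(), nn.Dropout(p=0.3),
    nn.Linear(128, 10),
)

# Weight decay in the optimizer behaves (roughly) like L2 regularization.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3, weight_decay=1e-2)

# Early stopping: keep the weights from the best validation epoch and stop
# once validation loss has not improved for `patience` epochs.
def train_with_early_stopping(train_one_epoch, eval_val_loss, patience=5, max_epochs=100):
    best_loss, best_state, since_best = float("inf"), None, 0
    for _ in range(max_epochs):
        train_one_epoch(model, optimizer)   # hypothetical: one pass over training data
        val_loss = eval_val_loss(model)     # hypothetical: loss on the validation set
        if val_loss < best_loss:
            best_loss, best_state, since_best = val_loss, copy.deepcopy(model.state_dict()), 0
        else:
            since_best += 1
            if since_best >= patience:
                break
    model.load_state_dict(best_state)
    return model
```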
4
Q

What can you do to address distribution shift? (Validation and test scores are not close.)

A
  1. Analyze the test-val set error (actually look at what is being labeled wrong; see the sketch below) and either:
     1. collect more training data to compensate, or
     2. synthesize more training data to compensate
  2. Apply domain adaptation techniques to the training and test distributions
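A small sketch of the diagnosis step, assuming a hypothetical `evaluate(model, dataset)` helper that returns accuracy; the tolerance threshold is an arbitrary illustration:

```python
# Hypothetical helper signature: evaluate(model, dataset) -> accuracy in [0, 1].
def diagnose_distribution_shift(model, val_set, test_set, evaluate, tolerance=0.02):
    val_acc = evaluate(model, val_set)
    test_acc = evaluate(model, test_set)
    gap = val_acc - test_acc
    print(f"val={val_acc:.3f}  test={test_acc:.3f}  gap={gap:.3f}")
    if gap > tolerance:
        # Scores are not close: suspect distribution shift. Per the card, inspect
        # the mislabeled test examples, then collect or synthesize matching
        # training data, or apply domain adaptation.
        return "distribution shift suspected"
    return "no significant shift"
```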
5
Q

What is error analysis?

A

Look at where the train-val and val-test scores diverge, and examine the specific failing cases. Split them into different groups and prioritize based on what is easiest to deal with (see the sketch below).
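A tiny sketch of the grouping step; the `categorize` function and the bucket names in its docstring are hypothetical placeholders for whatever failure modes actually show up in your data:

```python
from collections import Counter

def error_analysis(misclassified_examples, categorize):
    """Tally misclassified examples into buckets so the largest ones can be prioritized.

    misclassified_examples: iterable of (input, true_label, predicted_label) tuples.
    categorize: hypothetical function mapping one example to a bucket name,
                e.g. "blurry image", "rare class", "labeling mistake".
    """
    buckets = Counter(categorize(example) for example in misclassified_examples)
    for bucket, count in buckets.most_common():
        print(f"{bucket}: {count} errors")
    return buckets
```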

6
Q

What is domain adaptation?

A

Techniques to train on a “source” distribution and generalize to a “target” distribution using only unlabeled data or limited labeled data from the target.

Consider using it when access to labeled data from the test distribution is limited but access to relatively similar data is plentiful.

7
Q

The 2 types of domain adaptation:

A
  1. Supervised: use when there is limited labeled data from the target domain. Examples: fine-tune a pre-trained model (see the sketch below), add target data to the train set.
  2. Unsupervised: use when you have lots of unlabeled data from the target domain. Examples: correlation alignment, domain confusion, CycleGAN.
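A minimal sketch of the supervised case (fine-tuning a pre-trained model on limited labeled target data); the torchvision ResNet-18 and its weights enum are real, but the class count, hyperparameters, and the commented-out data loader are assumptions:

```python
import torch
import torch.nn as nn
from torchvision import models

# Start from weights learned on the source domain (here, ImageNet).
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Replace the classification head for the target task.
num_target_classes = 5  # hypothetical
model.fc = nn.Linear(model.fc.in_features, num_target_classes)

# Fine-tune with a small learning rate on the limited labeled target data.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=1e-4)
loss_fn = nn.CrossEntropyLoss()

# for images, labels in target_train_loader:  # hypothetical target-domain DataLoader
#     optimizer.zero_grad()
#     loss = loss_fn(model(images), labels)
#     loss.backward()
#     optimizer.step()
```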