MLOps basics | FSDL | Priority Flashcards

Question 1

Q

Write out a diagram of ML product engineering with a data flywheel.

mlops data-flywheel

Answer

A

(See source material.)

Source: Lecture 1: Course Vision and When to Use ML > But ML-powered products require an outer loop

Question 2

Q

What are some of the items on a checklist to assess the feasibility of an ML project?

mlops ml-powered-products

Answer

A

(See source material.)

mlops ml-powered-products

Question 3

Q

Draw a diagram of the model-as-service pattern.

mlops ml-powered-products deployment

Answer

A

(See source material.)

mlops ml-powered-products deployment

Question 4

Q

What are a few of the pros and cons of the model-as-service pattern?

mlops ml-powered-products deployment

Answer

A

(See source material.)

mlops ml-powered-products deployment

Question 5

Q

Explain the various forms of parallelism in distributed training.

mlops training

Answer

A

Trivial parallelism: model and data fit on single gpus.
Data parallelism: model fits on a single gpu but data is spread across gpus; average gradients are computed by the model across gpus
Model parallelism
a. Sharded data parallelism
i. DeepSpeed, FairScale, fully-sharded data-parallel (pytorch)
ii. shards the optimizer states, the gradients, and the model parameters.
b. Pipelined model parallelism
i. put each layer of your model on each GPU.
c. Tensor parallelism
i. distribute the matrix over multiple GPUs