MLOps basics | FSDL | Priority Flashcards

1
Q

Write out a diagram of ML product engineering with a data flywheel.

mlops data-flywheel

A

(See source material.)

Source: Lecture 1: Course Vision and When to Use ML > But ML-powered products require an outer loop

2
Q

What are some of the items on a checklist to assess the feasibility of an ML project?

mlops ml-powered-products

A

(See source material.)

Source: Lecture 1: Course Vision and When to Use ML > ML Feasibility Assessment

3
Q

Draw a diagram of the model-as-service pattern.

mlops ml-powered-products deployment

A

(See source material.)

4
Q

What are a few of the pros and cons of the model-as-service pattern?

mlops ml-powered-products deployment

A

(See source material.)

5
Q

Explain the various forms of parallelism in distributed training.

mlops training

A
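  1. Trivial parallelism: the model and a batch of data both fit on a single GPU, so each GPU can run an independent experiment.
  2. Data parallelism: the model fits on a single GPU but the batch does not; the batch is split across GPUs, each GPU computes gradients on its shard, and the gradients are averaged across GPUs.
  3. Model parallelism: the model itself does not fit on a single GPU.
    a. Sharded data parallelism
      i. Implemented in DeepSpeed, FairScale, and PyTorch's Fully Sharded Data Parallel (FSDP).
      ii. Shards the optimizer states, the gradients, and the model parameters across GPUs.
    b. Pipelined model parallelism
      i. Put each layer (or group of layers) of the model on a different GPU.
    c. Tensor parallelism
      i. Split a single large matrix multiplication across multiple GPUs.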
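
As a rough illustration of two of the ideas above, the sketch below simulates data parallelism and tensor parallelism in plain Python, with list slices standing in for GPUs. The function names (`grad_mse`, `data_parallel_grad`, `tensor_parallel_matmul`) and the toy model y = w * x are assumptions made for this example; real frameworks handle device placement and communication very differently.

```python
def grad_mse(w, xs, ys):
    """Gradient of mean squared error with respect to w for the toy model y = w * x."""
    n = len(xs)
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / n


def data_parallel_grad(w, xs, ys, num_workers):
    """Split the batch into equal shards, compute one gradient per shard, then average."""
    shard = len(xs) // num_workers
    assert shard * num_workers == len(xs), "batch must divide evenly across workers"
    grads = [
        grad_mse(w, xs[i * shard:(i + 1) * shard], ys[i * shard:(i + 1) * shard])
        for i in range(num_workers)
    ]
    return sum(grads) / num_workers


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def tensor_parallel_matmul(a, b, num_workers):
    """Give each worker a slice of b's columns, then concatenate the partial outputs."""
    cols = len(b[0])
    per = cols // num_workers
    assert per * num_workers == cols, "columns must divide evenly across workers"
    parts = [
        matmul(a, [row[i * per:(i + 1) * per] for row in b])
        for i in range(num_workers)
    ]
    return [sum((part[r] for part in parts), []) for r in range(len(a))]


xs, ys = [1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]
full_grad = grad_mse(0.5, xs, ys)
sharded_grad = data_parallel_grad(0.5, xs, ys, num_workers=2)

a = [[1, 2], [3, 4]]
b = [[1, 0, 2, 1], [0, 1, 1, 3]]
full_product = matmul(a, b)
split_product = tensor_parallel_matmul(a, b, num_workers=2)
```

With equal-sized shards, the averaged per-shard gradient matches the full-batch gradient, and the column-split product matches the ordinary product.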
6
Q

What takes up GPU memory?

mlops training

A

Model parameters
Gradients
Optimizer states (statistics about gradients)
Batch of data (and the activations computed from it)
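
A back-of-the-envelope estimate of the first three items can be sketched as below. The helper `training_memory_bytes` is hypothetical, and its defaults are assumptions: 4 bytes per value (fp32) and two optimizer states per parameter (as with Adam's first and second moment estimates). It deliberately leaves out the batch and its activations, which depend on the architecture and batch size.

```python
def training_memory_bytes(num_params, bytes_per_value=4, optimizer_states_per_param=2):
    """Rough memory for parameters, gradients, and optimizer states (batch excluded)."""
    params = num_params * bytes_per_value
    grads = num_params * bytes_per_value  # one gradient value per parameter
    opt_states = num_params * bytes_per_value * optimizer_states_per_param
    return {
        "parameters": params,
        "gradients": grads,
        "optimizer_states": opt_states,
        "total": params + grads + opt_states,
    }


# Example: a 1-billion-parameter model under the assumptions above.
estimate = training_memory_bytes(1_000_000_000)
total_gb = estimate["total"] / 1e9
```

Under these assumptions the optimizer states alone take as much memory as the parameters and gradients combined.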

7
Q

Using a cloud provider, how can you minimize costs?

mlops training

A

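Use the most expensive per-hour GPU in the least expensive cloud: faster GPUs finish training sooner, so the cost per experiment is often lower even though the hourly rate is higher.
Startups (e.g., Paperspace) tend to be cheaper than major cloud providers.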
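
The heuristic comes down to comparing cost per experiment rather than cost per hour. The sketch below uses made-up hourly prices and run times, chosen only to show the arithmetic; they are not real quotes from any provider.

```python
def cost_per_experiment(price_per_hour, hours_to_finish):
    """Total cost of one training run: hourly price times wall-clock hours."""
    return price_per_hour * hours_to_finish


# Hypothetical numbers: the faster GPU costs 3x as much per hour
# but finishes the same run 5x sooner.
slow_gpu_cost = cost_per_experiment(price_per_hour=1.0, hours_to_finish=10.0)
fast_gpu_cost = cost_per_experiment(price_per_hour=3.0, hours_to_finish=2.0)
cheaper = "fast" if fast_gpu_cost < slow_gpu_cost else "slow"
```

Whether the faster GPU actually wins depends on the real speedup for a given workload, so it is worth benchmarking before committing.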
