MLOps w/ AWS Flashcards
(30 cards)
What are SageMaker deployment guardrails?
A feature that rolls out new versions of your SageMaker endpoints using blue/green deployments, shifting traffic gradually and rolling back automatically on alarms so you can verify everything is OK before the new version takes full traffic
What type of deployment does SageMaker's deployment guardrails use?
Blue/Green
What are the traffic-shifting modes for blue/green deployments?
- All-at-once
- Canary
- Linear
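The canary mode above can be sketched as the deployment config passed to `UpdateEndpoint`; the endpoint and alarm names here are placeholders:

```python
# Minimal sketch of a canary blue/green deployment config, as passed to
# the SageMaker UpdateEndpoint API. Names are placeholders.
deployment_config = {
    "BlueGreenUpdatePolicy": {
        "TrafficRoutingConfiguration": {
            "Type": "CANARY",                 # or "ALL_AT_ONCE" / "LINEAR"
            "CanarySize": {"Type": "CAPACITY_PERCENT", "Value": 10},
            "WaitIntervalInSeconds": 300,     # bake time before full shift
        },
        "TerminationWaitInSeconds": 600,      # keep old (blue) fleet briefly
    },
    "AutoRollbackConfiguration": {
        "Alarms": [{"AlarmName": "my-endpoint-5xx-alarm"}]  # placeholder
    },
}
# An actual call would look like:
# boto3.client("sagemaker").update_endpoint(
#     EndpointName="my-endpoint",
#     EndpointConfigName="my-new-config",
#     DeploymentConfig=deployment_config,
# )
```

If any listed CloudWatch alarm fires during the canary bake time, SageMaker rolls traffic back to the old (blue) fleet.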
What is shadow testing?
Where you deploy a shadow variant that receives a copy of a percentage of production traffic; its responses are logged but not returned to callers. You monitor its performance and decide when to promote it to production
What are SageMaker production variants? When would you use it?
They let you run multiple models behind one endpoint and split live traffic between them. Use them when offline testing with old or synthetic data is not representative, e.g. recommendation algorithms
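A traffic split between two variants can be sketched as the `ProductionVariants` list of an endpoint config; model and variant names are placeholders:

```python
# Sketch: two production variants behind one endpoint, splitting live
# traffic 80/20 for an A/B test. Model/variant names are placeholders.
production_variants = [
    {
        "VariantName": "model-a",
        "ModelName": "recs-model-v1",        # placeholder
        "InstanceType": "ml.m5.large",
        "InitialInstanceCount": 1,
        "InitialVariantWeight": 0.8,         # ~80% of traffic
    },
    {
        "VariantName": "model-b",
        "ModelName": "recs-model-v2",        # placeholder
        "InstanceType": "ml.m5.large",
        "InitialInstanceCount": 1,
        "InitialVariantWeight": 0.2,         # ~20% of traffic
    },
]
# boto3.client("sagemaker").create_endpoint_config(
#     EndpointConfigName="recs-ab-test",
#     ProductionVariants=production_variants)
```

Weights are relative, so each variant receives its weight divided by the sum of all weights.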
How would you get your SageMaker model ready to deploy at the edge?
Use SageMaker Neo - it compiles your trained model for your specified edge device and provides a lightweight runtime that runs the compiled model
How does SageMaker Neo synergise with AWS IoT Greengrass?
AWS IoT Greengrass is the service that actually deploys your Neo-compiled model to the edge devices
What tends to be more expensive in raw terms for training - CPU or GPU?
GPU
What tends to be cheaper - inference or training?
Inference
What are 2 downsides of training on spot instances to save money?
You need to checkpoint to S3 to save training progress, since spot instances can be interrupted at almost any time. Training can also take longer because you may have to wait for spot capacity to become available
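The checkpointing setup above maps to a few Estimator parameters in the SageMaker Python SDK; a minimal sketch, with a placeholder S3 bucket:

```python
# Sketch of Estimator kwargs for managed spot training with the SageMaker
# Python SDK. The S3 path is a placeholder.
spot_kwargs = dict(
    use_spot_instances=True,
    max_run=3600,        # max training seconds
    max_wait=7200,       # must be >= max_run; includes time waiting for spot
    checkpoint_s3_uri="s3://my-bucket/checkpoints/",  # placeholder bucket
)
# est = sagemaker.estimator.Estimator(..., **spot_kwargs)
# est.fit(...)
```

Your training script must also write checkpoints to the local checkpoint directory (by default `/opt/ml/checkpoints`) and resume from them on restart.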
How do you set up auto scaling w/ SM?
Much like EC2: you register the endpoint variant as a scalable target, choose a target metric (typically invocations per instance), set cooldown periods, and the instance count scales to track the target at any given time
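The target-tracking setup can be sketched with the Application Auto Scaling API; endpoint and variant names here are placeholders:

```python
# Sketch of target-tracking auto scaling for an endpoint variant via the
# Application Auto Scaling API. Endpoint/variant names are placeholders.
resource_id = "endpoint/my-endpoint/variant/AllTraffic"
scaling_policy = {
    "TargetValue": 70.0,  # target invocations per instance per minute
    "PredefinedMetricSpecification": {
        "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
    },
    "ScaleInCooldown": 300,   # seconds to wait between scale-in steps
    "ScaleOutCooldown": 60,   # seconds to wait between scale-out steps
}
# client = boto3.client("application-autoscaling")
# client.register_scalable_target(
#     ServiceNamespace="sagemaker", ResourceId=resource_id,
#     ScalableDimension="sagemaker:variant:DesiredInstanceCount",
#     MinCapacity=1, MaxCapacity=4)
# client.put_scaling_policy(
#     PolicyName="invocations-target", ServiceNamespace="sagemaker",
#     ResourceId=resource_id,
#     ScalableDimension="sagemaker:variant:DesiredInstanceCount",
#     PolicyType="TargetTrackingScaling",
#     TargetTrackingScalingPolicyConfiguration=scaling_policy)
```

The longer scale-in cooldown is a common choice: scale out quickly to absorb traffic, scale in conservatively to avoid thrashing.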
Does auto scaling for SageMaker try to balance across AZs automatically for the endpoints?
Yes, but you need more than 1 instance in each endpoint
When would you use the serverless deployment type in SageMaker?
When there is uneven/unpredictable traffic
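A serverless variant replaces the instance type/count with a `ServerlessConfig`; a minimal sketch with a placeholder model name:

```python
# Sketch: a serverless production variant for CreateEndpointConfig.
# The model name is a placeholder; no instance type or count is needed.
serverless_variant = {
    "VariantName": "serverless",
    "ModelName": "my-model",                 # placeholder
    "ServerlessConfig": {
        "MemorySizeInMB": 2048,              # 1024-6144, in 1 GB steps
        "MaxConcurrency": 5,                 # concurrent-invocation cap
    },
}
```

You pay per invocation and compute time rather than for always-on instances, at the cost of possible cold starts.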
When would you use the real-time SageMaker deployment type?
For interactive workloads that need low latency
When would you use SageMaker Jumpstart?
When you can solve your problem with a pre-made model and/or don’t have ML expertise/want the easiest option
What is SageMaker Inference Recommender?
A service that recommends the best instance type and configuration for your models through automated load testing
What 2 recommendation types can SageMaker Inference Recommender give you?
- Instance recommendations (default job, takes ~45 minutes)
- Endpoint recommendations (advanced job with a custom load test, takes ~2 hours)
What is SageMaker Inference Pipelines?
The ability to chain together 2-15 containers, each with its own model, behind a single endpoint. A request flows through the containers in sequence, e.g. pre-processing, then inference, then post-processing.
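The chained containers are expressed as the `Containers` list of a single model; a sketch with placeholder image URIs and S3 paths:

```python
# Sketch: an inference pipeline model whose containers run in sequence
# (preprocess -> predict -> postprocess). Image URIs and S3 paths are
# placeholders; SageMaker allows 2-15 containers per pipeline model.
containers = [
    {"Image": "1234.dkr.ecr.us-east-1.amazonaws.com/preprocess:latest",
     "ModelDataUrl": "s3://my-bucket/preprocess/model.tar.gz"},
    {"Image": "1234.dkr.ecr.us-east-1.amazonaws.com/xgboost:latest",
     "ModelDataUrl": "s3://my-bucket/xgb/model.tar.gz"},
    {"Image": "1234.dkr.ecr.us-east-1.amazonaws.com/postprocess:latest",
     "ModelDataUrl": "s3://my-bucket/postprocess/model.tar.gz"},
]
# boto3.client("sagemaker").create_model(
#     ModelName="inference-pipeline", Containers=containers,
#     ExecutionRoleArn="arn:aws:iam::1234:role/SageMakerRole")
```

Each container's output becomes the next container's input, and the whole pipeline is invoked as one endpoint.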
What is SageMaker Model Monitor?
A service that allows you to get alerts on quality deviations on your deployed models and helps you counteract drifts and biases that could occur in your model over time
Do you need a monitoring schedule in order to use SM Model Monitor effectively?
Yes
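A monitoring schedule can be sketched as the config for `CreateMonitoringSchedule`; the job definition name is a placeholder:

```python
# Sketch: an hourly data-quality monitoring schedule for Model Monitor.
# The job definition name is a placeholder; the schedule uses cron syntax.
monitoring_schedule = {
    "MonitoringScheduleConfig": {
        "ScheduleConfig": {
            "ScheduleExpression": "cron(0 * ? * * *)"  # run hourly
        },
        "MonitoringJobDefinitionName": "my-data-quality-job",  # placeholder
        "MonitoringType": "DataQuality",
    }
}
# boto3.client("sagemaker").create_monitoring_schedule(
#     MonitoringScheduleName="my-schedule", **monitoring_schedule)
```

Each scheduled run compares captured endpoint data against a baseline and emits violations you can alarm on.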
What data can Model Monitor capture with regards to your endpoint?
The request inputs and corresponding inference outputs. The captured data can be encrypted with KMS
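Data capture is enabled via a `DataCaptureConfig` on the endpoint config; a sketch with placeholder bucket and KMS key:

```python
# Sketch of a DataCaptureConfig for CreateEndpointConfig: capture request
# inputs and inference outputs to S3, optionally KMS-encrypted.
# Bucket and key are placeholders.
data_capture_config = {
    "EnableCapture": True,
    "InitialSamplingPercentage": 100,        # capture all requests
    "DestinationS3Uri": "s3://my-bucket/capture/",   # placeholder
    "CaptureOptions": [
        {"CaptureMode": "Input"},
        {"CaptureMode": "Output"},
    ],
    "KmsKeyId": "alias/my-capture-key",      # placeholder; enables encryption
}
```

The captured records land in S3 as JSON lines, which Model Monitor then reads on its schedule.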
What is SageMaker Projects?
SageMaker Studio’s native MLOps solution with CI/CD. Uses SageMaker Pipelines in the backend
What can you use to integrate an existing Kubernetes pipeline with SageMaker?
- SageMaker Operators for Kubernetes
- SageMaker Components for Kubeflow Pipelines
Per server, as a general rule, can you have more VMs or containers?
Containers, since they are much lighter weight than full VMs