Revise Flashcards

(137 cards)

1
Q

What is Artificial Intelligence (AI)?

A

Computer systems that perform tasks typically requiring human intelligence.

2
Q

What is Machine Learning (ML)?

A

A branch of AI in which systems learn from data patterns and training rather than from explicitly coded instructions.

3
Q

What is Deep Learning?

A

A subset of ML inspired by the human brain, using layers of neural networks to solve complex problems.

4
Q

What is Generative AI?

A

A subset of deep learning focused on creating new content (such as text, images, or music) from learned data.

5
Q

What are Large Language Models (LLMs)?

A

A type of Generative AI focused on understanding and generating human-like text.

6
Q

What is the difference between Gen AI and Traditional AI?

A

Traditional AI's goal is to interpret, analyze, and respond to inputs such as human actions, whereas generative AI focuses on creating new content or data.

7
Q

What is Natural Language Processing (NLP)?

A

A branch of ML that enables machines to understand, interpret, and generate human language, including the context of a corpus (a body of related text).

8
Q

What is Regression?

A

Regression predicts a continuous numerical value.

Example: What is the temperature going to be tomorrow?
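A minimal regression sketch, assuming scikit-learn and made-up weather features (humidity, pressure); it learns a continuous mapping and predicts a number:

```python
# Minimal regression sketch using scikit-learn (synthetic data, illustration only).
import numpy as np
from sklearn.linear_model import LinearRegression

# Features: [humidity, pressure]; target: tomorrow's temperature in °C (made-up values).
X = np.array([[0.3, 1012], [0.5, 1008], [0.7, 1003], [0.9, 998]])
y = np.array([22.0, 19.5, 17.0, 14.5])

model = LinearRegression().fit(X, y)   # learn a continuous mapping
print(model.predict([[0.6, 1005]]))    # predict a numerical value
```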

9
Q

What is Classification?

A

Classification predicts categorical outcomes.

Example: Will it be hot or cold tomorrow?

10
Q

What is Clustering?

A

Clustering groups similar data points together.
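A minimal clustering sketch, assuming scikit-learn's k-means and made-up 2-D points:

```python
# Minimal clustering sketch with k-means (scikit-learn); data points are made up.
import numpy as np
from sklearn.cluster import KMeans

points = np.array([[1, 2], [1, 4], [1, 0],
                   [10, 2], [10, 4], [10, 0]])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(points)
print(kmeans.labels_)   # e.g., [1 1 1 0 0 0] — similar points share a cluster label
```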

11
Q

What is Supervised Learning?

A

Training a model on pre-labelled data so the machine can learn from these known results.

12
Q

What is Unsupervised Learning?

A

Training a model on unlabelled data so it can discover patterns and apply its own labels.

13
Q

What is Reinforcement Learning?

A

Model learns through trial and error, receiving feedback in the form of rewards or penalties.

14
Q

What is Semi-supervised Learning?

A

Training data contains very few labelled examples and a large number of unlabelled examples.

15
Q

What is a Neural Network (NN)?

A

Algorithms modelled on the brain: data is fed into neurons and passed through successive layers to produce an output.

16
Q

What is a Perceptron?

A

An algorithm for supervised learning of binary classifiers; the simplest neural network, a single neuron with a step activation.
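A from-scratch sketch of the perceptron update rule, learning the AND function (toy data, not any particular library's API):

```python
# From-scratch perceptron sketch (binary classifier) learning the AND function.
import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])                       # AND labels

w, b, lr = np.zeros(2), 0.0, 0.1
for _ in range(10):                              # a few passes over the data
    for xi, target in zip(X, y):
        pred = int(np.dot(w, xi) + b > 0)        # step activation
        w += lr * (target - pred) * xi           # perceptron update rule
        b += lr * (target - pred)

print([int(np.dot(w, xi) + b > 0) for xi in X])  # -> [0, 0, 0, 1]
```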

17
Q

What are the types of data?

A
1. Structured data: organized in rows and columns (tables).
2. Semi-structured data: partially organized, e.g., key-value pairs such as JSON.
3. Unstructured data: cannot be stored in a table format (e.g., free text, images).
4. Time-series data: values recorded over time.
18
Q

What are the components of a Machine Learning Model?

A
1. Algorithm: describes the relationship between input and output.
2. Inference code: the software that runs the model to make predictions.
3. Model artifacts: the trained parameters and metadata.
19
Q

What is Inference?

A

The process of using a trained ML model to make predictions on new, unseen data.

20
Q

What are Inference Parameters?

A

Parameters like response length and stop sequences control the output generated by a model during inference.

21
Q

What are the types of Inference?

A
1. Real-time inference: the model is deployed on a persistent endpoint for immediate, low-latency predictions.
2. Batch transform inference: suitable for offline processing of large datasets.
22
Q

What is Overfitting?

A

When a model performs well on training data but poorly on new, real-world data, because it has memorized the training set rather than learning general patterns.

23
Q

What is Underfitting?

A

When a model is too simple to capture meaningful relationships between input and output data, so it performs poorly even on training data.

24
Q

What is Bias and Fairness in ML?

A

Bias arises when a lack of diversity in training data leads to skewed predictions; fairness requires that a model's outputs do not systematically disadvantage particular groups.

25
What is a Foundation Model (FM)?
A general-purpose model trained on vast amounts of data, which can be fine-tuned for specific tasks.
26
What is a Large Language Model (LLM)?
A foundation model, typically built on the transformer architecture, trained on vast text corpora to understand and generate human language.
27
What is Transformer Architecture?
A neural network architecture that processes entire sequences in parallel; its multi-head attention and positional encoding make it especially effective at NLP.
28
What is Tokenization?
Converts text into a sequence of tokens that a model can process.
29
What are Embeddings?
Specialized vectors that represent semantic meaning and relationships.
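A toy sketch of embeddings: the vectors below are invented for illustration (real models learn them), but cosine similarity is the standard way to compare them:

```python
# Toy embedding sketch: similar meanings get nearby vectors (values are made up).
import numpy as np

emb = {
    "king":  np.array([0.9, 0.8, 0.1]),
    "queen": np.array([0.9, 0.7, 0.2]),
    "apple": np.array([0.1, 0.2, 0.9]),
}

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

print(cosine(emb["king"], emb["queen"]))  # high: related meanings
print(cosine(emb["king"], emb["apple"]))  # low: unrelated meanings
```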
30
What is Fine Tuning?
Retraining a pretrained model's weights on a smaller, task-specific dataset.
31
What is the Machine Learning (ML) Pipeline?
A systematic process used to build, train, and deploy machine learning models.
32
What is the first step in the ML Pipeline?
Identify Business Goal: Define success criteria and align stakeholders.
33
What is the second step in the ML Pipeline?
Frame the ML Problem: Define inputs, outputs, and metrics.
34
What is the third step in the ML Pipeline?
Collect Data: Prepare the necessary data for training the model.
35
What is the fourth step in the ML Pipeline?
Pre-Process Data: Clean and prepare the data for training.
36
What is the fifth step in the ML Pipeline?
Engineer Features: Select and engineer features that enhance model performance.
37
What is the sixth step in the ML Pipeline?
Train, Tune, and Evaluate the Model: Train the model and evaluate performance.
38
What are Training Parameters (Hyperparameters)?
Settings that control the training process (e.g., learning rate, batch size, number of epochs); they are set before training rather than learned from the data.
39
What are Inference Parameters?
Settings that control a trained model's output when making predictions (e.g., temperature, Top K, Top P, response length, stop sequences).
40
What are Model Parameters?
The values (weights and biases) learned during training; they are fixed when the trained model is used to make predictions.
41
What does Temperature control in text generation?
Temperature controls the randomness of the generated text.
42
What is Top K in text generation?
Top K selects only the top k most likely tokens for output.
43
What is Top P in text generation?
Top P uses cumulative probability to choose tokens, focusing on the smallest set of tokens with a combined probability of P.
44
When should you use Top-P?
Use Top-P when you want adaptive diversity but want to stay closer to more likely outcomes.
45
When should you use Temperature?
Use Temperature when you need consistent randomness control across the board.
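A sketch of how temperature, Top K, and Top P interact during next-token sampling, using NumPy and toy logits (the vocabulary and numbers are invented):

```python
# How temperature, Top K, and Top P shape next-token sampling (toy logits).
import numpy as np

vocab  = np.array(["cat", "dog", "car", "sun", "sky"])
logits = np.array([2.0, 1.5, 0.5, 0.2, 0.1])

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def sample(logits, temperature=1.0, top_k=None, top_p=None,
           rng=np.random.default_rng(0)):
    probs = softmax(logits / temperature)       # higher T -> flatter, more random
    order = np.argsort(probs)[::-1]             # tokens from most to least likely
    keep = len(probs)
    if top_k is not None:
        keep = min(keep, top_k)                 # keep only the k most likely tokens
    if top_p is not None:
        cum = np.cumsum(probs[order])           # smallest set with mass >= p
        keep = min(keep, int(np.searchsorted(cum, top_p) + 1))
    idx = order[:keep]
    p = probs[idx] / probs[idx].sum()           # renormalize over the kept tokens
    return vocab[rng.choice(idx, p=p)]

print(sample(logits, temperature=0.5, top_k=2))   # conservative: "cat" or "dog"
print(sample(logits, temperature=1.5, top_p=0.9)) # flatter and broader: more diverse
```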
46
What is Beam search?
Beam search explores multiple candidate sequences in parallel, keeping only the most promising partial sequences (beams) at each step.
47
What is Greedy search?
Greedy search selects the most likely token at each step.
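A toy beam-search sketch over an invented next-token table; greedy search is the special case beam_width=1:

```python
# Toy beam search: keep the `beam_width` best partial sequences each step.
import math

# P(next_token | previous_token) — made-up numbers, purely for illustration.
NEXT = {
    "<s>": {"the": 0.6, "a": 0.4},
    "the": {"cat": 0.5, "dog": 0.3, "end": 0.2},
    "a":   {"cat": 0.2, "dog": 0.7, "end": 0.1},
    "cat": {"end": 1.0},
    "dog": {"end": 1.0},
}

def beam_search(beam_width=2, steps=3):
    beams = [(["<s>"], 0.0)]                    # (sequence, log-probability)
    for _ in range(steps):
        candidates = []
        for seq, score in beams:
            for tok, p in NEXT.get(seq[-1], {}).items():
                candidates.append((seq + [tok], score + math.log(p)))
        # keep only the highest-scoring partial sequences
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    return beams

for seq, score in beam_search():
    print(" ".join(seq), round(math.exp(score), 3))
```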
48
What does Response length specify?
Response length specifies the maximum length of generated output.
49
What are Penalties in text generation?
Penalties apply to repeated tokens or sequences to encourage variety in the generated text.
50
What are Stop sequences?
Stop sequences define specific sequences where the model will stop generating text.
51
What is MSE (Mean Squared Error)?
MSE is the average of squared differences between predictions and actual values. Lower is better.
52
What is RMSE (Root Mean Squared Error)?
RMSE is the square root of MSE, providing error measurement in the same units as the original data. Lower is better.
53
What does Perplexity measure?
Perplexity measures how well a language model predicts sequences of words/tokens. Lower values indicate better prediction capability.
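Worked examples of MSE, RMSE, and perplexity on made-up values:

```python
# MSE, RMSE, and perplexity computed on tiny made-up values.
import numpy as np

# Regression metrics
y_true = np.array([3.0, 5.0, 7.0])
y_pred = np.array([2.5, 5.5, 8.0])
mse  = np.mean((y_true - y_pred) ** 2)   # (0.25 + 0.25 + 1.0) / 3 = 0.5
rmse = np.sqrt(mse)                      # ≈ 0.707, in the same units as y
print(mse, rmse)

# Perplexity = exp(average negative log-likelihood of the observed tokens).
token_probs = np.array([0.5, 0.25, 0.8])          # model's probability for each true token
perplexity = np.exp(-np.mean(np.log(token_probs)))
print(perplexity)                                  # lower means better prediction
```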
54
What is Precision in classification metrics?
Precision is the ratio of true positives to all positive predictions. Higher is better.
55
What is Recall (TPR)?
Recall is the ratio of true positives to all actual positives. Higher is better.
56
What is False Positive Rate (FPR)?
FPR is the ratio of false positives to all actual negatives. Lower is better.
57
What is Specificity (TNR)?
Specificity is the ratio of true negatives to all actual negatives. Higher is better.
58
What is Accuracy in classification metrics?
Accuracy is the ratio of all correct predictions to total predictions. Higher is better.
59
What is F1 Score?
F1 Score is the harmonic mean of precision and recall, balancing both metrics. Higher is better.
60
What does the ROC Curve plot?
The ROC Curve plots TPR against FPR at various thresholds. Higher AUC is better.
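The classification metrics above, computed from raw confusion counts on toy predictions:

```python
# Classification metrics from confusion counts (toy labels and predictions).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]

tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))  # true positives
fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))  # false positives
fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))  # false negatives
tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))  # true negatives

precision   = tp / (tp + fp)            # TP / all positive predictions
recall      = tp / (tp + fn)            # TPR: TP / all actual positives
fpr         = fp / (fp + tn)            # FP / all actual negatives
specificity = tn / (tn + fp)            # TNR: TN / all actual negatives
accuracy    = (tp + tn) / len(y_true)   # all correct / all predictions
f1          = 2 * precision * recall / (precision + recall)
# The ROC curve plots recall (TPR) against FPR as the decision threshold varies.
print(precision, recall, fpr, specificity, accuracy, f1)
```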
61
What are SageMaker Training Jobs?
SageMaker Training Jobs manage training processes, specifying training data, hyperparameters, and compute resources.
62
What are SageMaker Experiments?
SageMaker Experiments track model runs and hyperparameter tuning.
63
What is Automatic Model Tuning (AMT)?
AMT automatically tunes hyperparameters using the specified metric.
64
What is Real-Time Inference in SageMaker?
Real-Time Inference is for low-latency, sustained traffic predictions with auto-scaling capabilities.
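A sketch of calling a real-time endpoint with boto3; the endpoint name and CSV payload are placeholders for your own deployment:

```python
# Invoking a deployed SageMaker real-time endpoint with boto3.
import boto3

runtime = boto3.client("sagemaker-runtime")

response = runtime.invoke_endpoint(
    EndpointName="my-endpoint",      # hypothetical endpoint name
    ContentType="text/csv",          # payload format your model container expects
    Body="0.6,1005",                 # one example row of features
)
print(response["Body"].read().decode("utf-8"))
```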
65
What is Batch Transform in SageMaker?
Batch Transform processes large batches of data asynchronously.
66
What is Asynchronous Inference?
Asynchronous Inference handles long-running inference requests with large payloads without immediate responses.
67
What is Serverless Inference?
Serverless Inference is for intermittent traffic, where the model scales automatically without infrastructure management.
68
What is On-Demand Inference in Bedrock?
On-Demand Inference is pay-per-use based on the number of input/output tokens.
69
What is Provisioned Throughput in Bedrock?
Provisioned Throughput provides guaranteed capacity for consistent, high-throughput inference.
70
What are Bedrock Agents?
Bedrock Agents deploy agents for multi-step workflows, integrating models with tools like Amazon Kendra and AWS Lambda.
71
What is AWS API Gateway?
AWS API Gateway exposes the model as an API endpoint for integration with applications.
72
What is Data Drift?
Data Drift occurs when the input data changes, but the relationship between inputs and outputs remains the same.
73
What is Concept Drift?
Concept Drift occurs when the relationship between inputs and outputs changes, meaning the model's learned patterns no longer apply.
74
What is SageMaker Model Monitor?
SageMaker Model Monitor runs scheduled checks for data drift on deployed models and sends the results to CloudWatch.
75
What is MLOps?
MLOps applies DevOps practices to manage machine learning models throughout their lifecycle.
76
What are SageMaker Pipelines?
SageMaker Pipelines automate and manage the ML workflow end-to-end.
77
What is AWS CodePipeline?
AWS CodePipeline automates the build, test, and deploy phases for models.
78
What is SageMaker Model Registry?
SageMaker Model Registry manages and tracks model versions and metadata.
79
What is Amazon S3 used for?
Amazon S3 is used to store trained model artifacts after training.
80
What is Model Governance?
Model Governance ensures transparency, accountability, and regulatory compliance for ML models.
81
What is SageMaker Clarify?
SageMaker Clarify helps identify and mitigate biases in ML models.
82
What are SageMaker Model Cards?
SageMaker Model Cards create documentation for trained models, including performance metrics and intended use.
83
What is ML Governance from SageMaker?
ML Governance from SageMaker provides tools for tighter control and visibility over ML models.
84
What is SageMaker ML Lineage Tracking?
SageMaker ML Lineage Tracking captures the entire workflow, tracking model lineage for reproducibility.
85
What is Glue DataBrew?
Glue DataBrew simplifies data governance with visual data preparation and quality rules.
86
What is AWS Audit Manager?
AWS Audit Manager automates the auditing of AWS services for continuous compliance.
87
What is AWS Artifact?
AWS Artifact provides on-demand access to compliance reports and agreements.
88
What is AWS Trusted Advisor?
AWS Trusted Advisor provides recommendations for cost and performance improvements.
89
What is SageMaker Managed Spot Training?
SageMaker Managed Spot Training reduces training costs by utilizing spare AWS EC2 capacity.
90
What is SageMaker Profiler?
SageMaker Profiler identifies inefficient resource use during model training.
91
What is Amazon Inspector?
Amazon Inspector automates security assessments of ML applications.
92
What is Continual Learning?
Continual Learning involves continuously retraining models to account for new data and changing conditions.
93
What is Continued-Pretraining?
Continued-Pretraining uses unlabeled data to expand the model's overall knowledge.
94
What is Transfer Learning?
Transfer Learning involves fine-tuning an existing model for a new problem.
95
What is the Least Privilege Principle?
The Least Privilege Principle ensures IAM roles grant only necessary permissions.
96
What are PrivateLink and VPC Endpoints?
PrivateLink and VPC Endpoints lock down SageMaker so that traffic stays within your VPC rather than traversing the public internet.
97
What is Encryption at Rest and in Transit?
SageMaker encrypts data at rest using KMS keys and data in transit using TLS.
98
What are IAM Roles and Policies?
IAM Roles and Policies manage secure access to model data and resources.
99
What is S3 Block Public Access?
S3 Block Public Access prevents model data from being exposed.
100
What is AWS IAM Identity Center?
AWS IAM Identity Center centralizes identity management across AWS accounts.
101
What is AWS Config?
AWS Config continuously monitors and records configuration changes across AWS resources.
102
What is AWS CloudTrail?
AWS CloudTrail logs API calls and tracks user activity for auditing.
103
What is Amazon SageMaker?
Amazon SageMaker is an integrated machine learning service for building, training, and deploying models.
104
What is the typical SageMaker training process?
The typical SageMaker training process involves training data (usually in S3), compute instances, a training container image, job configuration, and an S3 output bucket for the resulting model artifacts.
105
What is Ground Truth?
Ground Truth is a human-powered data labeling service.
106
What is Data Wrangler?
Data Wrangler is a tool for easy data cleaning and transformation.
107
What is Feature Store?
Feature Store is a central repository for ML features.
108
What is SageMaker Studio?
SageMaker Studio is a web-based ML IDE.
109
What is SageMaker JumpStart?
SageMaker JumpStart provides pre-trained models and solutions.
110
What is SageMaker Canvas?
SageMaker Canvas is a no-code visual model building tool.
111
What is SageMaker Autopilot?
SageMaker Autopilot automates model building and tuning.
112
What is MLflow?
MLflow is a tool to track and compare experiments.
113
What is A2I (Augmented AI)?
A2I provides human review for quality assurance.
114
What is Amazon Q?
Amazon Q is a generative AI-powered assistant for tasks like answering questions and generating content.
115
What is Amazon Q Business?
Amazon Q Business helps with tasks by accessing enterprise data sources.
116
What is Amazon Q Developer?
Amazon Q Developer includes features like code generation and security scanning.
117
What is Amazon Q in QuickSight?
Amazon Q in QuickSight allows natural language querying of business intelligence data.
118
What is Amazon Q in Connect?
Amazon Q in Connect improves customer service by answering customer inquiries, automating responses, and managing tickets using natural language AI.
119
What is Amazon Q in AWS Supply Chain?
Amazon Q in AWS Supply Chain assists in optimizing and automating supply chain management by generating insights from supply chain data, streamlining inventory management, and forecasting demand.
120
What is Amazon Bedrock?
Amazon Bedrock is a fully managed, serverless service that provides access to high-performing foundation models (FMs) from leading AI companies through a single API.
121
What do you need to use Amazon Bedrock?
1. Prompt: a specific set of inputs to guide LLMs to generate an appropriate output or completion.
2. Inference parameters: temperature, Top K, Top P, response length, stop sequences.
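A sketch of supplying a prompt and inference parameters through boto3; the request-body schema varies by model, so the Titan-style fields below are an assumption — check your model's documentation:

```python
# Invoking a Bedrock foundation model with a prompt and inference parameters.
import boto3, json

bedrock = boto3.client("bedrock-runtime")

body = {
    "inputText": "Summarize the benefits of serverless inference.",  # the prompt
    "textGenerationConfig": {        # Titan-style schema — an assumption here
        "temperature": 0.5,          # randomness
        "topP": 0.9,                 # nucleus sampling
        "maxTokenCount": 256,        # response length
        "stopSequences": ["\n\n"],   # where generation should stop
    },
}

response = bedrock.invoke_model(
    modelId="amazon.titan-text-express-v1",   # example model ID
    body=json.dumps(body),
)
print(json.loads(response["body"].read()))
```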
122
What are the features of Amazon Bedrock?
- Model Catalog: AI model library for browsing and selecting foundation models.
- Custom Models: customize foundation models with your own data.
- Foundation Model Evaluation: compare models side by side.
- Playgrounds: experiment with deployed model APIs.
- Bedrock Knowledge Bases: fetch data from private sources.
- Bedrock Agents: create agents for complex tasks.
- Serverless: simplifies deployment and scaling.
- Security and Privacy Guardrails: ensure compliance with policies.
- PartyRock: build generative AI apps without coding.
123
What is AWS Glue?
AWS Glue is a fully managed, cloud-optimized ETL (Extract, Transform, Load) service that helps prepare and load data for analytics and AI models.
124
What are the features of AWS Glue?
- AWS Glue ETL Service: cloud-based ETL service for data preparation.
- AWS Glue Data Catalog: centralized metadata repository for data assets.
- AWS Glue DataBrew: visual tool for data preparation.
- AWS Glue Data Quality: detects anomalies and recommends data quality rules.
125
What are the design considerations for applications using foundation models?
1. Cost: training your own model vs. using a pre-trained one.
2. Latency: inference times of foundation models.
3. Modalities: combining multiple models for different input types.
4. Architecture: model size and complexity aligned to the task.
5. Complexity: more parameters require more resources.
6. Performance and metrics: select appropriate evaluation metrics.
7. Bias and fairness: evaluate model outputs across demographics.
8. Availability and compatibility: verify model availability in your regions.
9. Security and privacy: implement data handling procedures.
10. Scalability: design for varying loads.
126
What is Retrieval Augmented Generation (RAG)?
RAG combines retrieval systems with generative AI models by retrieving relevant information, augmenting the prompt, and generating a response.
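A high-level sketch of the retrieve-augment-generate loop; embed, vector_store, and llm are hypothetical stand-ins for an embedding model, a vector database, and a foundation model:

```python
# High-level RAG sketch. `embed`, `vector_store.search`, and `llm.generate`
# are hypothetical stand-ins, not a specific library's API.
def answer_with_rag(question, vector_store, llm, embed, k=3):
    # 1. Retrieve: find the documents most similar to the question.
    query_vector = embed(question)
    documents = vector_store.search(query_vector, top_k=k)

    # 2. Augment: put the retrieved context into the prompt.
    context = "\n\n".join(doc.text for doc in documents)
    prompt = (
        f"Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )

    # 3. Generate: the model's answer is grounded in the retrieved data.
    return llm.generate(prompt)
```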
127
What are the benefits of RAG?
- Accuracy: responses grounded in specific data.
- Freshness: access to up-to-date information.
- Reduced hallucinations: less likely to generate incorrect information.
- Transparency: citations for information sources.
- Cost efficiency: more practical than fine-tuning models.
128
What are effective prompt engineering techniques?
- Zero-Shot Prompt: no examples provided.
- One-Shot Prompt: one example provided.
- Few-Shot Prompt: a few examples provided.
- Negative Prompting: tells the model what to exclude, e.g., removing unwanted aspects in image generation.
- Prompt Template: a standardized format for prompts.
- Chain-of-Thought Prompting: encourages step-by-step reasoning.
- Prompt Tuning: adjusting prompts for better performance.
129
What is latent space?
A compressed, continuous numerical representation where high-dimensional data is encoded into lower-dimensional vectors.
130
What is a context-window?
The maximum number of tokens an LLM can process at once, which affects tasks such as long-form text generation.
131
What are hallucinations in AI models?
Hallucinations occur when a model generates incorrect information that sounds plausible but is not factual.
132
What are multi-modal models?
Models that work across multiple data types, embedding text, images, or audio into a shared space for richer outputs.
133
What is catastrophic forgetting?
When a model trained on new tasks loses the knowledge it had of previously learned ones, e.g., when fine-tuning overwrites earlier training.
134
What is continuous pre-training?
The process of providing unlabeled data to pre-train a model on new, domain-specific data.
135
What are methods to evaluate foundation model performances?
Assessing performance on benchmark tasks, evaluating fine-tuning for specific applications, testing resilience, analyzing biases, and understanding model interpretability.
136
What is responsible AI?
Responsible AI is characterized by fairness, explainability, robustness, transparency, governance, and privacy/security.
137
What are key activities for security in AI systems?
- Least Privilege Principle: grant only necessary permissions.
- PrivateLink and VPC Endpoints: secure, private access to resources.
- Encryption: protect data at rest and in transit.
- IAM Roles: manage secure access to model data.
- S3 Block Public Access: prevent exposure of model data.