Keywords list - AWS Certified AI Practitioner Test - Sheet1 Flashcards
(93 cards)
Amazon SageMaker
A fully managed service that data scientists and developers use to prepare, build, train, and deploy ML models.
Amazon Bedrock
A fully managed service that makes FMs from Amazon and leading AI companies available through an API. Amazon Bedrock has a broad set of capabilities to quickly build and scale genAI applications with security, privacy and responsible AI. You can also privately customize FMs with your own data and seamlessly integrate and deploy them into your apps using AWS tools and capabilities.
Bedrock’s RAG Implementation is called “Knowledge Base”.
SageMaker Data Wrangler
Amazon SageMaker Data Wrangler to balance your data in cases of any imbalances. SageMaker Data Wrangler offers three balancing operators: random undersampling, random oversampling, and Synthetic Minority Oversampling Technique (SMOTE) to rebalance data in your unbalanced datasets.Data preparation, transformation and feature engineering tool. Aggregate and prepare data for ML. Used for data selection, cleaning, exploration, visualization and processing. Has SQL Support for data query, and a Data Quality tool to analyze quality of the data.
Amazon SageMaker Model Cards
SageMaker Model Cards are a feature of SageMaker that you can use to record information about ML models. SageMaker Model Cards include information such as training details, risk rating, evaluation metrics, model performance, considerations, and recommendations. Part of the SageMaker Model Registry, Amazon SageMaker Model Cards document critical details about your machine learning (ML) models in a single place for streamlined governance and reporting.
SageMaker Canvas
You can use SageMaker Canvas to build ML models without needing to write any code. SageMaker Canvas does not have any models that can perform content moderation of creative content types.
Amazon SageMaker Ground Truth
SageMaker Ground Truth is a service that uses a human workforce to create accurate labels for data that you can use to train models. SageMaker Ground Truth does not store information about model training and performance for audit purposes. Amazon offers a labeling service, Amazon SageMaker Ground Truth. SageMaker Ground Truth can leverage a crowdsourcing service called Amazon Mechanical Turk that provides access to a large pool of affordable labor spread across the globe.
Amazon SageMaker Model Monitor
SageMaker Model Monitor establishes an automated alert system that alerts when there are variations in the model’s quality, such as data drift and anomalies. You can use SageMaker Model Monitor to monitor deployed models for performance issues, data drift, and operational inconsistencies. You would primarily use SageMaker Model Monitor to ensure that the model’s performance remains stable over time.
SageMaker Studio
SageMaker Studio offers a suite of integrated development environments (IDEs), including JupyterLab, RStudio, and Visual Studio Code - Open Source (Code-OSS).
Guardrails for Amazon Bedrock
Amazon Bedrock Guardrails evaluates user inputs and FM responses based on use case specific policies, and provides an additional layer of safeguards (e.g. block undesirable content, detect prevent hallucinations, redact sensitive/PII, etc.)
Amazon Rekognition
Amazon Rekognition is a fully managed AI service for image and video analysis. You can use Amazon Rekognition to identify inappropriate content in images, including drawings, paintings, and animations. Amazon Rekognition can also help wth performing content moderation of the creative content types. Detect custom objects, such as brand logos, using automated machine learning (AutoML) to train your models with as few as 10 images.
Vector Database
A vector database is a collection of data that is stored as mathematical representations. Vector databases store structured and unstructured data, such as text or images with the vector embeddings. Vector embeddings are a way to convert words and sentences and other data into numbers.
Amazon DocumentDB
Amazon DocumentDB is a fully managed, native JSON document database. Amazon DocumentDB supports vector search. You can use vector search to store, index, and search millions of vectors with millisecond response times.
Amazon OpenSearch Service
OpenSearch Service is a fully managed service that you can use to deploy, scale, and operate OpenSearch on AWS. You can use OpenSearch Service vector database capabilities for many purposes. For example, you can implement semantic search, retrieval augmented generation (RAG) with large language models (LLMs), recommendation engines, and multimedia searches. OpenSearch Service can also scale to store millions of embeddings and can support high query throughput.
Amazon SageMaker Clarify
SageMaker Clarify helps identify potential bias in machine learning models and datasets without the need for extensive coding. SageMaker Clarify is a feature of SageMaker that helps you explain how a model makes predictions and whether datasets or models reflect bias. SageMaker Clarify also includes a library to evaluate FM performance. The foundation model evaluation (FMEval) library includes tools to compare FM quality and responsibility metrics, including bias and toxicity scores. FMEval can use built-in test datasets, or you can provide a test dataset that is specific to your use case.
It can detect biases in training data and model predictions. You can use SageMaker Clarify to provide insights into model decisions. Therefore, SageMaker Clarify is a suitable solution to develop responsible and fair AI systems.
SageMaker JumpStart
Amazon SageMaker JumpStart is a machine learning hub with open-source and proprietary foundation models, built-in algorithms, and prebuilt ML solutions that you can deploy with a few clicks.
SageMaker HyperPod
Reduce time to train foundation models by up to 40% and scale across more than a thousand AI accelerators efficiently. It efficiently distributes and parallelizes your training workload across many accelerators.
SageMaker Model Registry
SageMaker Model Registry is a fully managed catalog for ML models. You can use SageMaker Model Registry to manage model versions, associate metadata with models, and manage model approval status.
SageMaker Model Dashboard
Amazon SageMaker Model Dashboard is a centralized portal, accessible from the SageMaker console, where you can view, search, and explore all of the models in your account
Vector
A vector is an ordered list of numbers that represent features or attributes of some entity or concept.
In the context of generative AI, vectors might represent words, phrases, sentences, or other units.
Embeddings
Embeddings are vector representations of content that captures semantic relationships. Embeddings provide content with similar meanings to have close vector representations.
Amazon Comprehend
Amazon Comprehend uses natural language processing (NLP) to extract insights about the content of documents. It develops insights by recognizing the entities, key phrases, language, sentiments, and other common elements in a document.
Amazon Textract
You can use Amazon Textract to extract text from documents, handwritten text etc