Terms Flashcards by Daniel Comer

Machine Learning (ML)

Enabling computers to learn on their own using data.

How well did you know this?

Not at all

Perfectly

Artificial Intelligence (AI)

An overarching term for enabling computers to mimic human intelligence.

How well did you know this?

Not at all

Perfectly

Deep Learning (DL)

Using neural networks and deep layers to learn without intervention.

How well did you know this?

Not at all

Perfectly

Generative AI (Gen-AI)

Generating new content to expand on the input data (Training data).

How well did you know this?

Not at all

Perfectly

Supervised Learning

The Model is trained on labeled data.

Use cases:
Image classification
spam detection

How well did you know this?

Not at all

Perfectly

Unsupervised Learning

The Model is trained on unlabeled data, tries to find patterns on its own.

Use cases:
Customer segmentation
Anomaly detection

How well did you know this?

Not at all

Perfectly

Self-Supervised Learning

The model is trained on unlabeled data, it creates its own labels to predict and infer missing information.

Use cases:
Natural language processing (NLP) models

How well did you know this?

Not at all

Perfectly

Semi-Supervised Learning

The model is trained on a mix of data. From a small amount of partially labeled and a large amount of unlabeled data.

Use Cases:
Speech recognition with some transcribed audio

How well did you know this?

Not at all

Perfectly

Reinforcement learning

The model learns through interacting with the environment. Receiving feedback in the form of rewards or penalties.

Use Cases:
Recommendation systems
Self-driving cars

How well did you know this?

Not at all

Perfectly

Model Fit Patterns:
Underfitting & Overfitting

Underfitting - A model doesn’t learn enough from the training data, so it performs poorly on the training set.

Overfitting - A model learns too much detail from the training data, but performs poorly on new, unseen data.

Balanced - The model learns the right amount from the training data. It can generalize and perform well on new data.

How well did you know this?

Not at all

Perfectly

Foundation Models

Foundation models are large models that can be adapted to perform a variety of tasks across various types of data.

How well did you know this?

Not at all

Perfectly

Fine-tuning a model

You can adapt foundation models through further training using smaller task-specific datasets via fine-tuning.

How well did you know this?

Not at all

Perfectly

Large language models (LLMs)

LLMs are a subset of foundation models that can understand and generate human language.

How well did you know this?

Not at all

Perfectly

Machine Learning Process Stages

Generating data
Training the model
Deploying the model

How well did you know this?

Not at all

Perfectly

Exploratory Data Analysis (EDA)

The process of examining and understanding a dataset before diving into modeling.

How well did you know this?

Not at all

Perfectly

Correlation Matrix

Study These Flashcards

A correlation matrix allows you to quantify relationships between variables.

Feature Engineering

Study These Flashcards

The process of transforming the raw data into meaningful features.

Hyperparameters

Study These Flashcards

Definition: Settings that you choose before training a model.

Purpose: Control the learning process and model behavior.

Examples: Learning rate, batch size, number of epochs.

Parameters

Study These Flashcards

Definition: Values that the model learns during training.

Purpose: Directly impact model predictions.

Examples: Weights and biases in a neural network.

Amazon Rekognition

Study These Flashcards

Analyses images and videos to identify text, objects and people.

Can be enhanced by using Content Moderation and Content Labels.

Use cases:
Content moderation to identify harmful or offensive images
Identify verification
Identify objects and text in images

Textract

Study These Flashcards

Uses machine learning as to extract information from any kind of document

Can handle printed or handwritten using Optical Character Recognition (OCR)

Use cases:
Automated ID Processing
Analyzing Invoices

Translate

Study These Flashcards

Uses machine learning as a language translation service

Quickly translates large volumes of HTML or text content

Supports 70+ languages

Can customize it to recognize your own brand names, products, and terminology

Amazon Polly

Study These Flashcards

Generates realistic, natural sounding speech, from text that you provide.

You can provide the text in a variety of languages
The resulting audio can be streamed, saved, or downloaded
Can be used to add natural sounding speech to your applications

Supports a variety of languages and voices

Transcribe

Study These Flashcards

Uses machine learning as a speech to text service

Can handle streamed audio or audio files

use case:
subtitles or meeting notes

Amazon Lex

Lex allows you to build conversational interfaces in your applications using natural language models. Chatbot Seamlessly integrates with AWS Lambda for executing logic Multi-Platform Compatibility, works with mobile devices, web applications, and chat services like Facebook Messenger Speech or Text input Natural language, understanding, understands user intent to deliver a natural conversational experience. Use cases: Virtual Agent and voice assistants Automate FAQs

Amazon Forecast

Allows you to generate accurate forecasts by identifying historical patterns.

Amazon Kendra

An intelligent search service uses natural language processing to query your data customized search to help find answers such as from customer queries data sources can be many things such as S3, RDS, SQL Server, Websites, Google Drive, GitHub, etc Data types can be unstructured and semi-structured and file types such as HTML, XML, PDF, Microsoft Office. Examples: (Simple Fact-based questions) When is the deadline for completing the compliance training? (Descriptive questions) How do I register for the AWS Certified Cloud Practitioner exam?

Amazon Personalize

Allows you to create personalized recommendations for users by analyzing user behaviors, preferences, and trends.w

Amazon SageMaker

Fully managed Machine Learning Platform Imports your data Helps you prepare your data Build your models or use built in ones Train your model Deploy your model use cases: Recommendation engine identify fraudulent transactions Predict Insurance claims Virtual customer service assistant

SageMaker Data Wrangler

Allows you to clean, prepare, or transform your data for use with SageMaker. A tool that simplifies data preparation and cleaning for ML tasks.

SageMaker Feature Store

A centralized repository for storing and managing machine learning features.

SageMaker Studio

An integrated development environment for machine learning, making it easier to manage the entire ML workflow.

SageMaker Pipelines

Automates and orchestrates ML workflows within the AWS SageMaker ecosystem.

SageMaker Auto ML

A tool that automates the process of model selection and hyperparameter tuning.

SageMaker Deployments - Real-time (Synchronous Inference)

Latency: Low (sub-second to few seconds) Use Case: Chat bots

SageMaker Deployments - Asynchronous Inference

Latency: Moderate to High Use Case: Image and video analysis

SageMaker Deployments - Batch Transform (Batch Inference)

Latency: High (minutes to hours) Use Case: Processing large datasets

SageMaker Deployments - Serverless

Latency: Low to Moderate Use Case: Scalable applications with variable workloads

Terms Flashcards

(39 cards)