What is ...? Flashcards
(120 cards)
AWS Glue
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL. AWS Glue generates the code to execute your data transformations and data loading processes.
AWS Fargate
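serverless compute engine for containers that works with Amazon ECS and Amazon EKS. You define the containers and their CPU and memory, and Fargate runs them without you provisioning or managing EC2 instances. It does not build container images.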
ECS and its equivalent
highly scalable, high performance container management service that supports Docker containers and allows you to easily run applications on a managed cluster of Amazon EC2 instances. Amazon ECS makes it easy to use containers as a building block for your applications by eliminating the need for you to install, operate, and scale your own cluster management infrastructure. Amazon ECS lets you schedule long-running applications, services, and batch processes using Docker containers. Amazon ECS maintains application availability and allows you to scale your containers up or down to meet your application’s capacity requirements.
EKS and its equivalent
Amazon Elastic Kubernetes Service: managed Kubernetes on AWS
Athena
an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Serverless. A common use is analyzing log data stored in S3.
FSx for Lustre
high-performance file system for compute-intensive workloads. Not a fit for Windows-based applications; use FSx for Windows File Server for those. Can be linked to S3 to read and store data there.
FSx for Windows File Server
- centralized storage for Windows-based applications over SMB: SharePoint, SQL Server, WorkSpaces, IIS web server, etc.
- on-premises access needs a VPN or Direct Connect
DataSync
provides a fast way to move large amounts of data online between on-premises storage and Amazon S3 or Amazon Elastic File System (Amazon EFS).
Typical use case: migrations in which the on-premises storage will no longer be used.
SQS long polling
Long polling helps reduce your cost of using Amazon SQS by reducing the number of empty responses when there are no messages available to return in reply to a ReceiveMessage request sent to an Amazon SQS queue and eliminating false empty responses when messages are available in the queue but aren’t included in the response.
- Long polling reduces the number of empty responses by allowing Amazon SQS to wait until a message is available in the queue before sending a response. Unless the connection times out, the response to the ReceiveMessage request contains at least one of the available messages, up to the maximum number of messages specified in the ReceiveMessage action.
- Long polling eliminates false empty responses by querying all (rather than a limited number) of the servers. Long polling returns messages as soon as any message becomes available.
TL;DR: short polling returns a response immediately; long polling doesn't return a response until a message arrives in the queue or the long poll times out.
SQS short polling
ReceiveMessageWaitTimeSeconds is the queue attribute that determines whether a queue uses short or long polling. Its default value is 0, which means short polling; any value from 1 to 20 seconds turns on long polling.
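The rule on the two polling cards can be sketched in a few lines of plain Python. This is only a model, not an SQS client: the function names are made up for illustration. It also assumes the documented behavior that a `WaitTimeSeconds` value passed on an individual ReceiveMessage request takes precedence over the queue attribute.

```python
from typing import Optional

MAX_WAIT_SECONDS = 20  # largest wait time SQS accepts for either setting


def effective_wait_seconds(queue_wait: int = 0,
                           request_wait: Optional[int] = None) -> int:
    """Wait time that applies to one ReceiveMessage call (hypothetical helper).

    queue_wait   models the queue attribute ReceiveMessageWaitTimeSeconds.
    request_wait models the per-request WaitTimeSeconds parameter, if given.
    """
    # A value set on the request wins over the queue-level default.
    wait = queue_wait if request_wait is None else request_wait
    if not 0 <= wait <= MAX_WAIT_SECONDS:
        raise ValueError(f"wait time must be 0..{MAX_WAIT_SECONDS} seconds")
    return wait


def polling_mode(queue_wait: int = 0,
                 request_wait: Optional[int] = None) -> str:
    """Return "short" when the effective wait is 0, otherwise "long"."""
    return "long" if effective_wait_seconds(queue_wait, request_wait) > 0 else "short"
```

With the defaults the result is short polling; setting either value above zero switches the call to long polling.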
ParallelCluster
an AWS-supported open-source cluster management tool that makes it easy for you to deploy and manage High Performance Computing (HPC) clusters on AWS. It does not provide higher bandwidth, higher packet per second (PPS) performance, and lower inter-instance latencies, unlike ENA or EFA.
Elastic Fabric Adapter (EFA)
simply an Elastic Network Adapter (ENA) with added capabilities. It provides all of the functionality of an ENA, with additional OS-bypass functionality. OS-bypass is an access model that allows HPC and machine learning applications to communicate directly with the network interface hardware to provide low-latency, reliable transport functionality.
The OS-bypass capabilities of EFAs are not supported on Windows instances. If you attach an EFA to a Windows instance, the instance functions as an Elastic Network Adapter, without the added EFA capabilities.
Elastic Network Adapter (ENA)
supports network speeds from 10Gbps up to 100Gbps for supported instance types. Elastic Network Adapters (ENAs) provide traditional IP networking features that are required to support VPC networking.
step scaling
Increase or decrease the current capacity of the group based on a set of scaling adjustments, known as step adjustments, that vary based on the size of the alarm breach.
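A small model of how a step adjustment is chosen from the size of the alarm breach. It is a sketch under assumptions: the class and function names are hypothetical, only the scale-out case (metric above the threshold) is modeled, and the adjustment is treated as a change in instance count.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class StepAdjustment:
    """One step, with bounds expressed as offsets from the alarm threshold."""
    lower: Optional[float]  # inclusive lower bound; None means unbounded
    upper: Optional[float]  # exclusive upper bound; None means unbounded
    adjustment: int         # instances to add when this step matches


def pick_adjustment(metric: float, threshold: float,
                    steps: Sequence[StepAdjustment]) -> int:
    """Return the adjustment of the step whose range contains the breach size."""
    breach = metric - threshold
    for step in steps:
        above_lower = step.lower is None or breach >= step.lower
        below_upper = step.upper is None or breach < step.upper
        if above_lower and below_upper:
            return step.adjustment
    return 0  # no step matches, e.g. the metric is below the threshold


# Example policy: the larger the breach, the more capacity is added.
SCALE_OUT_STEPS = [
    StepAdjustment(lower=0, upper=10, adjustment=1),
    StepAdjustment(lower=10, upper=20, adjustment=2),
    StepAdjustment(lower=20, upper=None, adjustment=3),
]
```

For a CPU alarm with a threshold of 50, a reading of 55 falls in the first step and a reading of 95 in the last.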
cheapest S3 tier
S3 Glacier Deep Archive
glacier deep archive retrieval time
within 12 hours (standard retrieval)
s3 encryption in transit
SSL/TLS
S3 Encryption at rest
- S3-managed keys: SSE-S3
- AWS Key Management Service (KMS) managed keys: SSE-KMS
- server-side encryption with customer-provided keys: SSE-C
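The three options differ in the request headers sent with an upload. The sketch below only builds those headers and makes no request; `sse_headers` and its parameters are hypothetical names, and the key used with SSE-C is assumed to be a 256-bit key supplied by the caller.

```python
import base64
import hashlib
from typing import Dict, Optional


def sse_headers(mode: str,
                kms_key_id: Optional[str] = None,
                customer_key: Optional[bytes] = None) -> Dict[str, str]:
    """Build the S3 request headers for one server-side encryption mode."""
    if mode == "SSE-S3":
        # S3 creates and manages the keys itself.
        return {"x-amz-server-side-encryption": "AES256"}
    if mode == "SSE-KMS":
        headers = {"x-amz-server-side-encryption": "aws:kms"}
        if kms_key_id is not None:
            # Optional: name a specific KMS key instead of the default one.
            headers["x-amz-server-side-encryption-aws-kms-key-id"] = kms_key_id
        return headers
    if mode == "SSE-C":
        if customer_key is None or len(customer_key) != 32:
            raise ValueError("SSE-C needs a 256-bit (32-byte) key")
        # The key and its MD5 digest are both sent base64-encoded.
        key_b64 = base64.b64encode(customer_key).decode("ascii")
        md5_b64 = base64.b64encode(hashlib.md5(customer_key).digest()).decode("ascii")
        return {
            "x-amz-server-side-encryption-customer-algorithm": "AES256",
            "x-amz-server-side-encryption-customer-key": key_b64,
            "x-amz-server-side-encryption-customer-key-MD5": md5_b64,
        }
    raise ValueError(f"unknown mode: {mode}")
```

With SSE-C the same key headers have to accompany every later read of the object, because S3 does not store the key.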
glacier retrieval time
minutes to hours
least durable S3
S3 One Zone-IA: data is kept in a single Availability Zone, so it is lost if that AZ is destroyed
Service control policies (SCP)
AWS Organizations policies that enable or disable AWS services, applied either to an OU or to individual accounts
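As an illustration, an SCP that disables services is an IAM-style policy document with Deny statements and no Principal element. The helper below is a hypothetical sketch that only builds such a document as JSON. It does not call AWS, and the service prefixes in the usage note are examples.

```python
import json
from typing import Iterable


def deny_services_scp(service_prefixes: Iterable[str]) -> str:
    """Return an SCP document (JSON text) denying every action of the given services."""
    policy = {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "DenyServices",
                "Effect": "Deny",
                # "dynamodb" becomes "dynamodb:*", i.e. all DynamoDB actions.
                "Action": [f"{prefix}:*" for prefix in sorted(service_prefixes)],
                "Resource": "*",
            }
        ],
    }
    return json.dumps(policy, indent=2)
```

Calling `deny_services_scp(["dynamodb", "redshift"])` yields a document that could be attached to an OU or to a single account.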
ways to share s3 buckets
- bucket policies & IAM (entire bucket); programmatic access only
- bucket ACLs & IAM (individual objects); programmatic access only
- cross-account IAM roles; programmatic and console access
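For the bucket-policy method, the policy names the other account as principal and applies to the whole bucket. A minimal sketch, assuming read-only sharing; the function name, bucket name and account ID are placeholders, and the code only builds the policy document.

```python
import json


def cross_account_read_policy(bucket: str, account_id: str) -> str:
    """Return a bucket policy (JSON text) letting another account list and read the bucket."""
    policy = {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "CrossAccountRead",
                "Effect": "Allow",
                # The other account as a whole; its own IAM policies decide
                # which of its users and roles may actually use this grant.
                "Principal": {"AWS": f"arn:aws:iam::{account_id}:root"},
                "Action": ["s3:GetObject", "s3:ListBucket"],
                "Resource": [
                    f"arn:aws:s3:::{bucket}",    # the bucket, for ListBucket
                    f"arn:aws:s3:::{bucket}/*",  # its objects, for GetObject
                ],
            }
        ],
    }
    return json.dumps(policy, indent=2)
```

For example, `cross_account_read_policy("example-bucket", "111122223333")` produces a policy for a placeholder account.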
cloudfront origin
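origin of all files the CDN will distribute. Can be an S3 bucket, an EC2 instance, an Elastic Load Balancer, or another HTTP server reachable by domain name (for example, one resolved through Route 53)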
cloudfront distribution
name given to the CDN, which consists of a collection of edge locations