Services and Features that make me say "huh?" Flashcards

catch up on tech I haven't encountered (54 cards)

1
Q

AWS Data Exchange

A

data set catalog.

Customers subscribe to data product, then use API to load to s3 and analyze.

Data providers easily reach customers w/o needing to build/maintain infra for data storage, delivery, billing, and entitling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

AWS Data Pipeline

A

web service to process and move data between AWS compute/storage services and on-prem sources at specified intervals.

Access, transform, process at scale, transfer from S3, RDS, DynamoDB, EMR.

Create complex data processing workloads that are fault tolerant, repeatable, and highly available.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Amazon EMR

A

Big Data platform for processing vast amounts of data using open source tools such as Apache (Spark, Hive, HBase, Flink, Hudi), and Presto.

Automates capacity provisioning and cluster tuning.

< half cost of on-prem, 3x faster than traditional Spark.

Run workloads on EC2, EKS, or on-prem with “Amazon EMR on AWS Outposts”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

AWS Lake Formation

A

Data Lake = centralized, curated, secured repo that stores all your data in original form and prepped for analysis.

Simply define where data resides, define data access and security policies.
LF:
- collects and catalogs from DBs and object storage
- moves it to S3 data lake
- cleans and classifies w/ ML algorithms
- secures access to sensitive data

Users leverage data sets with choice of analytics/ML, such as
- Amazon EMR for Apache Spark
- Amazon Redshift
- Amazon Athena
- SageMaker AI
- Amazon Quicksight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Amazon Managed Streaming for Apache Kafka (Amazon MSK)

A

Apache Kafka: open-source platform for building real-time streaming data pipelines and apps. Kafka clusters are hard to setup, scale, and manage.

MSK: creates HIghly Available cluster for you, replaces unhealthy nodes, encrypts data at rest.

Users use Kafka APIs to populate data lakes, stream changes to/from DBs, power ML/analytics apps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Amazon OpenSearch Service

A

easy to deploy, secure, operate, and scale OpenSearch to search, analyze, and visualize data in real-time.

APIs for log analytics, full-text search, app monitoring, clickstream analytics

Integrates with OpenSearch Dashboard and Logstash for data ingestion / visualization.

Integrates with VPC, KMS , Data Firehose, Lambda, IAM, Cognito, CloudWatch

Also has a Serverless option, allowing you to run petabyte-scale workloads w/o managing/scaling your own clusters.

Also a “vector engine for Amazon OpenSearch Serverless” adds simple, scalable, high-performing vector storage and search for ML-augmented search experiences and gen AI apps. Use cases: image search, doc search, music retrieval, product recs, video search, location-based search, fraud detection, anomaly detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Amazon QuickSight

A

fast, cloud-powered BI service.

Create and public interactive dashboard for browsers and mobile devices.

Embed dashboards into applications, for customer self-service analytics.

scales to 10k+ user w/o any software to install, servers to deploy, or infra to manage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Amazon Redshift

A

cloud data warehouse (w/ CLEAN data, as opposed to a data lake).

fast, simple cost-effective analysis w/ SQL or BI tools, on TB to PB of structured and semi-structured data.

  • sophisticated query optimization
  • columnar storage on high-performance storage
  • massively parallel query completion

scale from $0.25/hour to $1,000/terrabyte/year (less than 1/10th traditional on-prem solutions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Amazon AppFlow

A

managed integration service, transfers data between SaaS apps eg Salesforce, Zendesk, Slack, ServiceNow and AWS services eg S3, Redshift.

data flows at enterprise scale in frequency of: on a schedule, in response to biz event, or on demand.

does data transformation eg filtering, validation, to ready to use data

encrypts data in motion, and can restrict from public internet w/ PrivateLink

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

AWS AppSync

A

serverless backend for mobile, web, and enterprise apps. Makes it easy to build data driven apps by securely handling all app data mgmt tasks eg online/offline data access, data sync, data manipulation across multiple data sources.

Uses GraphQL api query language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

AWS Batch

A

for devs, scientist, engineers to run 100k+ compute jobs. AWS Batch provisions the optimal qty/type of instances (eg CPU or mem-optimized), based on volume and resource req’s of batch jobs submitted.

plans, schedules, runs batch computing workloads across “full range of AWS compute services and features” eg EC2 and Spot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

AWS Outposts

A

AWS servers in quarter, half-rack, or full-rack units.

two variants:
* VMware Cloud: use same VMware control plane / APIs you use on your infra
* AWS-native: use same APIs / control plane you already use in AWS Cloud

order from AWS Management Console

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

AWS Serverless Application Repository

A

free to use code samples, components, and whole apps. Only pay for AWS resources used by the apps you deploy..

each app is packages with an AWS Serverless Application Model (AWS SAM) template, defining resources used.

public apps include a link to source code

can publish your own for use with team, org, or public

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

VMware Cloud on AWS

A

jointly developed offering, for orgs to migrate and extend on-prem VMware vSphere-based envs to AWS Cloud, running on EC2 bare metal infra.

not available globally, but adds new regions each release.

seamlessly integrated AWS Cloud with VMware tech eg vSphere, vSAN, NSX, vCenter Server

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

AWS Wavelength

A

AWS infra embedded in cell towers so mobile edge computing apps get better latency/bandwidth benefits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Amazon ECS Anywhere

A

ECS on customer managed infra

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Amazon EKS Anywhere

A

EKS on customer managed infra

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Amazon EKS Distro

A

open source k8s distro, so you can run it on your own infra w/o being tied to AWS’ update schedule.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Amazon DocumentDB (with MongoDB compatibility)

A

fast, scalable, highly available fully managed Binary JSON (BSON) doc db service that supports MongoDB workloads.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Amazon Keyspaces (for Apache Cassandra)

A

Apache Cassandra is an (old) open-source, NoSQL db designed to store data for apps that require fast read/write. Can store user profile information for online games, device metadata for IoT apps, or records for events.

AK for AC is a scalable, highly available, managed Apache Cassandra compatible db service.

serverless, automatically scale tables up and down in response to app traffic, serve 1k+ requests / second

originally dev’d by facebook in 2008

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Amazon Neptune

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Amazon Quantum Ledger Database (Amazon QLDB)

23
Q

Amazon QLDB

A

Amazon Quantam Ledger Database

24
Q

AWS Amplify

25
AWS Device Farm
26
Amazon Pinpoint
27
Amazon Comprehend
28
Amazon Forecast
29
Amazon Fraud Detector
30
Amazon Kendra
31
Amazon Lex
32
Amazon SageMaker
33
Amazon Textract
34
AWS Computer Optimizer
35
AWS Control Tower
36
AWS License Manager
37
AWS Proton
38
Amazon Elastic Transcoder
39
Amazon Kinesis Video Streams
40
AWS Application Discovery Service
41
AWS Application Migration Service
42
AWS DataSync
43
AWS Migration Hub
44
AWS Artifact
45
AWS Audit Manager
46
AWS CloudHSM
47
Amazon Detective
48
AWS Firewall Manager
49
AWS Network Firewall
50
Amazon Macie
51
AWS Resource Access Manager (AWS RAM)
52
AWS Secrets Manager
53
Amazon FSx (for all types)
54
AWS Storage Gateway