Decoupling Workflows Flashcards
What is SQS?
Simple Queue Service (SQS) is a fully managed message queuing service that enables you to decouple and scale microservices, distributed systems, and serverless applications.
- Allows asynchronous processing of work. One resource will write a message to an SQS queue, and then another resource will retrieve that message from SQS.
- At-least-once delivery for each message in the queue.
- Supports resource policies.
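The at-least-once behavior above is worth internalizing: a received message is only hidden for a visibility timeout, and it reappears unless the consumer deletes it. A toy in-memory sketch of those semantics (this is an illustration of the concept, not the real SQS API — real code would use boto3's `send_message`/`receive_message`/`delete_message`):

```python
import time

class MiniQueue:
    """Toy in-memory queue mimicking SQS at-least-once semantics:
    a received message is hidden for `visibility_timeout` seconds and
    is redelivered unless the consumer deletes it in time."""

    def __init__(self, visibility_timeout=30.0):
        self.visibility_timeout = visibility_timeout
        self._messages = {}  # receipt handle -> (body, visible_at)
        self._next = 0

    def send_message(self, body):
        handle = str(self._next)
        self._next += 1
        self._messages[handle] = (body, 0.0)  # visible immediately
        return handle

    def receive_message(self, now=None):
        """Return (handle, body) of a visible message, hiding it for the
        visibility timeout; None if nothing is visible."""
        now = time.monotonic() if now is None else now
        for handle, (body, visible_at) in self._messages.items():
            if now >= visible_at:
                self._messages[handle] = (body, now + self.visibility_timeout)
                return handle, body
        return None

    def delete_message(self, handle):
        """Consumers must explicitly delete, or the message comes back."""
        self._messages.pop(handle, None)
```

Note how a consumer that crashes before deleting simply lets the message reappear for another consumer — that redelivery is exactly why processing must be idempotent.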
What is SNS?
Simple Notification Service (SNS) is a fully managed messaging service for
both application-to-application (A2A) and application-to-person (A2P) communication.
- SNS is a push-based messaging service: it proactively delivers messages to the endpoints subscribed to it. This can be used to alert a system or a person.
- Delivery retries and reliable delivery.
- Cross-account access is granted via a TOPIC POLICY.
- Supports cross region replication.
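The push-based fan-out above can be sketched in a few lines: a topic holds a list of subscriber endpoints and proactively delivers each published message to all of them (a conceptual toy, not the SNS API — real code would call boto3's `publish` on a topic ARN):

```python
class MiniTopic:
    """Toy pub/sub topic: publishing pushes the message to every
    subscriber, mirroring SNS fan-out to SQS, Lambda, email, etc."""

    def __init__(self):
        self.subscribers = []  # callables standing in for endpoints

    def subscribe(self, deliver):
        self.subscribers.append(deliver)

    def publish(self, message):
        # Push-based: the topic delivers; subscribers never poll.
        for deliver in self.subscribers:
            deliver(message)
```

A common pattern is subscribing several SQS queues to one topic ("fan-out"), so each queue's consumers get their own copy of every message.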
What is API Gateway?
API Gateway is a fully managed service that makes it easy for developers to
create, publish, maintain, monitor, and secure APIs at any scale.
What are the SQS Settings?
- Delivery Delay: Default is 0; can be set up to 15 minutes.
- Message Size: Messages can be up to 256 KB of text in any format.
- Encryption: Messages are encrypted in transit by default, but you can add at-rest.
- Message Retention: Default is 4 days; can be set between 1 minute and 14 days.
- Long vs. Short: Long polling isn’t the default, but it should be.
- Queue Depth: This can be a trigger for autoscaling.
What is the difference between long and short polling in SQS?
- Short polling returns a response immediately, even if the queue is empty. This means that if there are no messages in the queue, the consumer will receive an empty response and will need to poll again.
- Long polling waits (up to a maximum of 20 seconds) for a message to arrive in the queue before returning a response. The consumer avoids empty responses unless the wait times out, which reduces the number of API calls.
With batching, 1 request = 1-10 messages; for billing, each 64 KB chunk of a payload counts as one request.
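The polling choice comes down to one parameter on the ReceiveMessage call. A small sketch of the request parameters you would pass (parameter names match the real SQS API; the queue URL is a placeholder):

```python
def receive_params(queue_url, long_poll=True):
    """Build ReceiveMessage parameters. WaitTimeSeconds > 0 enables
    long polling; 20 is the maximum. 0 means short polling, which
    returns immediately even when the queue is empty."""
    return {
        "QueueUrl": queue_url,
        "MaxNumberOfMessages": 10,  # batching: up to 10 messages per request
        "WaitTimeSeconds": 20 if long_poll else 0,
    }
```

Long polling can also be set queue-wide via the `ReceiveMessageWaitTimeSeconds` queue attribute instead of per call.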
What is a Dead Letter Queue?
- A dead letter queue (DLQ) is used to hold messages that were not successfully processed. These messages might have failed due to errors, invalid data, or other issues. The purpose of a dead letter queue is to provide a way to review and troubleshoot these problematic messages.
- When ReceiveCount > maxReceiveCount and the message has not been deleted, it is moved to a DLQ.
- The retention period of the DLQ should be longer than that of the source queue, because the enqueue timestamp is unchanged when a message enters a DLQ (it keeps the old timestamp).
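The DLQ hookup is configured on the *source* queue as a RedrivePolicy attribute. A minimal sketch building that attribute value (field names match the real SQS redrive policy; the ARN is a placeholder):

```python
import json

def redrive_policy(dlq_arn, max_receive_count=5):
    """Build the JSON string for the SQS RedrivePolicy queue attribute:
    after a message has been received max_receive_count times without
    being deleted, SQS moves it to the dead letter queue."""
    return json.dumps({
        "deadLetterTargetArn": dlq_arn,
        "maxReceiveCount": max_receive_count,
    })
```

You would pass this string as the `RedrivePolicy` entry in the source queue's attributes; the DLQ itself is just an ordinary queue.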
What are FIFO Queues?
- FIFO queues do not match the throughput of standard queues (roughly 300 messages per second per API action, or 3,000 with batching, without high-throughput mode).
- You can order messages with SQS standard, but it’s on you to do it.
- A message group ID ensures messages within the same group are processed in order, one at a time.
- It costs more since AWS must spend computing power to deduplicate messages.
- Exactly-once processing for each message in the queue.
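Sending to a FIFO queue requires a MessageGroupId and a deduplication ID. A sketch of the message parameters (names match the real SendMessage API for FIFO queues; hashing the body here mimics content-based deduplication):

```python
import hashlib

def fifo_message(body, group_id, dedup_id=None):
    """Build SendMessage parameters for a FIFO queue. Messages sharing
    a MessageGroupId are delivered in order, one at a time. The
    deduplication ID (here, a content hash) lets SQS drop duplicate
    sends within its 5-minute deduplication window."""
    return {
        "MessageBody": body,
        "MessageGroupId": group_id,
        "MessageDeduplicationId": dedup_id
        or hashlib.sha256(body.encode()).hexdigest(),
    }
```

Two identical sends produce the same deduplication ID, so the second is discarded — this is the extra computing work the card above says you pay for.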
What topic types are supported in SNS?
FIFO or Standard:
- FIFO topics only support SQS queues as subscribers.
- Standard topics support: Kinesis Data Firehose, SQS, Lambda, email, HTTP(S), SMS, and platform application endpoints.
- There is also DLQ Support.
What are API Gateway's features?
- Security: This service allows you to easily protect your endpoints by attaching a web application firewall (WAF).
- Stop Abuse: Users can easily implement DDoS protection and rate limiting to curb abuse of their endpoints.
- Ease of Use: API Gateway is simple to get started with. Easily build out the calls that will kick off other AWS services in your account.
What are the components of AWS Batch?
- Jobs: Units of work that are submitted to AWS Batch (e.g., shell scripts, executables, and Docker images).
- Job Definitions: Specify how your jobs are to be run (essentially, the blueprint for the resources in the job).
- Job Queues: Jobs get submitted to specific queues and reside there until scheduled to run in a compute environment.
- Compute Environments: Sets of managed or unmanaged compute resources used to run your jobs.
Fargate or EC2 Compute Environments for AWS Batch?
Fargate is the recommended way of launching most batch jobs.
Sometimes, EC2 is the best choice!
- Custom AMIs can only be run via EC2.
- Anything needing more than four vCPUs needs to use EC2.
- EC2 is recommended for anything needing more than 30 GiB of memory.
- If your jobs require a GPU, they must run on EC2!
- Arm-based Graviton CPUs can only be leveraged via EC2 for AWS Batch.
- When using linuxParameters, you must run on EC2 compute.
- For a large number of jobs, it's best to run on EC2. Jobs are dispatched at a higher rate (more concurrency) than on Fargate!
AWS Batch or AWS Lambda?
- AWS Lambda currently has a 15-minute execution time limit. Batch does not have this.
- AWS Lambda has limited disk space, and EFS requires functions live within a VPC.
- Lambda is fully serverless, but it has natively limited runtimes! Batch uses Docker, so any runtime can be used.
What is Amazon MQ?
- Message broker service allowing easier migration of existing applications to the AWS Cloud.
- Leverages multiple programming languages, operating systems, and messaging protocols.
- Currently supports both Apache ActiveMQ and RabbitMQ engine types.
- Allows you to easily leverage existing apps without managing and maintaining your own system.
SNS with SQS vs. Amazon MQ
- Each offers architectures with topics and queues, allowing for one-to-one or one-to-many messaging designs.
- If migrating existing applications with messaging systems in place, you likely want to consider Amazon MQ.
- If creating new applications, look at SNS and SQS: simpler to use, highly scalable, and with simple APIs. A good fit for most new use cases!
- Amazon MQ REQUIRES private networking like VPC, Direct Connect, or VPN. SNS and SQS are publicly accessible by default.
What are Step Functions?
- Comes with a graphical console for easier application workflow views and flows.
- Main components are state machines and tasks.
- Specific states within a workflow (state machine) representing a single unit of work
- Every single step within a workflow is considered a state
- Standard workflows (the default) have a maximum duration of 1 year. Express workflows are for high-event-rate workloads, with a 5-minute maximum duration.
What are the 2 types of workflow that AWS Step Functions support?
Each workflow has executions.
Executions are instances where you run your workflows in order to perform your tasks.
STANDARD
- Have an exactly-once execution
- Can run for up to one year
- Useful for long-running workflows that need to have an auditable history
- Rates up to 2,000 executions per second
- Pricing based per state transition
EXPRESS
- At-least-once workflow execution
- Can run for up to five minutes
- Useful for high-event-rate workloads. Example uses are IoT data streaming and ingestion
- Pricing based on number of executions, duration, and memory consumed
- Think about an online pickup order: each step in that workflow is considered a state.
What are the different states of step functions?
- Pass: Passes any input directly to its output — no work done
- Task: Single unit of work performed (e.g., Lambda, Batch, and SNS)
- Choice: Adds branching logic to state machines
- Wait: Creates a specified time delay within the state machine
- Succeed: Stops executions successfully
- Fail: Stops executions and marks them as failures
- Parallel: Runs parallel branches of executions within state machines
- Map: Runs a set of steps based on elements of an input array
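The states above compose into an Amazon States Language definition. A minimal sketch as a Python dict using Choice, Pass, Succeed, and Fail states (the state names and fields here are illustrative; a real workflow would use a Task state pointing at an actual Lambda ARN):

```python
import json

# Toy pickup-order workflow: branch on whether the order is paid,
# then either prepare it (Pass stands in for a real Task) or fail.
definition = {
    "StartAt": "CheckOrder",
    "States": {
        "CheckOrder": {
            "Type": "Choice",
            "Choices": [
                {"Variable": "$.paid", "BooleanEquals": True,
                 "Next": "PrepareOrder"}
            ],
            "Default": "OrderFailed",
        },
        "PrepareOrder": {
            "Type": "Pass",           # placeholder for a Task state
            "Result": {"status": "ready"},
            "Next": "Done",
        },
        "Done": {"Type": "Succeed"},  # stops execution successfully
        "OrderFailed": {"Type": "Fail", "Error": "PaymentError"},
    },
}

# The JSON form is what you'd pass to CreateStateMachine.
asl_json = json.dumps(definition)
```

Each key under `"States"` is one state; `"Next"` wires them into the workflow graph the console renders graphically.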
What Is AppFlow?
- Fully managed integration service for exchanging data between SaaS apps and AWS services
- Pulls data records from third-party SaaS vendors and stores them in Amazon S3
- Bi-directional data transfers with limited combinations
- Can run on demand, on event, or on a schedule.
What Is Redshift?
Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. It’s a very large relational database traditionally used in big data applications.
- It can hold up to 16 PB of data.
What is EMR?
EMR (Elastic Map Reduce) is a managed big data platform that allows you to process vast amounts of data using open-source tools, such as Spark, Hive, HBase, Flink, Hudi, and Presto.
- It is AWS’s ETL tool.
- It’s an Open-Source Cluster. EMR is a managed fleet of EC2 instances running open-source tools.
What is ETL?
Extract, Transform, Load: pull data from a source, reshape it, and load it into a destination store.
What Is Kinesis Data Streams?
Kinesis Data Streams allows you to ingest, process, and analyze real-time streaming data (records are available within ~200 ms). You can think of it as a huge data highway connected to your AWS account. Great for analytics and dashboards.
- Streams store a 24-hour moving window of data that can be increased to a maximum of 365 days at an additional cost.
- Supports multiple producers and consumers (you must configure the consumers). Consumers can access the data in different ways from the moving window (per second, per hour, etc.)
- To improve the performance of a Kinesis stream, the number of shards needs to be changed (resharding).
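Shards matter because Kinesis routes each record to a shard by MD5-hashing its partition key into the 128-bit key space, which is split across shards. A simplified sketch of that routing (an even key-space split is assumed; real streams can have uneven shard ranges after resharding):

```python
import hashlib

def shard_for_key(partition_key, num_shards):
    """Map a partition key to a shard index the way Kinesis does
    conceptually: MD5-hash the key into the 128-bit space, then see
    which shard's hash-key range it lands in (even split assumed)."""
    h = int(hashlib.md5(partition_key.encode()).hexdigest(), 16)
    shard_size = 2 ** 128 // num_shards
    return min(h // shard_size, num_shards - 1)
```

The same key always lands on the same shard (preserving per-key ordering), which is why adding shards — resharding — is how you scale a hot stream.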
What Is Kinesis Data Firehose?
- Data transfer tool to get Kinesis Data Streams data into S3 (or data directly from producers), into Redshift (uses S3 as an intermediate), Elasticsearch, or Splunk (or HTTP endpoints, meaning third-party applications).
- Offers persistence beyond the moving window of Data Streams.
- Always near real-time (within 60 seconds), even when the producers are directly connected to Firehose (and not to a stream).
- Supports transformation of the data on the fly (via Lambda).
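That on-the-fly transformation is a Lambda function that receives base64-encoded records and must return each one with a `recordId`, a `result`, and re-encoded `data`. A sketch of the per-record logic (the uppercase transform is just an example; the three-field record shape matches the real Firehose transformation contract):

```python
import base64

def transform_record(record):
    """Transform one Firehose record: decode the payload, modify it
    (here: uppercase it), and return it in the shape Firehose expects —
    same recordId, a result of "Ok", and base64-encoded data."""
    payload = base64.b64decode(record["data"]).decode("utf-8")
    transformed = payload.upper()  # stand-in for real enrichment logic
    return {
        "recordId": record["recordId"],
        "result": "Ok",  # or "Dropped" / "ProcessingFailed"
        "data": base64.b64encode(transformed.encode("utf-8")).decode("ascii"),
    }
```

A real handler would map this function over `event["records"]` and return `{"records": [...]}`.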
What Is Athena?
Athena is an interactive query service that makes it easy to analyze data in S3 using SQL. This allows you to directly query data in your S3 bucket without loading it into a database (schema on read).
- It supports all AWS logs (CloudTrail, VPC Flow Logs, ELB logs, cost reports, etc.)
- AWS Glue Data Catalog & web server logs
- Athena Federated Query also supports sources other than S3 (a newer feature that uses Lambda)
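Running a query boils down to one StartQueryExecution call. A sketch of the parameters you would pass (parameter names match the real Athena API; the database, table, and bucket names are placeholders):

```python
def athena_query_params(database, output_s3):
    """Build StartQueryExecution parameters: the SQL runs directly
    against files in S3 (schema on read — no load step), and results
    are written to the given S3 output location."""
    sql = "SELECT status, COUNT(*) AS hits FROM access_logs GROUP BY status"
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_s3},
    }
```

With boto3 you would pass this dict to the Athena client's `start_query_execution`, then poll `get_query_execution` until the query finishes.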