Kinesis overview and Streams Flashcards Preview

AWS Data Analytics Cert > Kinesis overview and Streams > Flashcards

Flashcards in Kinesis overview and Streams Deck (28)
Loading flashcards...
1
Q

What are the 3 types of Data Collection?

A
  • Real-Time (Immediate Actions)
  • Near Real-Time (Reactive Actions)
  • Batch (Historical Analysis)
2
Q

What are 3 examples of Real-Time data collection?

A
  • Kinesis Data Streams (KDS)
  • Simple Queue Service (SQS)
  • Internet of Things (IoT)
3
Q

What are 2 examples of Near Real-Time Data Collection?

A
  • Kinesis Data Firehose (KDF)

- Database Migration Service (DMS)

4
Q

What are 2 examples of Batch Data Collection?

A
  • Snowball

- Data Pipeline

5
Q

What are the 3 Kinesis Services?

A
  • Kinesis Streams
  • Kinesis Analytics
  • Kinesis Firehose
6
Q

What does Kinesis Streams allow you to do?

A

Low latency streaming ingest at scale

7
Q

What does Kinesis Analytics allow you to do?

A

Perform real-time analytics on streams using SQL

8
Q

What does Kinesis Firehose allow you to do?

A

Load streams into S3, Redshift, ElasticSearch & Splunk

9
Q

How do Kinesis Streams work?

A
  • Producers update data in shards/partitions

- Consumers read from partitions

10
Q

What is the data retention in Kinesis Streams?

A
  • Default is 24 hours

- Can go up to 7 days

11
Q

How many applications can consume the same stream?

A

Multiple applications can consume the same stream

12
Q

Can an application update data in a Kinesis stream?

A

No, the data is immutable. It is append-only and will remain until the data retention period has been reached

13
Q

How are records ordered?

A

They are ordered per shard

14
Q

What is produced to a shard?

A

Records and Record Keys are produced to shards

15
Q

What is a record made up of?

A

A record is made up of a Data Blob

16
Q

What is a Data Blob?

A

It is the data being sent in a stream, serialized as bytes up to 1MB

17
Q

What is a Record Key?

A

It helps to group records in shards.

- Same key = dame shard

18
Q

How do you avoid a “hot partition” problem?

A

Use a highly distributed record key

19
Q

What is a Sequence Number?

A

It is the unique id given to the record by Kinesis

20
Q

How many MBs per second can a producer write per shard?

A

1MB per second

21
Q

How many messages per second can a producer write per shard?

A

1000 messages per second

22
Q

What happens if I go over the 1MB/s limit or the 1000 messages/s limit?

A

I get the “ProvisionedThroughoutException”

23
Q

What is the read data limit per shard for Classic Consumers?

A

2MBs per second across all consumers

24
Q

What is the API call limit per shard for Classic Consumers?

A

5 API calls per second per shard across all consumers

25
Q

What is the read data limit per shard for Enhanced Fan-Out Consumers?

A

2MBs per second, per shard, per Enhanced Consumer

26
Q

What is the API call limit per shard for Enhanced Fan-Out Consumers?

A

It is a push model, no API calls needed

27
Q

What is the default Data Retention for Kinesis Streams?

A

24 hours

28
Q

What can the Data Retention for Kinesis Streams be extended to?

A

7 days