Kinesis overview and Streams Flashcards

1
Q

What are the 3 types of Data Collection?

A
  • Real-Time (Immediate Actions)
  • Near Real-Time (Reactive Actions)
  • Batch (Historical Analysis)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are 3 examples of Real-Time data collection?

A
  • Kinesis Data Streams (KDS)
  • Simple Queue Service (SQS)
  • Internet of Things (IoT)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are 2 examples of Near Real-Time Data Collection?

A
  • Kinesis Data Firehose (KDF)

- Database Migration Service (DMS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are 2 examples of Batch Data Collection?

A
  • Snowball

- Data Pipeline

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the 3 Kinesis Services?

A
  • Kinesis Streams
  • Kinesis Analytics
  • Kinesis Firehose
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does Kinesis Streams allow you to do?

A

Low latency streaming ingest at scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does Kinesis Analytics allow you to do?

A

Perform real-time analytics on streams using SQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does Kinesis Firehose allow you to do?

A

Load streams into S3, Redshift, ElasticSearch & Splunk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do Kinesis Streams work?

A
  • Producers update data in shards/partitions

- Consumers read from partitions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the data retention in Kinesis Streams?

A
  • Default is 24 hours

- Can go up to 7 days

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How many applications can consume the same stream?

A

Multiple applications can consume the same stream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Can an application update data in a Kinesis stream?

A

No, the data is immutable. It is append-only and will remain until the data retention period has been reached

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How are records ordered?

A

They are ordered per shard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is produced to a shard?

A

Records and Record Keys are produced to shards

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a record made up of?

A

A record is made up of a Data Blob

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a Data Blob?

A

It is the data being sent in a stream, serialized as bytes up to 1MB

17
Q

What is a Record Key?

A

It helps to group records in shards.

- Same key = dame shard

18
Q

How do you avoid a “hot partition” problem?

A

Use a highly distributed record key

19
Q

What is a Sequence Number?

A

It is the unique id given to the record by Kinesis

20
Q

How many MBs per second can a producer write per shard?

A

1MB per second

21
Q

How many messages per second can a producer write per shard?

A

1000 messages per second

22
Q

What happens if I go over the 1MB/s limit or the 1000 messages/s limit?

A

I get the “ProvisionedThroughoutException”

23
Q

What is the read data limit per shard for Classic Consumers?

A

2MBs per second across all consumers

24
Q

What is the API call limit per shard for Classic Consumers?

A

5 API calls per second per shard across all consumers

25
Q

What is the read data limit per shard for Enhanced Fan-Out Consumers?

A

2MBs per second, per shard, per Enhanced Consumer

26
Q

What is the API call limit per shard for Enhanced Fan-Out Consumers?

A

It is a push model, no API calls needed

27
Q

What is the default Data Retention for Kinesis Streams?

A

24 hours

28
Q

What can the Data Retention for Kinesis Streams be extended to?

A

7 days