Kinesis Flashcards

This deck aims to help retain concepts related to the Kinesis service.

1
Q

Which AWS serverless streaming data service simplifies capturing, processing, and storing data streams at any scale?

A

Kinesis Data Streams - public, regional, and highly available AWS service

2
Q

What is the minimum and maximum retention period for data in Kinesis Data Streams?

A

By default, Kinesis Data Streams retains data for 24 hours (the minimum); retention can be extended up to 365 days (the maximum) at additional cost

3
Q

When using Kinesis Data Streams, is it possible to configure more than one producer and consumer?

A

Yes, Kinesis Data Streams supports multiple producers for data ingestion and multiple consumers for data reading

4
Q

Which Kinesis Data Streams component is responsible for scaling?

A

Shards

5
Q

Which Kinesis Data Streams components can impact pricing?

A

The number of shards and the configured data retention window (24 hours to 365 days)

6
Q

Where is data stored in Kinesis Data Streams?

A

Data is stored in Kinesis Data Records, with each record up to 1 MB in size

7
Q

Which AWS service is ideal for large-scale data ingestion by numerous producers, with multiple consumers processing the data at varying rates, for use cases such as analytics and monitoring application clicks?

A

Amazon Kinesis Data Streams

7
Q

Which AWS service is used to transfer data from Kinesis Data Streams to other AWS services?

A

Amazon Kinesis Data Firehose

8
Q

How can Kinesis Data Streams performance be improved?

A

By increasing the number of shards. Each shard supports up to 1 MB/s for ingestion and 2 MB/s for consumption

9
Q

Which AWS serverless service is used to capture, transform, and load large volumes of streaming data from hundreds of thousands of sources into AWS services like S3, Redshift, or OpenSearch?

A

Kinesis Data Firehose

10
Q

What is the primary purpose of Kinesis Data Firehose?

A

To load data into data lakes, storage solutions, and analytics services

11
Q

Is Kinesis Data Firehose a real-time service?

A

No, it operates in near-real-time, typically delivering data within ~60 seconds

12
Q

Can Kinesis Data Firehose perform on-the-fly data transformations?

A

Yes, it can use AWS Lambda for transformations, though this may introduce some latency
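
As a rough illustration of what such a transformation might look like, the sketch below follows the record contract Firehose uses when invoking Lambda (each record carries a `recordId` and base64-encoded `data`, and the function returns the same `recordId` with a `result` status). The `message` field and the uppercase step are hypothetical, chosen only to have something to transform.

```python
import base64
import json


def lambda_handler(event, context):
    """Uppercase a hypothetical 'message' field in each Firehose record."""
    output = []
    for record in event["records"]:
        try:
            # Firehose hands the payload to Lambda base64-encoded.
            payload = json.loads(base64.b64decode(record["data"]))
            payload["message"] = str(payload.get("message", "")).upper()
            encoded = base64.b64encode(
                (json.dumps(payload) + "\n").encode("utf-8")
            ).decode("utf-8")
            output.append(
                {"recordId": record["recordId"], "result": "Ok", "data": encoded}
            )
        except (ValueError, TypeError, AttributeError):
            # Payload was not a JSON object: hand it back marked as failed.
            output.append(
                {
                    "recordId": record["recordId"],
                    "result": "ProcessingFailed",
                    "data": record["data"],
                }
            )
    return {"records": output}
```

Every incoming `recordId` must appear in the response; the other accepted `result` value is `Dropped`, for records that should be discarded on purpose.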

13
Q

How is billing calculated for Kinesis Data Firehose?

A

It is based on the volume of data processed through the service

14
Q

What are common destinations for Kinesis Data Firehose?

A
  • S3
  • Redshift
  • OpenSearch
  • HTTP Endpoints
  • Datadog
  • Splunk
  • Elasticsearch
15
Q

How does scaling differ between Kinesis Data Firehose and Kinesis Data Streams?

A
  • Kinesis Data Firehose scales automatically
  • Kinesis Data Streams in provisioned mode requires manual scaling by adding or removing shards
16
Q

What are the primary use cases for Kinesis Data Firehose?

A
  • Persisting data from Kinesis Data Streams
  • Transforming and storing data in different formats
  • Delivering data to supported destinations
16
Q

How do replay capabilities differ between Kinesis Data Firehose and Kinesis Data Streams?

A
  • Kinesis Data Firehose doesn’t support replay
  • Kinesis Data Streams allows data replay
16
Q

What distinguishes Kinesis Data Firehose and Kinesis Data Streams in terms of data persistence?

A
  • Kinesis Data Firehose doesn’t support data persistence
  • Kinesis Data Streams can retain data from 24 hours to 365 days
17
Q

How do consumers differ between Kinesis Data Firehose and Kinesis Data Streams?

A
  • Kinesis Data Firehose is closed-ended with a single destination
  • Kinesis Data Streams is open-ended, supporting multiple producers and consumers
18
Q

Can unmodified data be saved when using Lambda for transformation with Kinesis Data Firehose?

A

Yes, unmodified data can optionally be delivered to S3

19
Q

Which service is suitable for delivering streaming data when Kinesis Data Streams features such as replay, retention, and multiple consumers are unnecessary?

A

Kinesis Data Firehose

20
Q

What AWS service combination supports real-time data transformation?

A

Kinesis Data Streams with Lambda

21
Q

Which AWS service enables real-time analysis of streaming data for actionable insights?

A

Kinesis Data Analytics

22
Q

Which AWS service allows real-time transformations, filtering, and enrichment of streaming data using SQL?

A

Kinesis Data Analytics

23
Q

Is Kinesis Data Analytics a real-time service?

A

Yes, it processes data in real-time

24
Q

What are the main use cases for Kinesis Data Analytics?

A
  • Processing streaming data in real-time with SQL queries (e.g., time-series data, dashboards, or security metrics)
  • Performing complex data manipulations in real-time
25
Q

What data sources are supported by Kinesis Data Analytics?

A
  • Kinesis Data Streams
  • Kinesis Data Firehose
  • S3 for static reference data
26
Q

What are the supported destinations for Kinesis Data Analytics?

A
  • Kinesis Data Streams (real-time)
  • Kinesis Data Firehose (near real-time)
27
Q

How is data retrieved by consumers from Kinesis Video Streams?

A

Consumers retrieve data frame-by-frame for further analysis

27
Q

What are typical producers for Kinesis Video Streams?

A

Devices such as security cameras, smartphones, cars, drones, and sources of time-serialized data like thermal, depth, and RADAR streams

27
Q

Which AWS service automatically provisions and scales infrastructure for ingesting live video streams from millions of devices?

A

Kinesis Video Streams

28
Q

Does Kinesis Video Streams integrate with other AWS services?

A

Yes, it integrates with a range of AWS services, commonly used with Rekognition for facial recognition and S3 for data storage

29
Q

Can Kinesis Video Streams handle secure data persistence?

A

Yes, it securely persists data, with encryption both in transit and at rest

30
Q

Can data captured by Kinesis Video Streams be accessed directly from storage?

A

No, data is indexed and structured, so it can only be accessed via APIs

31
Q

How would you configure Kinesis Video Streams for two IP cameras?

A

Each camera streams data to its own video stream

32
Q

When dealing with live video streams, Real-Time Streaming Protocol (RTSP), or GStreamer, which AWS service comes to mind?

A

Kinesis Video Streams

33
Q

Which AWS service supports large-scale data ingestion by multiple producers and consumption by multiple consumers in REAL-TIME?

A

Kinesis Data Streams

34
Q

Which AWS managed service offers streaming data delivery and transformation capabilities in near-real-time?

A

Kinesis Data Firehose

35
Q

Which service processes streaming data using SQL in real-time?

A

Kinesis Data Analytics

36
Q

Can the order of data be guaranteed across multiple shards in AWS Kinesis Data Streams?

A

No, data order can only be guaranteed within a single shard

37
Q

What is the formula to calculate the initial number of shards for AWS Kinesis Data Streams?

A

num_of_shards = max(incoming_write_bandwidth_in_KiB / 1024, outgoing_read_bandwidth_in_KiB / 2048)
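
A minimal sketch of the formula as a function. The name `initial_shard_count` is made up for illustration, and rounding up to a whole shard with a floor of one shard is an assumption added here, since a stream cannot have a fractional number of shards.

```python
import math


def initial_shard_count(write_kib_per_s, read_kib_per_s):
    """Estimate the starting shard count from expected throughput in KiB/s."""
    # Each shard ingests up to 1024 KiB/s and serves up to 2048 KiB/s.
    write_shards = write_kib_per_s / 1024
    read_shards = read_kib_per_s / 2048
    # Assumption: round up and never return fewer than one shard.
    return max(1, math.ceil(max(write_shards, read_shards)))


# 3000 KiB/s in and 8000 KiB/s out: reads dominate (8000 / 2048 = 3.9...)
print(initial_shard_count(3000, 8000))  # 4
```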
38
Q

What is the purpose of the partition key in AWS Kinesis Data Streams?

A

The partition key is used to distribute data records across multiple shards in a stream
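
To make the distribution concrete: Kinesis hashes the partition key with MD5 into a 128-bit integer and routes the record to the shard whose hash key range contains that value. The sketch below mimics this locally. The helper names are hypothetical, and the evenly split ranges and big-endian interpretation of the digest are assumptions made for illustration.

```python
import hashlib

MAX_HASH = 2**128 - 1


def even_hash_ranges(shard_count):
    """Split the 128-bit hash space into contiguous, roughly equal ranges."""
    size = 2**128 // shard_count
    ranges = []
    for i in range(shard_count):
        start = i * size
        # The last shard absorbs any remainder so the whole space is covered.
        end = MAX_HASH if i == shard_count - 1 else (i + 1) * size - 1
        ranges.append((start, end))
    return ranges


def shard_for_key(partition_key, ranges):
    """Return the index of the range containing the key's MD5 hash."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    hash_value = int.from_bytes(digest, "big")
    for index, (start, end) in enumerate(ranges):
        if start <= hash_value <= end:
            return index
    raise ValueError("hash ranges do not cover the full key space")


ranges = even_hash_ranges(4)
for key in ["user-1", "user-2", "user-1"]:
    print(key, "-> shard", shard_for_key(key, ranges))
```

Records that share a partition key always land on the same shard, which is why ordering holds per shard but not across shards.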
39
Q

What stand-alone Java software application offers an easy way to collect and send data to Kinesis Data Streams?

A

Kinesis Agent

40
Q

How does the Kinesis Client Library (KCL) manage shard processing among multiple EC2 instances?

A

Each shard is assigned to exactly one worker at a time via a lease, so with 10 shards a maximum of 10 EC2 instances (workers) can be used
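
The one-lease-per-shard rule can be pictured with a toy assignment. This is not how the KCL is implemented (it coordinates leases through a DynamoDB table, and workers take leases themselves); the round-robin function below is a hypothetical stand-in that only shows why workers beyond the shard count sit idle.

```python
def assign_leases(shard_ids, worker_ids):
    """Give each shard to exactly one worker, round-robin (illustration only)."""
    if not worker_ids:
        raise ValueError("at least one worker is required")
    assignments = {worker: [] for worker in worker_ids}
    for index, shard in enumerate(shard_ids):
        worker = worker_ids[index % len(worker_ids)]
        assignments[worker].append(shard)
    return assignments


shards = [f"shard-{i}" for i in range(10)]
workers = [f"worker-{i}" for i in range(12)]
leases = assign_leases(shards, workers)
idle = [worker for worker, held in leases.items() if not held]
print(idle)  # ['worker-10', 'worker-11']
```

With fewer workers than shards, each worker simply holds several leases; adding an eleventh worker to a 10-shard stream adds no throughput.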