AWS Analytics Flashcards

1
Q

What is Amazon Redshift?

A

Redshift is a scalable data warehouse solution.

  • Data warehousing solution
  • Improves speed and efficiency
  • Handles exabyte-scale data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How to use Amazon Redshift in the real world?

A
  • When consolidating multiple data sources for reporting
  • When you want to run a database that doesn’t require real-time transaction processing (INSERT,UPDATE and DELETE)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Amazon Athena?

A

Amazon Athena is query service for Amazon S3

  • Query Service
  • Analyze S3 data using SQL
  • Pay per query
  • Considered serverless
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Amazon Glue?

A

Amazon Glue prepares your data for analytics.

  • Extract, Transform, Load (ETL) service
  • Prepeare load data
  • Helps to better understand your data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Amazon Kinesis?

A

Amazon Kinesis allows you to analyze data and video streams in real time.

  • Analyze real-time, streaming data
  • Supports video, audio, application logs, website clickstreams, and IoT

NB: Kinesis helps analyze logs in near real time for application monitoring or fraud detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Elastic MapReduce (EMR)?

A

EMR helps you process large amounts of data.

  • Process big data
  • Analyze data using Hadoop
  • Works with big data frameworks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Data Pipeline?

A

Data Pipeline helps you move data between compute and storage services running either on AWS or on-premises.

  • Moves data at specific intervals
  • Moves data based on conditions
  • Sends notifications on success or failure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly