Leveraging Analytics Services Flashcards
What is a data warehouse?
Is a data storage solution that aggregates massive amounts of data from disparate sources.
Benefits supports querying, reporting, analytics, and business intelligence.
Not good for transactional systems
What is Amazon Redshift?
Is an AWS warehouse solution
Handles a LOT of data
What are some real world use cases of Amazon Redshift?
When you want to consolidate multiple data sources for reporting
When you want to run a database that doesn’t require real-time transaction processing (insert, update, and delete)
What is Athena?
A query service that allows you to analyze data in S3 using standard SQL.
You pay per query and it is serverless.
What are some real world use cases of Athena?
Helps you query historical data sorted in S3 as if they were relational data using standard SQL
What is Glue?
Prepares your data for analytics.
It is an ETL (extract, transfer, load) service that generates ETL code.
What is Kinesis?
Real time streaming data.
What are some real world use cases of Kinesis?
Helps you analyze logs in near real time for applications monitoring or fraud detection
What is Elastic MapReduce (EMR)?
Helps you process large amounts of data.
Analyze data using Hadoop and work with other frameworks like Apache Spark
What is Data Pipeline?
Helps you move data between compute and storage services running either on AWS or on-premises.
Sends notification on failure
What is QuickSight?
Helps you to visualize your data. Build interactive dashboards that you can embed into your applications.