7. Analytics Flashcards
(65 cards)
What does AWS stand for?
Amazon Web Services
What is the primary purpose of AWS Glue?
To prepare and transform data for analytics.
True or False: Amazon Redshift is a data warehouse service.
True
Which AWS service is used for real-time data streaming?
Amazon Kinesis (Kinesis Data Streams)
What type of data model does Amazon DynamoDB use?
A NoSQL key-value and document data model.
Fill in the blank: Amazon _____ is used for data lake storage.
S3
What is the purpose of Amazon QuickSight?
To create visualizations and business intelligence dashboards.
Which service would you use to perform ETL operations in AWS?
AWS Glue
What is the maximum size of an object that can be stored in Amazon S3?
5 TB per object
True or False: Amazon Athena allows you to run SQL queries on data stored in S3.
True
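A minimal Python sketch of how such a query might be submitted and read back. It assumes a client object exposing the Athena API calls `start_query_execution`, `get_query_execution`, and `get_query_results` (as the boto3 Athena client does); the function name `run_athena_query` and its parameters are hypothetical, chosen only for illustration.

```python
import time


def run_athena_query(client, sql, database, output_location, poll_seconds=1.0):
    """Run a SQL query through an Athena client and return the first page of rows."""
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        # Athena writes query results to this S3 location.
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Queries run asynchronously, so poll until a terminal state is reached.
    while True:
        status = client.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"Athena query ended in state {state}")

    # Only the first page of results is fetched in this sketch.
    page = client.get_query_results(QueryExecutionId=query_id)
    return [
        [col.get("VarCharValue") for col in row["Data"]]
        for row in page["ResultSet"]["Rows"]
    ]
```

With boto3 installed and credentials configured, the client would typically come from `boto3.client("athena")`; the database name and the `s3://` output location passed in are placeholders you would supply. For a SELECT query, the first returned row usually holds the column headers.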
What is Amazon EMR primarily used for?
Processing large amounts of data using big data frameworks such as Apache Hadoop and Apache Spark.
Which AWS service provides a managed Apache Kafka service?
Amazon MSK (Amazon Managed Streaming for Apache Kafka)
What does the term ‘data lake’ refer to?
A centralized repository that allows you to store all your structured and unstructured data at any scale.
Fill in the blank: AWS _____ is a serverless data integration service.
Glue
What does Amazon Redshift Spectrum allow you to do?
Query data directly in S3 without loading it into Redshift.
Which service provides a fully managed data warehouse solution?
Amazon Redshift
True or False: AWS Data Pipeline is used for data orchestration.
True
What is the primary function of Amazon RDS?
To provide a managed relational database service.
Which service is best suited for storing time-series data?
Amazon Timestream
What is the benefit of using Amazon Aurora?
It is a MySQL- and PostgreSQL-compatible relational database engine that offers high performance and availability.
Fill in the blank: AWS _____ is used to visualize data and create dashboards.
QuickSight
Which service is designed for batch processing of data?
AWS Batch
What does the term ‘data wrangling’ mean?
The process of cleaning and transforming raw data into a usable format.
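As an illustration of that definition, here is a small Python sketch of one wrangling pass. The field names (`order_id`, `amount`, `date`), the `MM/DD/YYYY` input date format, and the function name `wrangle` are all assumptions made up for this example, not part of any AWS service.

```python
from datetime import datetime


def wrangle(raw_rows):
    """Clean raw string records into typed, de-duplicated dicts."""
    cleaned = []
    seen = set()
    for row in raw_rows:
        # Normalize whitespace and strip thousands separators.
        order_id = (row.get("order_id") or "").strip()
        amount_text = (row.get("amount") or "").strip().replace(",", "")
        date_text = (row.get("date") or "").strip()

        if not order_id or not amount_text or not date_text:
            continue  # drop rows with missing fields
        if order_id in seen:
            continue  # drop duplicate order ids

        try:
            amount = float(amount_text)
            day = datetime.strptime(date_text, "%m/%d/%Y").date()
        except ValueError:
            continue  # drop rows whose values cannot be parsed

        seen.add(order_id)
        cleaned.append(
            {"order_id": order_id, "amount": amount, "date": day.isoformat()}
        )
    return cleaned
```

Dropping unparseable rows is only one possible policy; a real pipeline might instead route them to a separate location for review.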
Which AWS service allows for serverless data analytics?
Amazon Athena