Section 22: Data & Analytics Flashcards
(68 cards)
What is AWS Athena?
AWS Athena is an interactive query service that allows you to analyze data directly in Amazon S3 using standard SQL.
True or False: AWS Athena is a serverless service.
True.
Fill in the blank: AWS Athena charges you based on the amount of ______ processed by your queries.
data
What types of data formats does AWS Athena support?
Athena supports formats such as CSV, JSON, ORC, Parquet, and Avro.
How does AWS Athena handle security?
Athena uses AWS Identity and Access Management (IAM) for access control and integrates with AWS Key Management Service (KMS) for data encryption.
What is the maximum query result size in AWS Athena?
The maximum query result size in AWS Athena is 30 MB.
Which AWS service is often used in conjunction with Athena for data cataloging?
AWS Glue.
True or False: You need to provision servers to use AWS Athena.
False.
What is the primary use case for AWS Athena?
Athena is primarily used for querying large datasets stored in Amazon S3 without the need for data loading.
What SQL dialect does AWS Athena use?
Athena uses Presto SQL.
Can AWS Athena query data stored in formats like Parquet and ORC?
Yes, Athena can query data stored in Parquet and ORC formats.
What are the two main components of AWS Athena?
The two main components are the query engine and the data catalog.
Fill in the blank: AWS Athena can be accessed via the ______ console, the AWS CLI, and the AWS SDKs.
AWS Management
How does AWS Athena integrate with Amazon QuickSight?
Athena can be used as a data source for Amazon QuickSight to visualize data.
What is the role of AWS Glue Data Catalog in relation to AWS Athena?
AWS Glue Data Catalog serves as a central repository to store metadata for the data queried in Athena.
Can you use AWS Athena to join data from multiple S3 buckets?
Yes, you can join data from multiple S3 buckets in AWS Athena.
True or False: AWS Athena supports partitioned tables.
True.
What is the benefit of using partitioning in AWS Athena?
Partitioning improves query performance by reducing the amount of data scanned.
What is the default retention period for query results in AWS Athena?
The default retention period for query results in AWS Athena is 45 days.
Which AWS service can be used to schedule queries in AWS Athena?
AWS Lambda can be used to schedule queries in AWS Athena.
Fill in the blank: AWS Athena can be used to analyze data in _____ time.
real
What type of queries can be run on AWS Athena?
You can run ad-hoc queries, complex queries, and analytical queries on AWS Athena.
True or False: AWS Athena allows you to create views.
True.
In Athena what type of data can save costs and improve performance
columnar