14. Miscellaneous Flashcards
(78 cards)
What is the primary purpose of AWS Glue?
AWS Glue is a fully managed ETL (Extract, Transform, Load) service that prepares data for analytics.
True or False: Amazon Redshift is a fully managed data warehouse service.
True
Fill in the blank: AWS __________ allows for serverless data integration.
Glue
What does the acronym ETL stand for?
Extract, Transform, Load
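Example: the three ETL stages can be sketched with the Python standard library alone. This is a minimal illustration, not how AWS Glue is implemented; the sample data, the `orders` table, and the function names are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; in practice this would come from a file or an API.
RAW_CSV = """order_id,customer,amount
1, alice ,19.90
2,BOB,5.00
3,carol,not_a_number
"""

def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: trim and lowercase names, cast amounts, skip bad rows."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # drop rows whose amount is not numeric
        clean.append((int(row["order_id"]), row["customer"].strip().lower(), amount))
    return clean

def load(rows, conn):
    """Load: write the cleaned rows into a target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders (order_id INTEGER, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()

conn = sqlite3.connect(":memory:")  # in-memory database stands in for a warehouse
load(transform(extract(RAW_CSV)), conn)
print(conn.execute("SELECT customer, amount FROM orders ORDER BY order_id").fetchall())
```

The row with a non-numeric amount is dropped during the transform step, so only two rows reach the target table.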
Which AWS service is primarily used for real-time data streaming?
Amazon Kinesis
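Example: a rough sketch of preparing one record for a Kinesis data stream. The stream name `clickstream` and the event fields are hypothetical. The `send` helper assumes boto3 is installed and AWS credentials are configured, so it is defined but not called here.

```python
import json

def build_put_record(stream_name, event, key_field):
    """Assemble keyword arguments for the Kinesis PutRecord call."""
    return {
        "StreamName": stream_name,
        "Data": json.dumps(event).encode("utf-8"),  # payload is sent as bytes
        "PartitionKey": str(event[key_field]),      # determines the target shard
    }

def send(record):
    """Send the record; requires boto3 and AWS credentials (assumption)."""
    import boto3  # imported lazily so the sketch loads without the SDK installed
    return boto3.client("kinesis").put_record(**record)

# Hypothetical click event keyed by user so one user's events stay ordered.
record = build_put_record("clickstream", {"user_id": 42, "page": "/home"}, "user_id")
print(record["PartitionKey"])
```

Records that share a partition key go to the same shard, which is why a per-user key is a common choice when the order of one user's events matters.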
What is the maximum number of nodes in an Amazon Redshift cluster?
128, for the largest node types (for example, ra3.16xlarge); smaller node types have lower limits.
Multiple choice: Which service can be used to automate the extraction of data from multiple sources? A) AWS Lambda B) AWS Data Pipeline C) Amazon CloudWatch
B) AWS Data Pipeline
True or False: Amazon S3 is an ideal storage solution for big data analytics.
True
What does AWS Lake Formation help to create?
A secure data lake
Fill in the blank: __________ is a managed service for stream processing in AWS.
Amazon Kinesis
What is the main benefit of using Amazon EMR?
It processes vast amounts of data quickly on managed clusters running frameworks such as Apache Hadoop and Apache Spark.
Multiple choice: Which of the following is not used to build a data lake on AWS? A) Amazon S3 B) Amazon RDS C) AWS Lake Formation
B) Amazon RDS
True or False: AWS Data Pipeline can be used to schedule data workflows.
True
What is the purpose of AWS Glue DataBrew?
AWS Glue DataBrew is a visual data preparation tool that helps users clean and normalize data without writing code.
What does Amazon Athena allow you to do?
Run SQL queries on data stored in Amazon S3 without needing to set up a data warehouse.
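Example: a sketch of submitting an Athena query from Python. The database `weblogs`, the table `access_logs`, and the results bucket are hypothetical. The `run_query` helper assumes boto3 is installed and AWS credentials are configured, so it is defined but not called here.

```python
def build_athena_request(sql, database, output_location):
    """Assemble keyword arguments for Athena's StartQueryExecution call."""
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_location},
    }

def run_query(request):
    """Submit the query; requires boto3 and AWS credentials (assumption)."""
    import boto3  # imported lazily so the sketch loads without the SDK installed
    client = boto3.client("athena")
    return client.start_query_execution(**request)["QueryExecutionId"]

request = build_athena_request(
    sql="SELECT status, COUNT(*) AS hits FROM access_logs GROUP BY status",
    database="weblogs",                              # hypothetical catalog database
    output_location="s3://example-athena-results/",  # hypothetical results bucket
)
print(request["QueryString"])
```

Athena runs queries asynchronously: the call returns a query execution ID, and the results are written to the S3 output location.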
Fill in the blank: Amazon __________ provides a way to build, train, and deploy machine learning models in the cloud.
SageMaker
What is the primary function of AWS Step Functions?
To coordinate multiple AWS services into serverless workflows.
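Example: Step Functions workflows are defined in the JSON-based Amazon States Language. The sketch below builds a two-step definition as a Python dictionary; the state names and Lambda ARNs are hypothetical, and the `execution_order` helper only handles linear workflows (no Choice or Parallel states).

```python
import json

# Hypothetical two-step workflow; the account ID and function names are placeholders.
definition = {
    "Comment": "Hypothetical two-step ETL workflow",
    "StartAt": "ExtractData",
    "States": {
        "ExtractData": {
            "Type": "Task",
            "Resource": "arn:aws:lambda:us-east-1:123456789012:function:extract",
            "Next": "TransformData",
        },
        "TransformData": {
            "Type": "Task",
            "Resource": "arn:aws:lambda:us-east-1:123456789012:function:transform",
            "End": True,
        },
    },
}

def execution_order(defn):
    """Follow Next links from StartAt until a state marked End."""
    order, name = [], defn["StartAt"]
    while True:
        order.append(name)
        state = defn["States"][name]
        if state.get("End"):
            return order
        name = state["Next"]

print(execution_order(definition))
print(json.dumps(definition, indent=2))  # JSON form of the state machine definition
```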
Multiple choice: Which service is best for batch processing of large data sets? A) Amazon Kinesis B) AWS Lambda C) Amazon EMR
C) Amazon EMR
True or False: Amazon QuickSight is used for data visualization.
True
What is the function of the AWS Glue Data Catalog?
It acts as a central repository for storing metadata about data assets.
Fill in the blank: __________ is an AWS service used for data warehousing.
Amazon Redshift
What type of database is Amazon DynamoDB?
A fully managed NoSQL database.
Multiple choice: Which service would you use to create a data pipeline? A) AWS Lambda B) AWS Glue C) Amazon RDS
B) AWS Glue
True or False: Amazon S3 supports versioning of objects.
True
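Example: a sketch of turning on versioning for a bucket. The bucket name is hypothetical. The `apply_versioning` helper assumes boto3 is installed and AWS credentials are configured, so it is defined but not called here.

```python
def build_versioning_request(bucket, enabled=True):
    """Assemble keyword arguments for the S3 PutBucketVersioning call."""
    return {
        "Bucket": bucket,
        "VersioningConfiguration": {"Status": "Enabled" if enabled else "Suspended"},
    }

def apply_versioning(request):
    """Apply the setting; requires boto3 and AWS credentials (assumption)."""
    import boto3  # imported lazily so the sketch loads without the SDK installed
    boto3.client("s3").put_bucket_versioning(**request)

request = build_versioning_request("example-data-lake-bucket")  # hypothetical bucket
print(request)
```

Once versioning has been enabled on a bucket it can be suspended but not removed, which is why the only two status values are `Enabled` and `Suspended`.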