12. Machine Learning Flashcards

(68 cards)

1
Q

What is the primary purpose of AWS Machine Learning services?

A

To enable developers and data scientists to build, train, and deploy machine learning models at scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or False: AWS SageMaker is a fully managed service for machine learning.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Fill in the blank: AWS _____ allows for the creation of Jupyter notebooks for data exploration and model building.

A

SageMaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the function of AWS Glue in data engineering?

A

To prepare and transform data for analytics and machine learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which AWS service provides a data lake solution?

A

AWS Lake Formation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the role of Amazon S3 in machine learning workflows?

A

To store and retrieve large datasets used for training machine learning models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

List one use case for Amazon Rekognition.

A

Image and video analysis for facial recognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the term ‘hyperparameter tuning’ refer to in machine learning?

A

The process of optimizing the parameters that govern the training process of a model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or False: Amazon Comprehend provides natural language processing capabilities.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the purpose of the Amazon SageMaker Model Registry?

A

To store and manage different versions of machine learning models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which AWS service is used for deploying machine learning models as APIs?

A

Amazon SageMaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the significance of using Amazon Redshift in data engineering?

A

It is a fully managed data warehouse service that allows for complex queries and analytics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Multiple Choice: Which of the following is NOT a feature of AWS SageMaker?
A) Built-in algorithms
B) Real-time predictions
C) Manual model deployment
D) Jupyter notebook support

A

C) Manual model deployment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Fill in the blank: AWS _____ is used for building, training, and deploying machine learning models quickly.

A

SageMaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What type of data can Amazon Polly convert into speech?

A

Text data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

True or False: Amazon Forecast is used for time series forecasting.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the main advantage of using Amazon SageMaker Ground Truth?

A

To create labeled datasets efficiently using human labeling and automation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Which service helps in detecting anomalies in time series data?

A

Amazon Lookout for Metrics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the purpose of Amazon Personalize?

A

To provide personalized recommendations to users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the term ‘training data’ refer to?

A

Data used to train a machine learning model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Multiple Choice: Which AWS service is specifically designed for visual search and image recognition?
A) Amazon Rekognition
B) Amazon Polly
C) Amazon Lex
D) AWS Glue

A

A) Amazon Rekognition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the role of AWS Lambda in a machine learning pipeline?

A

To run code in response to events without provisioning servers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

True or False: Amazon Lex is used for building conversational interfaces.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the purpose of feature engineering?

A

To select, modify, or create features that improve model performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Fill in the blank: The _____ is the model's ability to generalize to new, unseen data.
generalization
26
What is a confusion matrix used for?
To evaluate the performance of a classification model.
27
What does the acronym 'ETL' stand for in data engineering?
Extract, Transform, Load
28
Multiple Choice: Which service provides managed Hadoop and Spark? A) Amazon EMR B) Amazon S3 C) Amazon RDS D) AWS Glue
A) Amazon EMR
29
What is the significance of model evaluation?
To assess how well a model performs on unseen data.
30
True or False: AWS Data Pipeline is used for data orchestration.
True
31
What is the purpose of Amazon SageMaker Autopilot?
To automate the process of building machine learning models.
32
What type of model is used for predicting continuous outcomes?
Regression model
33
Fill in the blank: AWS _____ provides tools for natural language understanding.
Comprehend
34
What is the role of Amazon Kinesis in data engineering?
To collect, process, and analyze real-time streaming data.
35
Multiple Choice: Which AWS service is a serverless data warehouse? A) Amazon Redshift B) Amazon Athena C) Amazon RDS D) AWS Glue
B) Amazon Athena
36
What is a ROC curve used for?
To evaluate the trade-off between true positive rate and false positive rate.
37
True or False: Amazon SageMaker can only be used for supervised learning.
False
38
What is the main function of Amazon Translate?
To provide language translation services.
39
Fill in the blank: The _____ is the set of rules that a machine learning model follows to make predictions.
algorithm
40
What is the purpose of the Amazon SageMaker debugger?
To provide insights into the training process of machine learning models.
41
What type of learning is used when the model learns from labeled data?
Supervised learning
42
Multiple Choice: Which of the following is a key benefit of using AWS for machine learning? A) Scalability B) Manual resource management C) High latency D) Limited data access
A) Scalability
43
What is Amazon SageMaker's role in model deployment?
It provides capabilities for deploying models to production environments.
44
True or False: AWS provides tools for both data preparation and model training.
True
45
What does 'overfitting' refer to in machine learning?
When a model learns the training data too well and performs poorly on unseen data.
46
Fill in the blank: The process of splitting data into training and testing sets is known as _____.
data splitting
47
What is the significance of data normalization?
To scale features to a similar range for better model performance.
48
Multiple Choice: Which AWS service is best for real-time analytics? A) Amazon Redshift B) AWS Glue C) Amazon Kinesis D) Amazon S3
C) Amazon Kinesis
49
What is the purpose of Amazon SageMaker Pipelines?
To create, automate, and manage machine learning workflows.
50
True or False: Amazon Lex can be used to create chatbots.
True
51
What is the role of feature selection in machine learning?
To identify the most relevant features for model training.
52
Fill in the blank: The _____ is a measure of how well a model performs on a given dataset.
accuracy
53
What is a decision tree in machine learning?
A model that makes decisions based on a series of questions.
54
What is the main function of Amazon SageMaker Neo?
To optimize machine learning models for deployment on various platforms.
55
Multiple Choice: Which of the following is a type of unsupervised learning? A) Classification B) Clustering C) Regression D) Time series forecasting
B) Clustering
56
What is the purpose of Amazon SageMaker Clarify?
To detect and mitigate bias in machine learning models.
57
True or False: AWS supports only deep learning frameworks for machine learning.
False
58
What is an ensemble model?
A model that combines predictions from multiple models to improve accuracy.
59
Fill in the blank: The process of adjusting model parameters based on training data is known as _____.
training
60
What does 'data drift' refer to?
Changes in data distribution over time that can affect model performance.
61
What is the primary function of AWS Data Wrangler?
To simplify the process of data preparation for machine learning.
62
Multiple Choice: Which of the following AWS services is used for data transformation? A) Amazon SageMaker B) AWS Glue C) Amazon Comprehend D) Amazon Rekognition
B) AWS Glue
63
True or False: Amazon Forecast is based on machine learning algorithms.
True
64
What is the significance of a validation set?
To tune model hyperparameters and avoid overfitting.
65
Fill in the blank: A _____ is a function that maps inputs to outputs in a machine learning model.
predictor
66
What is the purpose of Amazon Lookout for Equipment?
To monitor equipment for anomalies and predict failures.
67
Multiple Choice: Which AWS service is primarily used for building serverless applications? A) AWS Lambda B) Amazon S3 C) Amazon EC2 D) Amazon RDS
A) AWS Lambda
68
What is the main advantage of using a cloud-based machine learning platform?
Flexibility and scalability without the need for physical infrastructure.