Practice Questions - exam-ai-900 Flashcards
(246 cards)
Match the services to the appropriate descriptions.
To answer, drag the appropriate service from the column on the left to its description on the right. Each service may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
- Uses natural language to query a knowledge base = Language Service
- Transcribes spoken audio into text = Speech
In a machine learning model, the data that is used as inputs are called ________.
Select the answer that correctly completes the sentence.
A.
dataset
B.
labels
C.
variables
C. variables
DISCUSSION:
The question asks about the specific term for the input data in a machine learning model. While a dataset contains the data, it’s not the term for the inputs themselves. Labels are the target values the model tries to predict, not the inputs. The correct term for the input data is “variables” or “features,” which represent the characteristics used to make predictions.
Therefore, option C is correct.
Options A and B are incorrect because they refer to the entire collection of data (dataset) and the target values (labels) respectively, not the input features.
For each of the following statements, select Yes if the statement is true. Otherwise, select No.
NOTE: Each correct selection is worth one point.
- Object detection can identify the location of a damaged product in an image.
- Object detection can identify multiple instances of a damaged product in an image.
- Object detection can identify multiple types of damaged products in an image.
Yes
Yes
Yes
You have a natural language processing (NLP) model that was created by using data obtained without permission.
Which Microsoft principle for responsible AI does this breach?
A.
reliability and safety
B.
privacy and security
C.
inclusiveness
D.
transparency
B. privacy and security
The act of using data without permission directly violates the privacy and security principle of Responsible AI. This principle emphasizes the importance of respecting and protecting individuals’ privacy rights and securing their data.
Options A, C, and D are incorrect because while reliability and safety, inclusiveness, and transparency are all important principles of Responsible AI, the scenario specifically describes a violation of data privacy through the unauthorized collection and use of data.
To complete the sentence, select the appropriate option in the answer area.
DISCUSSION:
The question describes an AI solution that provides feedback on exposure, noise, and occlusion to help photographers take better pictures. This process involves examining the photo’s qualities and characteristics. Therefore, “Analysis” is the most appropriate term. “Detection” would primarily involve identifying faces within an image, not necessarily assessing the qualities mentioned.
You have a website that includes customer reviews.
You need to store the reviews in English and present the reviews to users in their respective language by recognizing each user’s geographical location.
Which type of natural language processing workload should you use?
A. key phrase extraction
B. speech recognition
C. language modeling
D. translation
D. translation
The problem describes a scenario where text needs to be converted from one language (English) to another language based on the user’s location. This is the core function of translation.
Option A is incorrect because key phrase extraction identifies important phrases within the text but does not translate the text.
Option B is incorrect because speech recognition converts spoken language into text.
Option C is incorrect because language modeling predicts the probability of a sequence of words, which is useful in generating text but not in translating it.
You need to make the written press releases of your company available in a range of languages.
Which service should you use?
A. Speech
B. Language
C. Translator
D. Personalizer
C. The Translator service is designed for translating text into different languages. The other services are not directly related to language translation. Speech is for converting text to speech, Language is a broader term and not a specific service, and Personalizer is for tailoring content to individual users.
During the process of Machine Learning, when should you review evaluation metrics?
A.
Before you train a model.
B.
After you clean the data.
C.
Before you choose the type of model.
D.
After you test a model on the validation data.
D. After you test a model on the validation data.
DISCUSSION:
The correct answer is D. Evaluation metrics are used to assess the performance of a trained model. This assessment can only be done after the model has been trained and tested on a validation dataset. Options A, B, and C are incorrect because evaluation metrics are not relevant at those stages of the machine learning process.
Select the answer that correctly completes the sentence.
Image
DISCUSSION:
The question asks to select the appropriate model to predict “whether” a loan application should be approved. The presence of “whether” strongly suggests a binary (yes/no) outcome. Classification models are used for predicting categories or classes, which aligns perfectly with the yes/no decision of loan approval. Regression models are used to predict continuous numeric values. Clustering is used to group data points, and reinforcement learning is a type of training based on rewards. Therefore, classification is the appropriate choice here.
For each of the following statements, select Yes if the statement is true. Otherwise, select No.
NOTE: Each correct selection is worth one point.
YNN
Explanation:
- Statement 1: Yes. A bot that responds to queries by internal users can leverage natural language processing to understand and interpret the users’ questions, providing more accurate and relevant responses. The Language service’s custom question answering feature enables you to define and publish a knowledge base of questions and answers with support for natural language querying.
- Statement 2: No. A mobile application that displays images based on a search term typically uses image retrieval and indexing techniques, rather than NLP. The search term itself does not require NLP for the images to be returned. Object recognition is more closely related.
- Statement 3: No. A web form used to submit a request to reset a password does not involve natural language processing. It relies on structured data input and predefined workflows.
You have a solution that analyzes social media posts to extract the mentions of city names and the city names discussed most frequently.
Which type of natural language processing (NLP) workload does the solution use?
A.
speech recognition
B.
sentiment analysis
C.
key phrase extraction
D.
entity recognition
D. entity recognition
DISCUSSION:
The correct answer is D, entity recognition. Entity recognition (also known as Named Entity Recognition or NER) is designed to identify and categorize entities in unstructured text. Cities are considered entities of type “location”.
A. Speech recognition converts spoken language into text, which is not relevant to this scenario.
B. Sentiment analysis identifies the sentiment (positive, negative, neutral) of a text, but not the entities within it.
C. Key phrase extraction identifies the most important words or phrases, but not necessarily specific entities like city names.
Select the answer that correctly completes the sentence.
DISCUSSION:
The question refers to RFM (Recency, Frequency, Monetary) analysis, a method used for customer segmentation based on transaction history. The goal is to group customers with similar behavior. This grouping process aligns with the definition of “Clustering,” which is an unsupervised machine learning technique used to group similar data points together. Therefore, “Clustering” is the correct answer.
For each of the following statements, select Yes if the statement is true. Otherwise, select No.
NOTE: Each correct selection is worth one point.
Yes
Yes
Yes
Explanation:
- A webchat bot can interact with users visiting a website: This is True. Webchat bots are designed to communicate with users on websites, often providing support or answering questions.
- Automatically generating captions for pre-recorded videos is an example of natural language processing: This is True. Generating captions involves converting spoken language to text (speech-to-text), which is a task within natural language processing (NLP).
- A smart device in the home that responds to questions such as “What will the weather be like today?” is an example of natural language processing: This is True. The device must understand and interpret spoken language, which is a core function of NLP.
Your company manufactures widgets.
You have 1,000 digital photos of the widgets.
You need to identify the location of the widgets within the photos.
What should you use?
A. Computer Vision Spatial Analysis
B. Custom Vision object detection
C. Computer Vision Image Analysis
D. Custom Vision classification
B. Custom Vision object detection
DISCUSSION:
The correct answer is B. Custom Vision object detection allows you to train a model to specifically identify and locate objects (in this case, widgets) within images by drawing bounding boxes around them.
Option A is incorrect because Computer Vision Spatial Analysis focuses on analyzing the spatial relationships between objects, not necessarily identifying their location with high precision.
Option C is incorrect because Computer Vision Image Analysis is a broad term and might not offer the specific object detection capabilities required.
Option D is incorrect because Custom Vision classification is used for categorizing images, not for locating specific objects within them.
You need to convert handwritten notes into digital text.
Which type of computer vision should you use?
A. facial detection
B. optical character recognition (OCR)
C. image classification
D. object detection
B. optical character recognition (OCR)
Explanation:
Optical Character Recognition (OCR) is the correct answer because it’s the computer vision technology specifically designed to convert images of text, including handwritten text, into machine-readable digital text.
- A. facial detection: Facial detection is used to identify faces in images or videos.
- C. image classification: Image classification categorizes an entire image into a predefined class (e.g., “cat,” “dog,” “car”).
- D. object detection: Object detection identifies and locates specific objects within an image, but it doesn’t convert text into a digital format.
You need to implement a pre-built solution that will identify well-known brands in digital photographs.
Which Azure Cognitive Services service should you use?
A. Custom Vision
B. Form Recognizer
C. Face
D. Computer Vision
D. Computer Vision
Explanation:
The question specifies a “pre-built solution” for brand recognition. Computer Vision offers a pre-built brand detection feature that identifies commercial brands in images using a database of logos.
- A. Custom Vision: Custom Vision is used for training a custom model to recognize specific objects, which is not required in this scenario as the requirement is to use a pre-built solution.
- B. Form Recognizer: Form Recognizer is used to extract text and data from forms and documents.
- C. Face: Face service is used for facial detection, recognition, and analysis.
Natural language processing can be used to __________.
Select the answer that correctly completes the sentence.
A. Analyze video content
B. Generate speech
C. Classify email messages as work-related or personal.
D. Classify images
C. Classify email messages as work-related or personal.
Explanation:
Option C is the most accurate completion of the sentence. Natural Language Processing (NLP) is designed to analyze and understand human language, making it suitable for classifying text-based content like emails. Options A and D relate to computer vision, and while option B (generate speech) is related, it is not the primary function of NLP.
* A. Analyze video content: This is incorrect because video analysis primarily uses computer vision techniques.
* B. Generate speech: This is incorrect because speech generation is more closely associated with speech synthesis. While related to NLP, it is not the core application.
* C. Classify email messages as work-related or personal: This is correct because NLP can analyze the text content of emails and categorize them based on their context.
* D. Classify images: This is incorrect because image classification is primarily the domain of computer vision.
You need to create a model that labels a collection of your personal digital photographs.
Which Azure Cognitive Services service should you use?
A. Form Recognizer
B. Custom Vision
C. Language
D. Computer Vision
B. Custom Vision
DISCUSSION:
The question specifies that you need to create a model to label the photographs. Custom Vision allows you to train a custom model using your own images and labels, which aligns perfectly with this requirement. This allows the model to learn specific objects or patterns unique to your personal photographs.
Option A, Form Recognizer, is designed for extracting structured data from forms and documents, not for general image labeling.
Option C, Language, is for processing and analyzing text data, not images.
Option D, Computer Vision, provides pre-built models for image analysis, but it doesn’t allow you to train a custom model with your own labels, making it less suitable for personalizing the labeling process. Some argue that Computer Vision can be used to generate descriptions that can then be used as labels. However, the requirement to create a model strongly suggests the need for training, which Custom Vision is designed for.
You have a dataset.
You need to build an Azure Machine Learning classification model that will identify defective products.
What should you do first?
A. Load the dataset.
B. Create a clustering model.
C. Split the data into training and testing datasets.
D. Create a classification model.
A. Load the dataset.
DISCUSSION:
The first step in building a machine learning model is to load the data that will be used to train and test the model. Without loading the data, none of the subsequent steps (creating a model, splitting the data) are possible. Clustering is also not appropriate for this task, as it is an unsupervised learning technique, and classification is a supervised learning technique.
You are processing photos of runners in a race.
You need to read the numbers on the runners’ shirts to identify the runners in the photos.
Which type of computer vision should you use?
A. facial recognition
B. optical character recognition (OCR)
C. image classification
D. object detection
B. Optical character recognition (OCR) is the correct choice because it is specifically designed to identify and extract text (in this case, numbers) from images.
A is incorrect because facial recognition identifies faces, not numbers.
C is incorrect because image classification categorizes the entire image (e.g., “race photo”) but doesn’t extract specific information.
D is incorrect because object detection identifies the presence and location of objects (e.g., “runner”, “shirt”) but doesn’t read the text on them.
Which Computer Vision feature can you use to generate automatic captions for digital photographs?
A. Recognize text.
B. Identify the areas of interest.
C. Detect objects.
D. Describe the images.
D. Describe the images.
Explanation:
- Correct: Option D, “Describe the images,” directly aligns with the function of generating captions, which are textual descriptions of the image content.
-
Incorrect:
- Option A, “Recognize text,” focuses on identifying text within the image, not generating a general description.
- Option B, “Identify the areas of interest,” is related to attention mechanisms but doesn’t create captions.
- Option C, “Detect objects,” identifies objects within the image, but does not necessarily create a descriptive sentence or caption.
Which service should you use to extract text, key/value pairs, and table data automatically from scanned documents?
A. Custom Vision
B. Face
C. Form Recognizer
D. Language
C. Form Recognizer
Explanation:
- Correct: Form Recognizer (now Azure AI Document Intelligence) is specifically designed to extract text, key-value pairs, and table data from scanned documents.
- Incorrect A: Custom Vision is used for image recognition tasks, not document analysis.
- Incorrect B: Face service is used for facial recognition and analysis.
- Incorrect D: Language service is used for natural language processing tasks such as sentiment analysis and language detection, not document data extraction.