04-Natural Processing Language Flashcards

1
Q

What is Natural Processing Language

A

Area of AI that deals with creating software that understands written and spoken language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What software can Natural Processing Language allow you to create

A
  1. Analyze text to extract key phrases and recognize entities (such as places, dates, or people)
  2. Perform sentiment analysis to determine how positive or negative the language used in a document is
  3. Interpret spoken language, and synthesize speech responses
  4. Automatically translate spoken or written phrases between languages
  5. Interpret commands and determine appropriate actions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which cognitive services build natural language processing solutions

A
  1. Text analytics
  2. Translator text
  3. Speech
  4. Language Understanding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Text analytics

A

Analyzes text and extracts key phrases, detects entities (such as places, dates, and people), and evaluate sentiment (how positive or negative a document is)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Translator text

A

Translate between more than 60 languages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Speech service

A

RECOGNIZE and SYNTHESIZE speech, and TRANSLATE spoken languages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Language Understanding

A

Train a language model that can understand spoken or text-based commands

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What will Language detection detect

A
  1. Language name, i.e. “English”
  2. ISO 6391 language code, i.e. “en”
  3. Score indicating level of confidence in the language detection
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What score is assigned to text that is ambiguous or has mixed language content

A

NaN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does sentiment analysis evaluate text and return sentiment scores and labels

A

Uses pre-built ML classification model. Returns sentiment score in range of 0 to 1, which 1 being most positive and 0 being most negative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What score will sentiment analysis return if you pass French text but tell the service the language code is en for English

A

Service will return a score of precisely 0.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Key Phrase Extraction

A

Evaluate text then identify main points around the context.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Entity Recognition

A

Item of a particular type or category, and in same cases, subtype, Person, Location, Organization, Quantity, etc.

Service also supports entity linking to help disambiguate entities by linking to a specific reference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Speech recognition

A

Ability to detect and interpret spoken input

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Speech synthesis

A

Ability to generate spoken output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What APIs are offered through the Speech cognitive service

A
  1. Speech-to-Text

2. Text-to-Speech

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What can you use speech-to-text for

A

Perform real-time or batch transcription of audio into text format. Audio source can be real-time or an audio file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What model is used by the speech-to-text API

A

Model based on Universal Language Model that was trained by Microsoft.
Optimized for conversational and dictation. User can also create and train their own custom model including acoustics, language, and pronunciation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is real-time transcription

A

Used for presentation, demos, or any other scenario where person is speaking.

Application needs to listen for incoming audio from a microphone or other audio input source such as audio file.

Application code streams the audio to the service, which returns the transcribed text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is batch transcription

A

Previously recorded and stored audio files are transcripted.

Can have audio stored. You can point to audio files with shared access signature (SAS) URI and asynchronously receive transcription results.

Run in asynchronous manner because batch jobs are scheduled on best-effort bases. Job will execute within minutes of request, but no estimate for when a job changes into the running state

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is text-to-speech API

A

Enables you to convert TEXT input to audible SPEECH, which can either be played directly through computer speaker or written to an audio file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is Speech synthesis voices

A

Pre-defined voices with support for multiple languages and regional pronunciation.

Includes standard voices as well as neural voices that leverage neural networks to overcome common limitations in speech synthesis with regard to intonation, resulting in ore natural sounding voice.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is a neural network

A

A bunch of algorithms that try to recognize a relationship in a set of data . Tries to imitate the human brain in order to recognize a relationship in a set of data.. In this way the neural networks refer to systems of neurons, either organic or artificial in nature.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is machine translation

A

Automated translation to convert one language to another. This enables collaboration with people of other cultures and geographic locations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is literal translation

A

Each word is translated to the corresponding word in the target language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is semantic

A

Relating to meaning in language or logic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is text translation used for

A

Used to translate documents from one language to another, translate email communications that come from foreign governments, and even provide the ability to translate web pages on the Internet

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is speech translation used for

A

Used to translate between spoken languages, sometimes directly (speech-to-speech translation) and sometimes by translating to an intermediary text format (speech-to-text translation)

29
Q

What Translation services does Azure provide

A
  1. Translator Text - text-to-text translation

2. Speech - speech-to-text and speech-to-speech

30
Q

What service does Translator Text use

A

Uses a Neural Machine Translation (NMT) model for translation, which analyzes the semantic context of the text and renders a more accurate and complete translation as a result

31
Q

How do you specify language you are translating from and to in the Text Translator service

A

Use ISO 639-1 language code, i.e. en for English, fr for French.

32
Q

How do you specify cultural variant of language

A

Use 3166-1 cultural code, i.e. en-US for US English, en-GM for British English.

33
Q

What are optional configurations for Translator Text API

A
  1. Profanity filtering

2. Selective translation - tag content so it isn’t translated

34
Q

What APIs does Speech service include

A
  1. Speech-to-text
  2. Text-to-speech
  3. Speech translation
35
Q

What are 3 core concepts of Language Understanding

A
  1. Utterances
  2. Entities
  3. Intents
36
Q

What are utterances

A

What a user might say, i.e. “Switch the fan on”

37
Q

What are entities

A

An item to which an utterance refers, i.e. “Switch the FAN on”

38
Q

What are intents

A

Purpose, or goal, expressed in a user’s utterance, i.e. “TurnOn” for “Switch the FAN on”

39
Q

When should you use the None intent

A

To handle utterances that do not map any of the utterances you have created. Considered a fallback, and used to provide generic response to users when their requests don’t match any other intent.

40
Q

What two main tasks are involved with creating a language understanding application

A
  1. Define entities, intents, and utterances with which to train the language model. Referred to as AUTHORING the model.
  2. Publish the model so the client applications can use it for intent and entity PREDICTION based on user input
41
Q

What is the Language Understanding portal

A

Web-based interface for creating and managing Language Understanding applications

42
Q

What are 4 types of entities

A
  1. Machine Learned - entities that are learned by your model during training from context in the sample utterances you provide
  2. List - entities that are defined as a hierarchy of lists and sublists, i.e device list might include sublists for light and fan. For each list entry, specify synonyms such as lamp for light
  3. RegEx - regular expression that describes a pattern, i.e. [0-9]{3}-[0-9]{3}-[0-9]{4} in the form 555-123-4567
  4. Pattern.any - entities that are used with patterns to define complex entities that may be hard to extract from sample utterances
43
Q

What is training the model

A

The process of using sample utterances to teach your model to match natural language expressions that a user might say to probable intents and entities.
Training and testing is an iterative process.

44
Q

What is predicting

A

After training and testing, publish Language Understanding application to prediction resource.
Predictions are returned to client application

45
Q

What are some techniques to build software to analyze text

A
  1. Statistical analysis
  2. Extending frequency analysis to multi-term phrases
  3. Apply stemming or lemmatization algorithms
  4. Apply linguistic structure rules to analyze sentences
  5. Encode words or terms as numeric features that can be used to train a ML model
  6. Create vectorized models to capture semantic relationships
46
Q

What is statistical analysis of terms used in text

A

Remove common “stop words”’, i.e. “the” or “a”. Perform FREQUENCY ANALYSIS of the remaining words (how often each word appears). This provides clues about the main subject of the text.

47
Q

What does extending frequency analysis to multi-term phrases involve.

A

These are known as N-grams (a two-word phrase is a bi-gram, a three-word phrase is a tri-gram, etc).

Analyze the frequency analysis to such words.

48
Q

What is stemming or lemmatization algorithms

A

These NORMALIZE WORDS before counting them, i.e. “power”, “powered”, and “powerful” are interpreted as being the same word.

49
Q

What does applying linguistic structure rules to analyze sentences involve

A

For example, break down sentence into tree-like structure such as noun phrase, which itself contains nouns, verbs, adjectives, and so on

50
Q

What does encoding words or terms as numeric features to train ML model involve

A

This technique is often used to perform SENTIMENT ANALYSIS, in which a document is classified as positive or negative.

For example, classify a text document based on the terms it contains.

51
Q

What does creating vectorized models to capture semantic relationship between words involve

A

Capture semantic relationship between words by assigning them to locations in n-dimensional space.
For example, assign values to words “flower” and “plant” that locate them close to one another, while “skateboard” might be given a value that positions it much further away

52
Q

How does Text Analytics cognitive service simplify application development

A

IT uses pre-trained models that can

  1. Determine the language of a document or text (i.e. French or English)
  2. Perform sentiment analysis from text that might indicate its main talking points.
  3. Extract key phrases from text that might indicate its main talking points
  4. Identify and categorize entities in the text. Entities can be people, places, organizations, or even everyday items such as dates, times, quantities, etc
53
Q

How can Text Analytics be applied

A
  1. Social media feed analyzer to detect sentiment around a political campaign or a product in market
  2. Document search application that extracts key phrases to help summarize the main subject matter of documents in a catalog.
  3. Extract brand information or company names from documents or other text for identification purposes
54
Q

What will Language detection capability of Text Analytics return

A
  1. Language name, i.e. “English”
  2. ISO 6391 language code, i.e. “en”
  3. Score indicating level of confidence n the language detection
55
Q

For AI to accept vocal commands and provide spoken resources, what capabilities must it support

A
  1. Speech recognition - ability to detect and interpret spoken input
  2. Speech synthesis - ability to generate spoken output
56
Q

How can software analyze speech patterns to determine recognizable patterns that are mapped to words

A

Uses

  1. An acoustic model that converts audio signal into phonemes (representations of specific sounds)
  2. A language model that maps phonemes to words, usually using a statistical algorithm that predicts the most probable sequence of words based on phonemes
57
Q

What are phonemes

A

Representations of specific sounds

A phoneme is the smallest unit of sound in speech. When we teach reading we teach children which letters represent those sounds. For example – the word ‘hat’ has 3 phonemes – ‘h’ ‘a’ and ‘t’.

58
Q

What are recognized words that are typically converted to text used for

A
  1. Provide closed captions for recorded or live videos
  2. Create transcript of a phone call or meeting
  3. Automated note dictation
  4. Determine intended user input for further processing
59
Q

How is speech synthesis the reverse of speech recognition

A

Concerned with vocalizing data, usually by converting text to speech. Speech synthesis solution typically requires

  1. Text to be spoke
  2. Void to be used to vocalize the speech
60
Q

How does a system synthesize speech

A
  1. Tokenizes the text to break it into individual words
  2. Assigns phonetic sounds to each words
  3. Breaks the phonetic transcription into prosodic units (such as phrases, clauses, or sentences) to create phonemes that will be converted to audio format.
  4. Phonemes are then synthesized by audio by applying a voice, which will determine parameters such as pitch and timbre; and generating an audio wave form that can be output to a speaker or written to a file
61
Q

What are prosodic units

A

Phrases, clauses, or sentences

62
Q

How can you use output of speech synthesis

A
  1. Generate spoken responses to user input
  2. Create voice menus for telephone systems
  3. Read email or text messages aloud in hands-free scenarios
  4. Broadcast announcements in public locations, such as railway stations or airports
63
Q

What supports phrase extraction

A

Cognitive Services

64
Q

What does Translator Text API offer to fine-tune results

A
  1. Profanity Filtering

2. Selective translation

65
Q

What does Speech service APIs include

A
  1. Speech-to-text
  2. Text-to-speech
  3. Speech Translation
66
Q

Can Speech service translate from audio source to text

A

Yes, it can

67
Q

What does Translator Text Support

A

Text-to-text translation

68
Q

What does Speech service support

A

Enables speech-to-text and speech-to-speech translation