NLP Flashcards

1
Q

“Lemmatization is more complex than stemming.” Justify this statement.

A

Lemmatization takes longer to execute than stemming. It finds the
exact word meaning. Hence it is complex.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What will be the output of “bodies” in stemming and lemmatization?

A

a) The output of the word bodies after stemming will be – bodi
b) The output of the word bodies after Lemmatization will be – body

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do you mean by syntax? Explain in detail.

A

The grammatical structure of a sentence. The grammatical structure contain nouns, verbs, adverbs, adjectives, and some rules to prepare a structure. Another part of grammatical structure is part of speech.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

. Also give examples for:
1. Perfect Syntax, no meaning

A

Perfect Syntax, no meaning – The sentence which is grammatically correct but it does
not make any sense. In human language, a perfect balance of syntax and semantics is
important for better understanding. Human is communication is complex. For example,
Chcikens feed extravagantly while the moon drinks tea.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

examples for
Multiple meaning of a word

A

Multiple meaning of a word – In Natural language it is important to understand that a
word can have multiple meanings and the meanings fit into the sentence according to the
context of it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

TF definition

A

Term Frequency refers to the frequency of words in one document.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Applications of NLP

A

1) Automatic summarisation

2) Sentiment Analysis

3) Text classification

4) Virtual Assistants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Script bot-

A

Handling Conversations Eg.customer care chat bots in companies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Smart bots

A

along with handling conversation Eg. Alexa, Cortana, Siri

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Human Vs. Computer Language

A

Humans communicate through language which we process all the time. Our brain keeps on processing the sounds that it hears around itself and tries to make sense out of them all the time.

On the other hand, the computer understands the language of numbers. Everything that is sent to the machine has to be converted to numbers. And while typing, if a single mistake is made, the computer throws an error and does not process that part. The communications made by the machines are very basic and simple.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The challenges in Natural Language processing are:

A

1) Ambiguity in sentences

2) Handling Emotions

3) Handling multiple meanings of same word

4) Handling Syntax ( Grammar)-

5)Handling emotions

6) Handling sentences with perfect syntax, no meaning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

TF-IDF score

A

The TF- IDF score help the computer to understand the importance of words while processing the NLP.

The higher the value , the more valuable the word is for a given corpus.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Applications of TF- IDF

A

a)Document Classification b)Topic modelling c)Information Retrieval system d) Stop Word filtering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

define corpus

A

A corpus is a large and structured set of machine-readable texts that have been produced in a natural communicative setting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is meant by a dictionary in NLP?

A

Dictionary in NLP means a list of all the unique words occurring in the corpus. If some words are repeated in different documents, they are all written just once as while creating the dictionary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a document vector table?

A

Document Vector Table is a table containing the frequency of each word of the vocabulary in each document.

17
Q

What do you mean by document vectors?

A

Document Vector contains the frequency of each word of the vocabulary in a particular document.

18
Q

What is the significance of converting the text into a common case?

A

In Text Normalization, we undergo several steps to normalizethe text to a lower level.After the removal of stop words, we convert the whole text into a similar case, preferably lower case. This ensures that the case-sensitivity of the machine does not consider same words as different just because of different cases.

19
Q

What is the need of text normalizationin NLP?

A

Text Normalizationhelps in cleaning up the textual data in such a way that it comes down to a level where its complexity is lower than the actual data.

20
Q

What are stop words?

A

“Stop words” are the most common words in a language like “the”, “a”, “on”, “is”, “all”.