Lecture 9: NLP Flashcards

1
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

NLP VS Speech Processing

A

Natural Language Processing
= automatic processing of written texts
1.
Natural Language Understanding

Input = text
2.
Natural Language Generation

Output = text

Speech Processing
= automatic processing of speech
1.
Speech Recognition

Input = acoustic signal
2.
Speech Synthesis

Output = acoustic signal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is BOW model

A

A simple model where word order is ignored

used in many applications:

NB spam filter seen in class a few weeks ago

Information Retrieval (eg. google search)



But has severe limits to understand meaning of text…

Maybe we should take word order into account…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

N-gram model

A

An n-gram model is a probability distribution over sequences of events (grams/units/items)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why onnly bigram or trigram?

A

Markov approximation is still costly
with a 20 000 word vocabulary:

bigram needs to store 400 million parameters

trigram needs to store 8 trillion parameters

using a language model > trigram is impractical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How would one recognize language?

A
  • train a bigram of which characters are followed up by which
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

problem with n-gram

A

Natural language is not linear ….

there may be long-distance dependencies.

Syntactic dependencies

The man next to the large oak tree near … is tall.

The men next to the large oak tree near … are tall.

Semantic dependencies

The bird next to the large oak tree near … flies rapidly.

The man next to the large oak tree near … talks rapidly.

World knowledge

Michael Jackson, who was featured in …, is buried in California.

Michael Bublé, who was featured in …, is living in California.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly