SR Flashcards

1
Q

What is Speech Recognition?

A

is the interdisciplinary subfield of computational linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

ASR

A

automatic speech recognition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

STT

A

speech to text

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Speech Signal:

A

Amplitude/Time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Fundamental problem
1 Given:
2 Wanted:
3 Search:

A

1: an observation (ADC ,FFT) X = x1, x2, … , xT
2: the corresponding word sequence W = w1, w2, … , wm
3: the most likely word sequence W’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

W’

A

= arg max(w) P(W|X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

P( W|X )

A

p( X|W ) * P( W ) / p( X )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

P( X|W ) The acoustic modeI

A

how likely is it to observe X when W is spoken

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

P( W ) The language model

A

how likely is it W is spoken -priori

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is X ?

A

The Problem of Pre-Processing (Vorverarbeitung)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is p( X|W ) ?

A

The Problem of Acoustic Modelling (Akustische Modellierung)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is P( W ) ?

A

The problem of Language Modelling (Sprachmodellierung)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we find argmax W?

A

The Search problem (Suche)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly