Chapter 5 - Perceptual Coding Flashcards

1
Q

What is perceptual coding?

A

Reducing the quantity of data used to represent digital audio, reducing the file size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name two approaches to perceptual coding.

A
  • Data compression

- Data reduction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which one is lossless?

  • Data Compression
  • Data reduction
A

Data Compression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Data Compression?

A

Reducing file size in a lossless manner, without removing any data. This is achieved using Entropy/ Huffman coding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a common entropy code?

A

Morse code.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is morse code used to reduce file size?

A

Morse code requires less data than binary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Data Reduction?

A

Reducing file size by removing data in a manner that is unheard.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does Data Reduction work?

A

Data is removed according to how we perceive sound. In order to identify what data can be safely removed, a perceptual coder must compare the audio against a psychoacoustic model of our hearing system.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the membrane in the ear that has hair cells on it called?

A

Basilar membrane

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

In the ear, the cells on the membrane respond to different frequencies depending on where they are located along the membrane. Where are high and low frequencies sensed?

A

High frequencies are sensed closer to the middle ear (outside). Low frequencies at the far end (Inner).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define the threshold of hearing.

A

The minimum level at which the human ear can hear a tone.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain Masking.

A

When a tone sounds, there is a theoretical lifting of the minimum audio threshold in the local frequency range around the tone. if there is another tone nearby in frequency and slightly softer, it could be masked by the louder tone.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Masking occurs only if the tones are in the same _______.

A

Critical band

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

True or false.

Masking is more effective when a lower frequency tone masks a higher frequency one.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is amplitude/ simultaneous masking?

A

Masking that takes place when two tones are sounded simultaneously.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is temporal masking?

A

Masking that takes place when tones are sounded close in time, but not simultaneously.

17
Q

Name the two types of Temporal masking.

A

Pre-masking and post-masking

18
Q

True or False?

Pre-masking occurs when a tone is masked by another tone that ends before the masked signal begins.

A

False

19
Q

Name two ways you can reduce the data rate in a data reduction system.

A
  • decrease the sample rate

- Decrease word length

20
Q

In perceptual coding, word length reduction is done _______ depending on signal conditions.

A

Dynamically

21
Q

_______ and _____________ are used to ensure that the resulting increase in Quantization noise is kept as inaudible as possible.

A

Masking and the Fletcher Munsen equal loudness contours

22
Q

What is the result of a signal being encoded multiple times.

A

Noise will be added everytime

23
Q

Name the six stages of mp3 encoding.

A
  1. An existing PCM audio stream.
  2. The audio is run through an analyzing filter where the audio is divided into 32 sub-bands.
  3. Sub-bands are grouped into frames. Encoder determines where masking is happening. This determines which frames can have a reduced bit rate.
  4. Bit allocation. The encoder determines how many bits to encode each frame with.
  5. All frames are saved as MP3 file.
  6. On playback, the sub-band frames are recorded into time-domain sections and joined up to recreate an audio stream.
24
Q

What is Joint Stereo

A

Info that is the same between channels is encoded in one channel. Info that is different is encoded in the other.