Block 2 Part 3 Flashcards

Question 1

Q

Perceptual redundancy

Answer

A

information contained in audio or visual signal that can be removed without affecting recipients experience of signal

Question 2

Q

Compression level (coding efficiency

Answer

A

this is how far you can compress a file

- there is a trade off between how far you can compress a file and keeping enough of the original signal

Question 3

Q

Permissible distortion

Answer

A

once acquired a digital source representation, need to represent it using the smallest number of bits possible for permissible distortion

Question 4

Q

Coding source into fewest possible number of bits

Answer

A

allows either lower bit rate (bandwidth) to be used for transmitting compressed data
or transmission to be completed faster

Question 5

Q

Rate distortion (RD)

Answer

A

in all source coding algorithms, relationship between compression level achieved and resulting distortion formalised by RD
every source coding algorithm has RD

Question 6

Q

Pulse code modulation (PCM)

Answer

A

digitising analogue signal normally done by PCM
analogue signal first subjected to sampling to create pulse amplitude modulation (PAM) signal
each sample assigned to one of finite number of possible discrete values in process called quantising
resulting bitstream goes through further lossless encoding to minimise final bit rate

Question 7

Q

Aliasing

Answer

A

means not enough samples taken so wave is just an alias of original
still has same shape but more spread out

Question 8

Q

Analogue-to-digital converter (ADC)

Answer

A

combined process of sampling and quantising usually performed by ADC

Question 9

Q

Quantisation noise (quantisation error)

Answer

A

difference between original and digital signals

Question 10

Q

Differential pulse-code modulation (DPCM)

Answer

A

variant of PCM that also converts source analogue signal to digital representation
able to achieve lower bit rate by including sample prediction in its coding

Question 11

Q

Advantages of DPCM over PCM

Answer

A

successive samples not very different from each other
encoder and decoder predict next sample will be same as current one
transmitted difference value is then error in prediction
difference values also known as prediction errors

Question 12

Q

MPEG-1

Answer

A

mainly used for efficient storage of moving pictures for multimedia on CD-ROM

Question 13

Q

MPEG-2

Answer

A

toolbox of optimised compression techniques for DTV systems to support both SD and HD picture resolutions

Question 14

Q

MPEG-4

Answer

A

intended to provide high compression rates, allowing for transmission of moving pictures at bit rates below 64 kbit s-1

Question 15

Q

MPEG-7

Answer

A

specifies way multimedia can be indexed, and thus searched for in variety of ways relating to specific medium

Question 16

Q

MPEG-21

Answer

A

extends this notion further by including additional digital rights management (DRM) into MPEG systems

Question 17

Q

Objective of JPEG and MPEG coding

Answer

A

removal of as much statistical and perceptual redundancy as possible, to achieve highest compression
this achieved in two stages
Spatial compression and Temporal compression

Question 18

Q

Spatial compression

Answer

A

exploits fact that in many real pictures considerable similarity (correlation) exists between neighbouring areas of image

Question 19

Q

Spatial compression - intra-frame compression

Answer

A

each individual picture able to be compressed

- basis of JPEG image compression standard

Question 20

Q

Temporal compression

Answer

A

exploits fact that in most sequences, very little changes between consecutive frames

Question 21

Q

Temporal compression - inter-frame compression

Answer

A

high correlation between frames offers further lossy compression opportunities, by removing detail without loss of quality

Question 22

Q

JPEG coding

Answer

A

de facto lossy compression standard for colour and greyscale images
though known as lossy it does have lossless mode

Question 23

Q

JPEG limitations

Answer

A

no interactive functionality, cannot compress region of interest at different bit rate from remainder of image
not optimised for either natural images or synthetic computer generated images
poor compression of compound documents containing both images and text
degraded performance in noisy channel conditions

Question 24

Q

JPEG2000

Answer

A

low-bit rate image compression standard
offers interactive, multi resolution and scalable functionality
superior coding performance with fewer visually perceptible artefacts

Question 25

Q

JPEG2000 bitstream scalability

Answer

A

image can change its representation to satisfy requirements of application or receiver

Question 26

Q

JPEG2000 discrete wavelet transform (DWT)

Answer

A

decomposes image into four sub images, each having different resolution corresponding to different frequency band
original image can be reconstructed by combining four images

Question 27

Q

Region of interest

Answer

A

if in an image you are wanting to ensure a certain area has better quality than surrounding area then JPEG2000 can be used
Lossless compression is applied to this area to ensure the image is of high standard
the rest of the image can be coded at a much lower resolution

Question 28

Q

Motion JPEG(M-JPEG)

Answer

A

allows moving images to be compressed

- uses only intra frame compression

Question 29

Q

Motion vectors

Answer

A

means the difference in frame movement between two frames in a movie or video

Question 30

Q

Motion prediction

Answer

A

idea is to predict current frame from previous frame by calculating set of MVs then determine motion prediction error
this prediction can then be compensated for at the decoder

Question 31

Q

Three main picture types supported by MPEG

Answer

A

I-frames
P-frames
B-frames

Question 32

Q

I-frames

Answer

A

(intra frame) are JPEG-coded and used as reference for random access in MPEG bitstreams
coded independently without reference to other picture types
don’t use motion vectors
achieve only low compression
used any time shot changes from one sequence to another

Question 33

Q

P-frames

Answer

A

(prediction) use motion prediction and compensation to achieve higher compression than I-frames
used as reference for both future and past predictions
don’t offer random access capability within coded bitstream

Question 34

Q

B-frames

Answer

A

(bidirectional prediction) interpolated frames between _ and P-frames in both forward and backward directions
not used as reference but fill in missing frames
provide highest compression and don’t propagate coding errors

Question 35

Q

Correcting prediction

Answer

A

find best prediction using block-matching algorithm to determine set of motion vectors
calculate prediction error between estimated and actual object positions, transmit alongside motion vectors

Question 36

Q

Group of pictures

Answer

A

used by MPEG to refer to particular combination of frames that represent sequence
always start with reference I-frame
defined by two parameter, total number of frames in GOP and number of adjacent B-frames plus one

Question 37

Q

H.264/AVC

Answer

A

supports high quality delivery of audio and video
also low bit rate IP based streaming applications
offers range of profiles

Question 38

Q

Switching P and I-frames (known as SP and SI)

Answer

A

incorporated into GOP format

- designed to support efficient switching between bitstreams

Question 39

Q

Perceptual masking

Answer

A

composition of sound can alter ear’s ability to perceive specific frequencies at specific amplitudes
two types of masking; frequency masking and temporal masking
together referred to as noise masking

Question 40

Q

Frequency masking

Answer

A

arises because of inherent property of ear

- relatively loud sound at particular frequency reduces sensitivity to neighbouring frequencies

Question 41

Q

Temporal masking

Answer

A

refers to fact perceptual hearing sensitivity to sounds in narrow frequency range reduced for short period

Question 42

Q

Speech coding methods

Answer

A

waveform encoding, process the source data using either time or frequency techniques
vocoder (voice encoder), formulate a mathematical model of voice production process that can be represented by small number of parameters

Question 43

Q

Linear predictive coding (LPC)

Answer

A

estimates key speech production parameters relating to acoustics of vocal tract for both voiced and unvoiced signals

Question 44

Q

Code-excited linear prediction(CELP)

Answer

A

not coding algorithm per se
grouping of low-bit rate speech-coding solutions that employ LPC as core compression model
constructs codebook of quantised excitation vectors, known as code words
all entered into codebook
transmits model coefficients and gain to decoder and sends index pointer to one codebook entry as best excitation

Brainscape's Knowledge GenomeTM

Block 2 Part 3 Flashcards

Brainscape's Knowledge Genome^TM