Video, Motion and MPEG Flashcards
(15 cards)
Describe in bullet points, the process of Three-Steps Block Matching
1.Start with search location at center of image.
- Set step size S=4 and search parameter p=7
- Search 8 locations +/- S pixels around location (0,0) and the location (0,0)
- Pick amount of the 9 locations searched, the one with minimum cost function.
- Set the new search origin to the above picked location.
- Set the new step size as S=S/2
- Repeat the search procedure until S=1
What are the pros and cons of Gradient-Based motion estimation with respect to Full Search Block Matching? In particular, discuss how they are affected by the aperture problem.
Advantages:
1. A sub-pixelic accuracy can be obtained with a few iterations. No need to make the full (2w+1)^2 evaluations of the full search BM.
Disadvantage:
1. The Taylor Series expansion is only valid over small distances.
Explain where motion estimation is likely to fail or succeed in this frame.
- Fast motion (size of motion bigger than the search window)
- occlusions (object not visible in the other frame)
- non-rigid objects (hair, flames)
- motion blur (object is moving too fast and appears blurred)
What is, in the context of video compression, a Group of Pictures? Name its different components.
A group of pictures, or GOP structure, specifies the order in which these frames types are arranged in a batch of frames.
Different components:
1. B = bi-directional frames
2. P = predictive frames
3. I = intra-coded frames
Explain in bullet points the process of Gradient-Based Iterations for motion estimation.
- Precomputing the gradients using for instance simple forward difference as approximation.
What are the pros and cons of this Gradient-Based method over the Full Search Block Matching?
Advantages:
1. A sub-pixelic accuracy can be obtained with a few iterations. No need to make the full (2w+1)^2 evaluations of the full search BM.
Disadvantage:
1. The Taylor Series expansion is only valid over small distances.
Explain how motion compensation is used to encode B-frames.
- Have decoded forward frame In+j and backward frame In.
- To reconstruct In, have 2 DFDs. (DFDb DFDf)
- Keep the vector that gives the best prediction.
Explain what a GOP in the context of MPEG and specify the types of frames in MPEG.
A group of pictures, or GOP structure, specifies the order in which these frames types are arranged in a batch of frames.
- Bi-directional frames.
- Predictive frames
- Intracoded frames
How could the GOP structure be optimised for the following?
(a) video editing application, where random access time is critical for interactivity
(b) video surveillance (static camera)
(c) video streaming.
Name and explain 4 types of problems that can affect motion estimation.
- Occlusion- object is not visible in the other frame.
- Non-rigid objects - hair, flames.
- Fast motion - The size of the motion bigger than the search motion.
- Motion blur - The object is moving too fast and appears blurred.
Why is full search block based matching?
Too computationally heavy
What is gradient-based motion?
Gradient-Based motion estimation a class of algos that allows us to efficiently find the optimal a displacement, provided that the displacement is small.
Define parameters ‘block size’ and ‘search radius’. Describe how the choice of these parameters affects the performance of block matching.
State how the use of multiresolution approach improves the computational efficienecy of block matching compared with a conventional exhaustive search.