stereo Flashcards by ROWAN Gomanee

What is stereo vision?

A technique for estimating 3D structure by comparing two images taken from different viewpoints.

How well did you know this?

Not at all

Perfectly

What is disparity in stereo vision?

The horizontal pixel shift between corresponding points in the left and right images.

How well did you know this?

Not at all

Perfectly

What does disparity allow us to compute?

Depth, using the formula Z = (focal length × baseline) / disparity.

How well did you know this?

Not at all

Perfectly

What is sparse stereo matching?

Matching a small number of feature points to compute a sparse 3D reconstruction.

How well did you know this?

Not at all

Perfectly

What is dense stereo matching?

Computing disparities for nearly every pixel to generate a full depth map.

How well did you know this?

Not at all

Perfectly

Why is image rectification important for stereo?

It aligns image rows so that corresponding points lie on the same horizontal line, simplifying matching.

How well did you know this?

Not at all

Perfectly

What is the goal of window-based matching?

To find the best matching pixel in the right image for each pixel in the left image by comparing patches.

How well did you know this?

Not at all

Perfectly

What is the most common matching cost in stereo vision?

Sum of Absolute Differences (SAD).

How well did you know this?

Not at all

Perfectly

What is the SAD formula in stereo?

SAD = ∑ |I_L(x,y) - I_R(x-d,y)| over a local window.

How well did you know this?

Not at all

Perfectly

What are challenges for SAD matching?

Lighting variation, occlusion, textureless regions, and repetitive patterns.

How well did you know this?

Not at all

Perfectly

What is scanline stereo?

An approach that aligns entire rows (scanlines) using dynamic programming to handle occlusions and improve consistency.

How well did you know this?

Not at all

Perfectly

What is dynamic programming used for in stereo?

To find an optimal sequence of pixel matches across a row, treating the problem like path alignment.

How well did you know this?

Not at all

Perfectly

What does a diagonal move in the scanline DP grid represent?

A pixel match between the left and right images.

How well did you know this?

Not at all

Perfectly

What do horizontal or vertical moves in the DP grid represent?

Occlusions or unmatched pixels.

How well did you know this?

Not at all

Perfectly

What is a major limitation of scanline stereo?

It processes each row independently, leading to horizontal streaking artifacts and lack of 2D smoothness.

How well did you know this?

Not at all

Perfectly

What is Semi-Global Matching (SGM)?

Study These Flashcards

A stereo matching method that aggregates matching costs over multiple directions to approximate global optimization.

Who developed SGM?

Study These Flashcards

Heiko Hirschmüller.

What is the key idea behind SGM?

Study These Flashcards

To balance data fidelity and smoothness by combining local matching costs along multiple paths through the image.

How does SGM handle occlusions and noise?

Study These Flashcards

By penalizing disparity changes between neighboring pixels, promoting consistency.

What are typical window sizes used in SGM?

Study These Flashcards

Small windows, such as 3×3 or 5×5, to preserve detail while maintaining robustness.

What kind of output does SGM produce?

Study These Flashcards

A dense disparity map that can be converted into a depth map or point cloud.

What are the main advantages of SGM?

Study These Flashcards

Accurate, robust to noise and occlusion, and efficient enough for real-time or near real-time use.

What is the main input assumption for dense stereo algorithms?

Study These Flashcards

That images are rectified so that epipolar lines are horizontal.

What is the difference between sparse and dense stereo?

Study These Flashcards

Sparse stereo matches a few keypoints; dense stereo attempts to match most or all pixels.

What is the triangulation formula for depth from disparity?

Z = (f × B) / d, where f = focal length, B = baseline, d = disparity.

What causes ambiguity in pixel-wise stereo matching?

Textureless regions, occlusions, and repetitive patterns.

What is a real-world application of SGM?

Depth estimation in stereo cameras (e.g., Intel RealSense), autonomous driving, and UAV terrain mapping.

stereo Flashcards

(27 cards)