stereo Flashcards

(27 cards)

1
Q

What is stereo vision?

A

A technique for estimating 3D structure by comparing two images taken from different viewpoints.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is disparity in stereo vision?

A

The horizontal pixel shift between corresponding points in the left and right images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does disparity allow us to compute?

A

Depth, using the formula Z = (focal length × baseline) / disparity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is sparse stereo matching?

A

Matching a small number of feature points to compute a sparse 3D reconstruction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is dense stereo matching?

A

Computing disparities for nearly every pixel to generate a full depth map.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is image rectification important for stereo?

A

It aligns image rows so that corresponding points lie on the same horizontal line, simplifying matching.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the goal of window-based matching?

A

To find the best matching pixel in the right image for each pixel in the left image by comparing patches.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the most common matching cost in stereo vision?

A

Sum of Absolute Differences (SAD).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the SAD formula in stereo?

A

SAD = ∑ |I_L(x,y) - I_R(x-d,y)| over a local window.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are challenges for SAD matching?

A

Lighting variation, occlusion, textureless regions, and repetitive patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is scanline stereo?

A

An approach that aligns entire rows (scanlines) using dynamic programming to handle occlusions and improve consistency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is dynamic programming used for in stereo?

A

To find an optimal sequence of pixel matches across a row, treating the problem like path alignment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does a diagonal move in the scanline DP grid represent?

A

A pixel match between the left and right images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What do horizontal or vertical moves in the DP grid represent?

A

Occlusions or unmatched pixels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a major limitation of scanline stereo?

A

It processes each row independently, leading to horizontal streaking artifacts and lack of 2D smoothness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Semi-Global Matching (SGM)?

A

A stereo matching method that aggregates matching costs over multiple directions to approximate global optimization.

17
Q

Who developed SGM?

A

Heiko Hirschmüller.

18
Q

What is the key idea behind SGM?

A

To balance data fidelity and smoothness by combining local matching costs along multiple paths through the image.

19
Q

How does SGM handle occlusions and noise?

A

By penalizing disparity changes between neighboring pixels, promoting consistency.

20
Q

What are typical window sizes used in SGM?

A

Small windows, such as 3×3 or 5×5, to preserve detail while maintaining robustness.

21
Q

What kind of output does SGM produce?

A

A dense disparity map that can be converted into a depth map or point cloud.

22
Q

What are the main advantages of SGM?

A

Accurate, robust to noise and occlusion, and efficient enough for real-time or near real-time use.

23
Q

What is the main input assumption for dense stereo algorithms?

A

That images are rectified so that epipolar lines are horizontal.

24
Q

What is the difference between sparse and dense stereo?

A

Sparse stereo matches a few keypoints; dense stereo attempts to match most or all pixels.

25
What is the triangulation formula for depth from disparity?
Z = (f × B) / d, where f = focal length, B = baseline, d = disparity.
26
What causes ambiguity in pixel-wise stereo matching?
Textureless regions, occlusions, and repetitive patterns.
27
What is a real-world application of SGM?
Depth estimation in stereo cameras (e.g., Intel RealSense), autonomous driving, and UAV terrain mapping.