Week 4 - 3D Computer Vision Flashcards

(32 cards)

1
Q

What is the fundamental problem with trying to recover the 3D structure of the scene from points matched between two images?

A

Fundamental ambiguity - Any point on the ray OP projects to the same image location, called P.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Stereo Correspondence?

A

Find matching pixels/features in 2 or more images and convert their 2D positions into 3D depths

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What can resolve the fundamental ambiguity in stereo correspondence?

A

A second camera can resolve the ambiguity enabling measurement via triangulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do you achieve depth recovery using two cameras?

A

You use triangulation, which requires:
- Knowledge of absolute and relative camera geometry i.e. Calibration
- Point correspondence i.e. which rays to intersect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the properties of camera calibration?

A

It recovers the intrinsic parameters of the cameras e.g. focal length, pixel size, principal point, lens distortion
Relative poses between cameras, also called extrinsic parameters, are also factored in e.g. rotation, translation, scale that transforms left image on to right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the easiest way to perform camera calibration?

A

Simplest approach is to use a known calibration target object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What additional geometric distortions are present within lenses used in cameras?

A
  • Decentering errors: Displacement of the lens centre from optical axes
  • Radial distortion: Variations in light refractions, mostly in wide angle lenses
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the image warping parameter?

A

Image warping parameter is estimated to warp the ideal projected coordinate to the distorted coordinate. K contains warping parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the equation for image warping in stereo correspondence?

A

x’ = warp(x, k)
Where:
- x = Ideal image (no distortion)
- x’ = Observed image with distortion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How are points generally defined in 3D space in stereo correspondence?

A

Points in the 3D space are expressed in terms of a different coordinate frame known as the world coordinate frame. The relation given between the coordinates of P in camera and world coordinate system are given by: Xcam = R(Xw - c)
Where:
- c = 3x1 vector representing the coordinates of the camera in the world coordinate system
- R = 3x3 matrix representing the orientation of the camera

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of camera calibration in a mathematical sense?

A

It is to calculate the intrinsic, extrinsic and distortion parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is epipolar geometry?

A

Given the two optical centres and a point in one image, we can compute the epipolar plane and so the corresponding epipolar line in the other image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why is epipolar geometry important for camera calibration?

A

Given two calibrated cameras, it’s possible to retrieve the actual 3D coordinate of a corner in the image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can correspondence be used to measure depth of an object in an image?

A

Correspondence allows measurement of disparity: The difference in the image coordinates of the projections of a given world point into each camera.
Depth is inversely proportional to disparity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does correspondence search work?

A
  • Find a window in the original image
  • Slide it along the right scanline and compare the content of that window with that of the reference window in the original image
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the effect of a window size in correspondence search?

A
  • Larger window size: Smooth disparity maps but less detail captured
  • Smaller window size: More detail, but also more noise captured
17
Q

What are some problems with correlation-based stereo?

A
  • Window size is fixed across the image, but viewed objects differ in size and depth
  • Uniform regions always match
  • Can provide a dense disparity map, but values are only reliable where there is some local variation in intensity e.g. near edges
  • Dense disparity is computationally expensive in spatial domain
18
Q

How do you get ground truth data?

A
  • Alternative/competing sensors
  • Artificial images
  • Real images
19
Q

What are some problems that can be encountered when gathering ground truth data?

A
  • Automatic methods can have errors
  • Manual methods are slow, subjective and also error prone
  • What if standard sets don’t have the properties you are attempting to evaluate your images on?
20
Q

What is True Positive defined as?

A

True Positive - The algorithm makes a correct prediction about the presence of an object in an image

21
Q

What is False Positive defined as?

A

The algorithm predicts the presence of an object but that object is not present in the image

22
Q

What is False Negative defined as?

A

The algorithm misses an object

23
Q

What is Precision’s equation, and how is it defined?

A

Precision = TP/(TP + FP)
Fraction of responses that were correct

24
Q

What is Recall’s equation, and how is it defined?

A

Recall = TP/(TP + FN)
Fraction of correct classifications that were identified

25
How do you evaluate recognition?
Using Precision-Recall curves as a visualisation tool
26
What are some properties of Precision-Recall curves?
- Plot of precision against recall as some parameter is varied - Parameter is the threshold used to decide if the model and image are similar enough to be considered equal - Increasing threshold imposes a tighter requirement on matching, which reduces False Positives, but increases False Negatives
27
How would you measure accuracy in a classification problem?
For classification, you can use a confusion matrix, which shows what category images are confused with others
28
How is ground truth defined?
Ground truth is a set of manually-drawn bounding boxes on an image
29
What are two key elements of measurement and recognition?
- Accuracy - Robustness
30
What does a Precision Plot measure?
A Precision Plot measures the percentage of frames whose estimated location is within a given threshold distance of the ground truth.
31
When using algorithms that learn, what properties must it fulfill?
- Must be representative of the data - Must not be too specific - Must not use training data in the evaluation of performance
32
What does the term 'confining research leftward' mean in terms of Disparity Maps?
When you know which image is the left image and which image is the right image, then you can make safe estimations such as a point x in the left image could never be further to the right in the right image, thereby narrowing the search space.