Week 2 - Image Formation & Feature Representation Flashcards

(40 cards)

1
Q

What does the term Principal or Optical axis mean in regards to pinhole cameras?

A

The Z-axis is often referred to with these terms, and it assumes that the camera orientation is aligned with the Z-axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the point, where the Z-axis hits the image plane, called?

A

The principal point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How is the principal point located?

A

It is located at (0, 0, -f) from origin, where f is the focal length.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does the Focal Length relate to Field of View?

A

As focal length increases, field of view decreases i.e. zooming in on something is an example of this.
As focal length decreases, field of view increases i.e. zooming out on a scene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do you work out the coordinates of the projection on the image plane mathematically?

A

x = f(X) / Z
y = f(Y) / Z
Where X, Y, Z is the point on the object, and x, y is the same point on the image plane.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What happens to parallel lines when they ‘head towards’ the horizon?

A

Parallel lines will eventually appear to converge on a vanishing point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some problems with pinhole cameras?

A
  • Pinhole size (aperture) must be ‘very small’ to obtain a clear image. However, if pinhole size is made smaller, then less light is received by the image plane.
  • If pinhole is comparable to wavelength of incoming light, then diffraction effects blur the image
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you ensure that the pinhole camera captures the sharpest image possible?

A

Diameter of pinhole camera = 2 square root of (f * wavelength of light)
Example:
If f = 50mm, and wavelength = 600nm, then diameter = ~0.35mm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are some advantages in using the pinhole camera?

A
  • Simple to understand
  • Infinite depth of field
  • No lens distortion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are lenses used for?

A

Lenses are used to avoid the problems of using the pinhole camera, by capturing more light from the image plane, but retaining the same projection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the term ‘f’ stand for in terms of lenses?

A

f = Focal length of the lens, which determines the lens’s ability to bend/refract light

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is intensity measured numerically?

A

Intensity = 0 if pixel is black
Intensity = 255 if pixel is white in an 8-bit image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What factors affect the colour of a pixel in an image?

A

Light sources:
- Emittance spectrum
- Geometry
- Directional attenuation

Objects’ surface properties:
- Reflectance spectrum
- Geometry
- Absorption

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are some typical use cases for Image Feature Representation?

A
  • Image alignment
  • 3D reconstruction
  • Motion tracking
  • Object/face recognition
  • Indexing and database retrieval
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are image features?

A

A feature is a measurable property that describes the characteristics of an image or a region of images

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How are image features often represented?

A

Often represented by scalars, vectors, matrices or tensors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What factors make image matching hard to perform?

A
  • Change in lighting
  • Change in viewpoint
  • Occlusions
  • Partial matching
  • Change over time
18
Q

What are image regions & patches?

A

Image regions & patches are segments or rectangular image patches that are used to collect a wider area of information from an image

19
Q

What is a feature vector?

A

A feature vector defines a set of descriptive features and concatenates them to produce a feature vector

20
Q

What is the idea behind using feature vectors?

A

The idea is to remove redundant or irrelevant data

21
Q

What makes histograms a good representation of colour?

A
  • Invariant to translation and rotation
  • Change slowly as viewing direction changes
  • Change slowly with object size
  • Change slowly with occlusion
22
Q

What are texture features?

A

Texture features measure the frequency with which patterns of colour/grey levels appear

23
Q

What are gradient-based features?

A

They are areas in an image which typically indicate boundaries of objects due to a spike in intensity i.e. intensity gradient.

24
Q

How do you estimate gradients using spatial filtering?

A

You take your source pixel, and the area surrounding it, and apply a convolution kernel on top of the area. You then multiply the value at each pixel corresponding with the value in the kernel, before summing all the results together to form the final source pixel value.

25
What is the formal definition of noise?
Small random bits of data added or taken away from the true value
26
How does a mean filter remove noise?
Move a kernel across the image and calculate a new pixel value based on the average of its surrounding neighbours.
27
How does a Gaussian filter work?
It works almost like a mean filter, except it adjusts the kernel to use a weighted average. The weighted average is stronger towards the centre of the area.
28
What is the formal way of finding edges?
Edges are found through the use of difference filtering in order to pick out the areas of high contrast.
29
How does edge detection work?
Edge detection works by looking for sharp changes in intensity
30
How does a Histogram of Oriented Gradients work?
- Divide the patch into smaller cells (8x8 pixels) - Define slightly larger blocks, covering several cells (2x2 cells) - Compute gradient magnitude and orientation at each pixel - Compute a local weighted histogram of gradient orientations for each cell, weighting by some function of magnitude - Concatenate histogram entries to form a HoG vector for each block - Normalise vector values by dividing some function of vector length
31
Why is invariance important?
Invariance dictates that similar results should be produced even if the conditions vary, such as scale, translation, rotation and illumination changes
32
How does Scale Invariance work?
- Find points whose surrounding patches (at some scale) are distinctive - Convolution with a Gaussian mask gives some idea of what is going on around a pixel - Gaussian masks have a natural scale: Their standard deviation
33
What are some key properties of SIFT?
- Fast and Efficient, can run in real time - Can handle: Changes in viewpoint, significant changes in illumination
34
What is Clipping in the sense of Brightness?
Clipping occurs when the pixels are too bright to be correctly recorded in the numeric range available.
35
What is the formal definition of Shutter Speed?
Shutter Speed defines how long the light is allowed onto the film/sensor for
36
How does Shutter Speed, Aperture and Gain relate?
If one goes up, then you can effectively maintain the same brightness level by decreasing the others. However, that does cause other adverse effects e.g. Depth of Field
37
What happens when you increase the Aperture?
A larger aperture means more light, but it also reduces the depth of field
38
What happens when you have a longer shutter speed?
A longer shutter speed means more motion blur
39
What makes a good scientific image for CV?
- Underexposed brightness, prevents clipping, although it introduces more noise - Centre the photo with a simple background - Record calibration target for colour balance - Optimise other settings for increased image clarity
40
What is the general consensus when collecting ranges of images for scientific analysis?
It is often cheaper and faster in the long run to spend a while making sure the images you capture are captured well and stored correctly.