Week 2 - Image Formation & Feature Representation Flashcards by Joshua Carey-Young

What does the term Principal or Optical axis mean in regards to pinhole cameras?

The principal (or optical) axis is the Z-axis of the camera coordinate frame. The pinhole model assumes that the camera is oriented so that it looks along the Z-axis.

What is the point, where the Z-axis hits the image plane, called?

The principal point

How is the principal point located?

The principal point is located at (0, 0, -f) relative to the origin, where f is the focal length.

How does the Focal Length relate to Field of View?

As focal length increases, field of view decreases, e.g. when zooming in on something.
As focal length decreases, field of view increases, e.g. when zooming out on a scene.
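
A minimal sketch of this relationship, assuming the standard pinhole relation FOV = 2 * atan(w / (2 * f)). The sensor width w (36 mm here) and the function name are illustrative assumptions, not values from these notes:

```python
import math

def field_of_view_deg(focal_length_mm, sensor_width_mm=36.0):
    """Horizontal field of view (degrees) of an ideal pinhole camera.

    sensor_width_mm is an assumed, illustrative value.
    """
    return math.degrees(2 * math.atan(sensor_width_mm / (2 * focal_length_mm)))

# Longer focal lengths give a narrower field of view.
for f in (18, 50, 200):
    print(f, "mm ->", round(field_of_view_deg(f), 1), "degrees")
```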

How do you work out the coordinates of the projection on the image plane mathematically?

x = f * X / Z
y = f * Y / Z
Where (X, Y, Z) is the point on the object, and (x, y) is the projection of that point on the image plane.
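
As a small sketch, the projection can be written as a function. It follows the sign convention of the formula as written (no image inversion); the function name and the sample point are illustrative assumptions:

```python
def project(X, Y, Z, f):
    """Project the 3D point (X, Y, Z), in camera coordinates, to (x, y)."""
    if Z == 0:
        raise ValueError("Z must be non-zero: the point lies in the pinhole plane")
    return (f * X / Z, f * Y / Z)

# The same point moved twice as far away projects half as far from the centre.
near = project(2.0, 1.0, 10.0, f=0.05)
far = project(2.0, 1.0, 20.0, f=0.05)
print(near, far)
```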

What happens to parallel lines when they ‘head towards’ the horizon?

Parallel lines will eventually appear to converge on a vanishing point
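
This follows from the division by Z in the pinhole projection. A rough sketch, assuming two hypothetical parallel rails at X = -1 and X = +1 running away from the camera along Z:

```python
def project_x(X, Z, f=1.0):
    """Image-plane x coordinate of a point under pinhole projection."""
    return f * X / Z

def rail_gap(Z, f=1.0):
    """Projected distance between two parallel rails at X = -1 and X = +1."""
    return project_x(1.0, Z, f) - project_x(-1.0, Z, f)

# The rails stay 2 units apart in 3D, but their projected gap shrinks
# towards zero as Z grows, so both lines approach the same image point.
for Z in (1, 10, 100, 1000):
    print(Z, rail_gap(Z))
```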

What are some problems with pinhole cameras?

The pinhole (aperture) must be very small to obtain a sharp image. However, the smaller the pinhole, the less light reaches the image plane.
If the pinhole diameter is comparable to the wavelength of the incoming light, diffraction effects blur the image.

How do you ensure that the pinhole camera captures the sharpest image possible?

Optimal pinhole diameter d = 2 * sqrt(f * λ), where f is the focal length and λ is the wavelength of the light.
Example:
If f = 50 mm and λ = 600 nm, then d ≈ 0.35 mm
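
The worked example can be checked with a few lines of code. The function name is illustrative, and both lengths are converted to metres before applying the formula:

```python
import math

def optimal_pinhole_diameter(focal_length_m, wavelength_m):
    """Pinhole diameter from d = 2 * sqrt(f * wavelength), all in metres."""
    return 2 * math.sqrt(focal_length_m * wavelength_m)

d = optimal_pinhole_diameter(50e-3, 600e-9)  # f = 50 mm, wavelength = 600 nm
print(round(d * 1e3, 2), "mm")  # about 0.35 mm
```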

What are some advantages in using the pinhole camera?

Simple to understand
Infinite depth of field
No lens distortion

What are lenses used for?

Lenses are used to avoid the problems of the pinhole camera: they gather more light from the scene onto the image plane while retaining the same projection.

What does the term ‘f’ stand for in terms of lenses?

f = Focal length of the lens, which determines the lens’s ability to bend/refract light

How is intensity measured numerically?

Intensity = 0 if pixel is black
Intensity = 255 if pixel is white in an 8-bit image

What factors affect the colour of a pixel in an image?

Light sources:
- Emittance spectrum
- Geometry
- Directional attenuation

Objects’ surface properties:
- Reflectance spectrum
- Geometry
- Absorption

What are some typical use cases for Image Feature Representation?

Image alignment
3D reconstruction
Motion tracking
Object/face recognition
Indexing and database retrieval

What are image features?

A feature is a measurable property that describes the characteristics of an image or of a region of an image.

How are image features often represented?

Often represented by scalars, vectors, matrices or tensors.

What factors make image matching hard to perform?

Change in lighting
Change in viewpoint
Occlusions
Partial matching
Change over time

What are image regions & patches?

Image regions are segments of an image, and patches are rectangular sub-images. Both are used to collect information from a wider area of the image than a single pixel.

What is a feature vector?

A feature vector is formed by computing a set of descriptive features and concatenating them into a single vector.

What is the idea behind using feature vectors?

The idea is to remove redundant or irrelevant data

What makes histograms a good representation of colour?

Invariant to translation and rotation
Change slowly as viewing direction changes
Change slowly with object size
Change slowly with occlusion
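
The rotation invariance can be illustrated with a toy example. This sketch uses a grey-level histogram on a tiny hand-made image as a simplified stand-in for a colour histogram; the image values are made up for illustration:

```python
from collections import Counter

def grey_histogram(image):
    """Count how often each grey level occurs in a 2D list of pixel values."""
    return Counter(pixel for row in image for pixel in row)

image = [
    [0, 0, 255],
    [0, 128, 255],
    [0, 0, 0],
]
# Rotate by 180 degrees: reverse the row order and reverse each row.
rotated = [row[::-1] for row in image[::-1]]

# The pixels have moved, but the histogram only counts values, so it is unchanged.
print(grey_histogram(image) == grey_histogram(rotated))
```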

What are texture features?

Texture features measure the frequency with which patterns of colour/grey levels appear

What are gradient-based features?

Gradient-based features capture areas of an image where the intensity changes sharply (i.e. the intensity gradient is large), which typically indicate the boundaries of objects.

How do you estimate gradients using spatial filtering?

Take the source pixel and the area surrounding it, and overlay a convolution kernel on that area. Multiply each pixel value by the corresponding value in the kernel, then sum the results to give the output value for the source pixel. To estimate gradients, the kernel is a difference kernel, so the output is large where the intensity changes sharply.
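
A minimal sketch of this multiply-and-sum procedure in plain Python. The horizontal difference kernel and the test image are assumed examples (the notes do not name a specific kernel). The kernel is applied without flipping, as described above, and border pixels are left at 0:

```python
def apply_kernel(image, kernel):
    """Slide a square kernel over the image; border pixels are left at 0."""
    k = len(kernel) // 2
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for y in range(k, h - k):
        for x in range(k, w - k):
            # Multiply each neighbour by the matching kernel entry and sum.
            out[y][x] = sum(
                kernel[j + k][i + k] * image[y + j][x + i]
                for j in range(-k, k + 1)
                for i in range(-k, k + 1)
            )
    return out

# Assumed example kernel: right neighbour minus left neighbour.
kernel_x = [[0, 0, 0],
            [-1, 0, 1],
            [0, 0, 0]]

image = [[10, 10, 10, 200, 200],
         [10, 10, 10, 200, 200],
         [10, 10, 10, 200, 200]]

print(apply_kernel(image, kernel_x)[1])  # [0, 0, 190, 190, 0]
```

The large values in the middle row mark the columns next to the step from 10 to 200.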

What is the formal definition of noise?

Small random deviations added to or subtracted from the true value of a pixel.

How does a mean filter remove noise?

Move a kernel across the image and calculate a new pixel value based on the average of its surrounding neighbours.
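
A small sketch of a 3x3 mean filter, assuming the average includes the pixel itself and that border pixels are simply left unchanged (both are simplifying choices for illustration):

```python
def mean_filter(image):
    """3x3 mean filter; border pixels are copied through unchanged."""
    h, w = len(image), len(image[0])
    out = [row[:] for row in image]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            window = [image[y + j][x + i] for j in (-1, 0, 1) for i in (-1, 0, 1)]
            out[y][x] = sum(window) / 9
    return out

# A single noisy pixel (100) in a flat region of 10s.
noisy = [[10, 10, 10],
         [10, 100, 10],
         [10, 10, 10]]

print(mean_filter(noisy)[1][1])  # 20.0
```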

How does a Gaussian filter work?

It works almost like a mean filter, except that the kernel computes a weighted average. The weights are largest at the centre of the kernel and fall off with distance from it, following a Gaussian function.
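
One way to build such a kernel, as a sketch. The kernel size, the standard deviation, and the function name are illustrative assumptions; the weights are normalised to sum to 1 so that overall brightness is preserved:

```python
import math

def gaussian_kernel(size=3, sigma=1.0):
    """Square kernel of Gaussian weights, normalised so the weights sum to 1."""
    half = size // 2
    weights = [
        [math.exp(-(x * x + y * y) / (2 * sigma ** 2))
         for x in range(-half, half + 1)]
        for y in range(-half, half + 1)
    ]
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]

kernel = gaussian_kernel()
for row in kernel:
    print([round(w, 3) for w in row])
```

The centre weight is the largest, followed by the edge weights and then the corners, which is what distinguishes this from the uniform weights of a mean filter.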

What is the formal way of finding edges?

Edges are found through the use of difference filtering in order to pick out the areas of high contrast.

How does edge detection work?

Edge detection works by looking for sharp changes in intensity
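
A one-dimensional sketch of the idea: take differences between neighbouring pixels and keep the positions where the change is large. The threshold and the sample row are assumed values for illustration:

```python
def edge_positions(row, threshold):
    """Indices i where |row[i + 1] - row[i]| is at least the threshold."""
    diffs = [row[i + 1] - row[i] for i in range(len(row) - 1)]
    return [i for i, d in enumerate(diffs) if abs(d) >= threshold]

# Small fluctuations (noise) are ignored; the jump from 12 to 180 is an edge.
row = [12, 11, 13, 12, 180, 182, 181]
print(edge_positions(row, threshold=50))  # [3]
```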

How does a Histogram of Oriented Gradients work?

- Divide the patch into smaller cells (8x8 pixels)
- Define slightly larger blocks, covering several cells (2x2 cells)
- Compute gradient magnitude and orientation at each pixel
- Compute a local weighted histogram of gradient orientations for each cell, weighting by some function of magnitude
- Concatenate histogram entries to form a HoG vector for each block
- Normalise vector values by dividing by some function of vector length
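
A simplified sketch of the per-cell histogram and the normalisation step. The choices here are assumptions, not taken from the notes: 9 unsigned orientation bins, central-difference gradients, weighting by the magnitude itself, and L2 normalisation. For brevity it normalises a single cell rather than a full 2x2 block:

```python
import math

def cell_histogram(cell, bins=9):
    """Orientation histogram of one cell, weighted by gradient magnitude."""
    hist = [0.0] * bins
    bin_width = 180.0 / bins  # unsigned orientations in [0, 180)
    h, w = len(cell), len(cell[0])
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]
            gy = cell[y + 1][x] - cell[y - 1][x]
            magnitude = math.hypot(gx, gy)
            orientation = math.degrees(math.atan2(gy, gx)) % 180.0
            hist[int(orientation // bin_width) % bins] += magnitude
    return hist

def normalise(vector, eps=1e-9):
    """Divide by the vector's L2 length (eps avoids division by zero)."""
    length = math.sqrt(sum(v * v for v in vector))
    return [v / (length + eps) for v in vector]

# An 8x8 cell containing a vertical edge: intensity changes only along x.
cell = [[0, 0, 0, 0, 255, 255, 255, 255] for _ in range(8)]
hist = normalise(cell_histogram(cell))
print([round(v, 3) for v in hist])
```

All of the gradient energy falls in the first bin, because every non-zero gradient in this cell points along the x direction.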

Why is invariance important?

Invariance means that a feature should produce similar results even when conditions vary, such as changes in scale, translation, rotation and illumination.

How does Scale Invariance work?

- Find points whose surrounding patches (at some scale) are distinctive
- Convolution with a Gaussian mask gives some idea of what is going on around a pixel
- Gaussian masks have a natural scale: their standard deviation

What are some key properties of SIFT?

- Fast and efficient, can run in real time
- Can handle changes in viewpoint and significant changes in illumination

What is Clipping in the sense of Brightness?

Clipping occurs when the pixels are too bright to be correctly recorded in the numeric range available.
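
A small sketch of clipping in an 8-bit image, where values are limited to the range 0 to 255. The gain and pixel values are made up for illustration:

```python
def to_8bit(value):
    """Round to an integer and clamp to the 8-bit range 0..255."""
    return max(0, min(255, int(round(value))))

def brighten(pixels, gain):
    """Multiply each pixel by a gain, then store it as an 8-bit value."""
    return [to_8bit(p * gain) for p in pixels]

pixels = [50, 120, 200, 250]
print(brighten(pixels, 1.5))  # [75, 180, 255, 255]
```

The two brightest pixels were different before brightening but both end up at 255, so the difference between them is lost.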

What is the formal definition of Shutter Speed?

Shutter speed is the length of time for which light is allowed to reach the film/sensor.

How does Shutter Speed, Aperture and Gain relate?

If one goes up, you can maintain the same brightness level by decreasing the others. However, each change has side effects, e.g. a larger aperture reduces the depth of field and a longer shutter speed adds motion blur.
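
A deliberately simplified sketch of this trade-off. It assumes brightness is proportional to shutter time times aperture area times gain; this is an idealised model for illustration, and the units are arbitrary:

```python
import math

def relative_exposure(shutter_time_s, aperture_area, gain):
    """Idealised relative brightness: time x aperture area x gain (assumed model)."""
    return shutter_time_s * aperture_area * gain

base = relative_exposure(1 / 100, aperture_area=1.0, gain=1.0)
# Halve the shutter time and double the aperture area: same brightness,
# but a larger aperture means a shallower depth of field.
compensated = relative_exposure(1 / 200, aperture_area=2.0, gain=1.0)

print(math.isclose(base, compensated))
```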

What happens when you increase the Aperture?

A larger aperture means more light, but it also reduces the depth of field

What happens when you have a longer shutter speed?

A longer shutter speed means more motion blur

What makes a good scientific image for CV?

- Slightly underexpose the image: this prevents clipping, although it introduces more noise
- Centre the subject in the photo, against a simple background
- Record a calibration target for colour balance
- Optimise other settings for increased image clarity

What is the general consensus when collecting ranges of images for scientific analysis?

It is often cheaper and faster in the long run to spend a while making sure the images you capture are captured well and stored correctly.

(40 cards)