Low Level Vision Flashcards

Question

What size is the camera extrinsic matrix?

Answer 1

4 x 4 if identity matrix: [ 1, 0, 0, 0] [ 0, 1, 0, 0] [ 0, 0,1, 0] [ 0, 0, 0, 0]

Answer 2

- We assume Zc and Zw are equal, so where we see Zc we can substitute in Zw - This is because for most cases, Zc and Zw are very close to each other.

Answer 3

The number of columns of the first matrix must equal the number of rows of the second matrix

Answer 4

The light intensity at that point

Answer 5

0 represent black and 255 represent white

Answer 6

0 represents black and 1 represents white

Answer 7

Red Green Blue is a colour image representation, which consists of 3 channels.

Answer 8

- You take the weighted average over the three colour channels - I(grey) = (r*Ir + g*Ig + b*Ib) / r + g + b

Answer 9

- Choose a threshold value - If the pixel value is above the threshold, we assign it 1 or white - If the pixel value is below the threshold we assign it 0 or black

Answer 10

A method of downsampling - decreases resolution by removing pixels depending on a pattern - example: only keep every other row of pixels in an image

Answer 11

Reduces resolution/number of pixels by finding the maximum value for a region and using that to represent the region.

Answer 12

That we will lose some detail

Answer 13

Used to increase resolution - a simple method is approximation by the nearest available pixel or neighbour

Answer 14

An up-sampling method involving copying the adjacent pixel value from the same colour channel It's fast but inaccurate

Answer 15

- an up-sampling method involving taking the average value of the nearest two or 4 pixels from the same colour channel - It's fast and accurate in smooth regions - inaccurate at edges

Answer 16

- Translation - Scaling - Rotation

Answer 17

Translation is a type of image manipulation that moves a point to another location by adding amounts (usually as vector) to the coordinates of the point - New point is: (3 + 6, 4 + 2) = P'(9, 6)

Answer 18

- Scaling is a type of image manipulation that moves points to make things smaller or bigger. If the scale is larger than 1, the object gets bigger, if the scale is smaller than 1, the object gets smaller P(3, 4) * 0.5 = P'(1.5, 2) P(3, 4) * 2 = P'(6, 8)

Answer 19

Rotation is a type of image manipulation that moves a point when the image is rotated: Calculated by using these equations, where a is the angle by which the point(x,y) is rotated: x' = x*cos(a) - y*sin(a) y' = x*sin(a) - y*cos(a)

Answer 20

They are both filters that are used to transpose an image. They are represented by formulas. - The cross correlation filter transposes a pixel to the same location in an image - The convolution filter transposes a pixel to a position that is rotated 180 degrees in the image.

Answer 21

- Cross correlation is calculated by first adding padding: then we multiply the corresponding values and add them for a pixel. - Convolution is calculated by rotating the filter (NOT THE IMAGE) by 180 degrees first, then multiply and add for all 9 pixels in a grid

Answer 22

That the formula we use is actually for cross-correlation, we just call it the convolution filter

Answer 23

It starts at 0, so the top left corner is (0,0)

Answer 24

- Padding is adding values to the outside of the image - That when we pad an image, the boarder of padding consists of the value 0.

Answer 25

It reduces the size of an image: 7-3+1 = 5 so output is 5x5

Answer 26

Using a convolution filter where the values are less than 1, this has the affect of blurring the image

Answer 27

- depth discontinuity - surface colour discontinuity - surface normal discontinuity - Illumination discontinuity

Answer 28

dy/dx =2x + 4x^3 dy/dx = cosx -e^-x

Answer 29

Each pixel value is a discrete integer and we calculate the discrete derivative

Answer 30

- emphasizes areas where the intensity changes abruptly (happens at edges) - identifies points where the change is maximal

Answer 31

- The backward difference filter finds the derivative at a pixel point considering the difference between the pixel point and the one before it - the forward difference filter finds the derivative at a pixel point considering the difference between the pixel point and the one ahead of it - the central difference filter finds the derivative of a pixel point by considering the difference between the pixel ahead of it and the pixel before it

Answer 32

- Forward filters are useful to use at the start of an array since the first pixel in an array doesn't have anything before it - Backward filters are useful to use at the end of an array since the last pixel in an array doesn't have anything following it - a central filter is useful to use for any pixel that's not at the start or end of an array as it has a pixel point before and after it

Answer 33

Backward: df/dx = f(x) - f(x-1) Forward: df/dx = f(x) - f(x+1) Central: df/dx = f(x+1) - f(x-1)

Answer 34

Backward: [-1 1 0] Forward: [0 1 -1] Central: [-1 0 1]

Answer 35

We do normal convolution (multiply then add) Backward filter = [-1 1 0] - No padding so 10 becomes 0: - (-1*10) + (1*15) = 5 - (-1*15) + (1*10) = -5 [0, 5, -5]

Answer 36

find derivates: gradient vector (df/dx, df/dy) and use to calculate the gradient magnitude and the gradient direction. - apply difference filter, horizontally for x axis and vertically for y axis for pixel point. - calculate gradient direction - calculate gradient magnitude

Answer 37

- Applying a filter in the vertical direction gives us the 2D derivative for the y-direction - Applying a filter in the horizontal direction gives us the 2D derivative for the x-direction

Answer 38

- Light intensity changes are reflected in the gradient - an edge point corresponds to the extreme of a derivative

Answer 39

angle = tan^-1(df/dy / df/dx)

Answer 40

It's given by the gradient magnitude - ||edge strength|| = sqrt( (df/dx)^2 + (df/dy)^2 )

Answer 41

Backward filter = [-1 1 0] apply for x-axis: (-1*3) + (1*5) + (0*3) = 2 apply for y-axis: (-1*3) + (1*5) + (0*3) = 2 direction = tan^-1(2/2) = 45* magnitude = sqrt(2^2 + 2^2) = 2*root(2)

Answer 42

- For each different value we count how many of that value occurs then create a histogram with the value

Answer 43

By dividing each amount of the different values by the total number of pixels: so if it's a 5x5 image, we divide each amount by 25

Answer 44

A low-pass filter smooths images and removes noise by reducing high frequency information and retaining low frequency information

Low Level Vision Flashcards

(68 cards)