Feature detection/matching Flashcards by Patrick Henriksen

Describe the steps in the RANSAC algorithm

Calculate model parameters from n random data points, where n is the minimum required points
Evaluate inlier percentage for this model with a distance threshold t
If inlier percentage is the current max, save this model
Repeat N times
Use the inliers from the best model and create a better model using least squares or similar methods
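
The steps above can be sketched for the simplest case, fitting a line y = a*x + b, where the minimum sample size is n = 2. This is only an illustrative sketch: the function names, the vertical-residual distance, the threshold t = 0.5 and the fixed iteration count are assumptions, not part of the flashcards.

```python
import random

def fit_line(p, q):
    """Line y = a*x + b through two points (assumes different x values)."""
    (x1, y1), (x2, y2) = p, q
    a = (y2 - y1) / (x2 - x1)
    return a, y1 - a * x1

def least_squares_line(points):
    """Ordinary least-squares fit of y = a*x + b."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    a = sxy / sxx
    return a, my - a * mx

def ransac_line(points, t=0.5, n_iters=100, seed=0):
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(n_iters):                      # repeat N times
        p, q = rng.sample(points, 2)              # minimal random sample
        if p[0] == q[0]:
            continue                              # vertical pair: skipped in this sketch
        a, b = fit_line(p, q)
        # Inliers: points whose vertical residual is below the threshold t
        inliers = [(x, y) for x, y in points if abs(y - (a * x + b)) < t]
        if len(inliers) > len(best_inliers):      # keep the best model so far
            best_inliers = inliers
    # Refine with least squares on the inliers of the best model
    return least_squares_line(best_inliers), best_inliers

# Ten points on y = 2x + 1 plus three gross outliers
data = [(x, 2 * x + 1) for x in range(10)] + [(1, 30), (4, -20), (7, 50)]
(a, b), inliers = ransac_line(data)
```

Comparing inlier counts is equivalent to comparing inlier percentages here, since the data set is the same in every iteration.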

Describe RANSAC in one sentence

RANSAC is a robust (handles outliers) method for estimating the parameters of a mathematical model from observed data

What should the RANSAC threshold be if the noise in the data has a normal distribution

Around 2 sigma

The formula used for deciding N (the number of runs) in the RANSAC algorithm is N = log(1 - p) / log(1 - w^n), where p is the desired probability of sampling at least one set without outliers, n is the number of data points used to estimate the parameters, and w is the probability of a data point being an inlier. How are p and w usually decided?

p is usually set to 0.99

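w is usually unknown beforehand, so it is approximated during each run, for example from the inlier ratio of the best model found so far.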
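
As a worked example of the formula N = log(1 - p) / log(1 - w^n), the following sketch computes the number of runs for a homography (n = 4). The function name and the example inlier ratio w = 0.5 are assumptions chosen for illustration; w = 1 is not handled.

```python
import math

def ransac_iterations(p, w, n):
    """N = log(1 - p) / log(1 - w**n), rounded up to a whole number of runs."""
    return math.ceil(math.log(1 - p) / math.log(1 - w ** n))

# p = 0.99, half of the data are inliers, 4 points per sample
n_iters = ransac_iterations(p=0.99, w=0.5, n=4)   # 72 runs
```

A higher inlier ratio reduces N sharply, which is why re-estimating w during the run can save many iterations.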

What is the full name of RANSAC

RANdom SAmple Consensus

How is the canonical orientation of a feature point normally decided

The local patch is rotated so that the direction of the maximum gradient is pointing upwards.

Describe the steps from feature detection to feature matching

Detect feature points in the image
Define a local patch around the feature
Extract and normalize the local patch
Create a local descriptor
Match features by calculating the distance between the descriptors.

What is SIFT short for

Scale Invariant Feature Transform

What type of descriptor do the SIFT and SURF algorithms use

HoG (Histogram of Oriented Gradients).

The other type of descriptors are binary descriptors.

Describe the steps in the SIFT algorithm

Use LoG pyramids to determine the canonical scale and HoG to determine the canonical orientation.
Normalize the patch to 16x16 pixels, compute the gradients and apply a Gaussian weighting to the gradients.
Divide the 16x16 patch into a 4x4 grid of cells, compute an 8-bin histogram of gradient directions in each cell and concatenate the histograms into a 128-dimensional feature vector.
Normalize to unit length, clip each value at 0.2 and renormalize.
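
The last step (normalize, limit values to 0.2, renormalize) can be sketched as below. The function name and the toy 4-element vector are assumptions for illustration; a real SIFT descriptor has 128 elements.

```python
import math

def normalize_sift(vec, clip=0.2):
    """Unit-normalize, clip each element at `clip`, then renormalize."""
    def unit(v):
        norm = math.sqrt(sum(x * x for x in v))
        return [x / norm for x in v] if norm > 0 else list(v)
    clipped = [min(x, clip) for x in unit(vec)]
    return unit(clipped)

# Toy "descriptor" with one dominant bin
d = normalize_sift([10.0, 1.0, 1.0, 1.0])
```

Clipping limits the influence of a few very large gradient magnitudes, so the dominant bin ends up smaller than it would be after plain unit normalization.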

What is the main advantage of binary descriptors, and how is this achieved

The main advantage is computational speed. The descriptors are binary strings, so the Hamming distance (XOR followed by a bit count) can be used for matching. Both operations are computationally very efficient.
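
A minimal sketch of the Hamming distance between two binary descriptors, assuming each descriptor is stored as a Python integer (the 8-bit values are made up for illustration; real descriptors are typically 128 to 512 bits):

```python
def hamming(a: int, b: int) -> int:
    """Number of differing bits: XOR the descriptors, then count the 1 bits."""
    return bin(a ^ b).count("1")

d1 = 0b10110010
d2 = 0b10011010
dist = hamming(d1, d2)   # XOR is 0b00101000, so the distance is 2
```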

Name some binary descriptor algorithms

BRIEF, ORB, BRISK, FREAK

Name some distance functions used for feature matching

L1, L2, Hamming(XOR)

Explain the ROC curve

The ROC curve plots the true positive rate (true positives / all true matches) against the false positive rate (false positives / all non-matches) as the matching threshold is varied.
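
A small sketch of how ROC points could be computed for a matcher, assuming a candidate match is accepted when its descriptor distance is at most a threshold, and taking the denominators to be the number of true and false candidate matches. The function name and the toy data are hypothetical.

```python
def roc_points(distances, is_true_match, thresholds):
    """Return (false positive rate, true positive rate) for each threshold."""
    pos = sum(is_true_match)
    neg = len(is_true_match) - pos
    points = []
    for t in thresholds:
        pairs = list(zip(distances, is_true_match))
        tp = sum(1 for d, m in pairs if d <= t and m)
        fp = sum(1 for d, m in pairs if d <= t and not m)
        points.append((fp / neg, tp / pos))
    return points

dists = [0.1, 0.2, 0.4, 0.5, 0.7, 0.9]
labels = [True, True, False, True, False, False]
curve = roc_points(dists, labels, thresholds=[0.0, 0.3, 0.6, 1.0])
```

Sweeping the threshold from strict to permissive moves the curve from (0, 0) to (1, 1).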

Explain the ratio test

Keep the two best matches. If the distance of the first divided by the distance of the second is larger than some threshold (e.g. 0.8), throw away the match.
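
The ratio test can be sketched as follows, assuming the two best distances have already been found for each query feature. The tuple layout and the example values are assumptions for illustration.

```python
def ratio_test(matches, max_ratio=0.8):
    """matches: list of (query_id, best_distance, second_best_distance)."""
    kept = []
    for query_id, d1, d2 in matches:
        # Keep only matches whose best distance is clearly smaller than the runner-up
        if d2 > 0 and d1 / d2 <= max_ratio:
            kept.append(query_id)
    return kept

candidates = [("a", 0.20, 0.90), ("b", 0.50, 0.55), ("c", 0.30, 0.40)]
kept = ratio_test(candidates)   # "b" is ambiguous (ratio about 0.91) and is dropped
```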

What is the minimum number of corresponding data points in two images required to calculate a homography

4, where no 3 points are collinear

Describe the DLT (direct linear transform) for finding homographies.

Normalize the point coordinates.
Transform the problem to a matrix equation of the form Ah = 0.
Obtain the SVD of A.
If S is diagonal with positive values in descending order, h is the last column of V
Denormalize and reconstruct the homography H from h.
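
A sketch of the normalized DLT using NumPy, with the points normalized before A is built. The particular normalization (centroid at the origin, mean distance sqrt(2)), the function names and the test homography are assumptions for illustration, not taken from the flashcards.

```python
import numpy as np

def normalization_matrix(pts):
    """Similarity transform: centroid to origin, mean distance sqrt(2)."""
    centroid = pts.mean(axis=0)
    mean_dist = np.linalg.norm(pts - centroid, axis=1).mean()
    s = np.sqrt(2) / mean_dist
    return np.array([[s, 0, -s * centroid[0]],
                     [0, s, -s * centroid[1]],
                     [0, 0, 1]])

def dlt_homography(src, dst):
    """Estimate H with dst ~ H @ src from at least 4 correspondences."""
    src, dst = np.asarray(src, float), np.asarray(dst, float)
    T1, T2 = normalization_matrix(src), normalization_matrix(dst)
    ones = np.ones((len(src), 1))
    p = (T1 @ np.hstack([src, ones]).T).T       # normalized source points
    q = (T2 @ np.hstack([dst, ones]).T).T       # normalized target points
    rows = []
    for (x, y, _), (u, v, _) in zip(p, q):      # two rows of A per correspondence
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    _, _, Vt = np.linalg.svd(np.array(rows))
    Hn = Vt[-1].reshape(3, 3)                   # last row of V^T = last column of V
    H = np.linalg.inv(T2) @ Hn @ T1             # denormalize
    return H / H[2, 2]                          # assumes H[2, 2] is not zero

def apply_h(H, pts):
    """Apply H to Nx2 points and divide by the homogeneous coordinate."""
    pts = np.asarray(pts, float)
    pts_h = np.hstack([pts, np.ones((len(pts), 1))]) @ H.T
    return pts_h[:, :2] / pts_h[:, 2:3]

H_true = np.array([[1.2, 0.1, 3.0], [-0.2, 0.9, 1.0], [0.001, 0.002, 1.0]])
src = [(0, 0), (1, 0), (1, 1), (0, 1)]
H_est = dlt_homography(src, apply_h(H_true, src))
```

With exactly four noise-free correspondences, A has a one-dimensional nullspace and the estimate matches the true homography up to numerical precision.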

Why are the parameters often normalized in the DLT?

The DLT performs best when all entries of A are of similar scale; normalizing the point coordinates improves the numerical conditioning of the problem.

What different kind of errors can be used when using RANSAC to determine Homographies

Algebraic error:
e_i = ||A_i*h||

Geometric errors:
e_i = d(Hu_i, u_i2) + d(u_i, (H^-1)u_i2) (symmetric transfer error, often loosely called reprojection error)
e_i = d(Hu_i, u_i2)
e_i = d(u_i, (H^-1)u_i2)
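
The three geometric errors above can be computed for a single correspondence as in this sketch. The function names and the example homography (a pure shift of 2 in x) are assumptions for illustration, and d is taken to be the Euclidean distance.

```python
import numpy as np

def project(H, pt):
    """Apply H to a 2D point and divide by the homogeneous coordinate."""
    x, y, w = H @ np.array([pt[0], pt[1], 1.0])
    return np.array([x / w, y / w])

def transfer_errors(H, u, u2):
    """Return (forward, backward, forward + backward) errors for u <-> u2."""
    forward = np.linalg.norm(project(H, u) - np.asarray(u2, float))
    backward = np.linalg.norm(project(np.linalg.inv(H), u2) - np.asarray(u, float))
    return forward, backward, forward + backward

H = np.array([[1.0, 0.0, 2.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
fwd, bwd, sym = transfer_errors(H, u=(1.0, 1.0), u2=(3.0, 2.0))
```

Here Hu = (3, 1) is one pixel away from u2, and (H^-1)u2 = (1, 2) is one pixel away from u, so the summed error is 2.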

How is the SVD used to solve matrix equations of the form Ah = 0?

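The nullspace of A is spanned by the right singular vectors whose singular value is 0 or that have no corresponding singular value. With noisy data, h is taken as the right singular vector with the smallest singular value.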

How can we detect lines in images

Using the Hough transform

What are the important characteristics of good feature points

Distinct
Local
Efficient
Reproducible

What is the interpretation of the eigenvalues and vectors of the M matrix in corner detection?

The eigenvalues describe the max/min change and the eigenvectors describe the directions of the max/min change

What is an important property for edge features

Small movements in any direction should cause large changes in the local patch around the feature point. This is equivalent to the smallest eigenvalue of M being large.

What operator is often used to score corner features

The Harris operator (det(M) - alpha*trace(M)^2)
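
A minimal sketch of the Harris score for a given 2x2 matrix M, using the common form det(M) - alpha*trace(M)^2 (note the squared trace). The value alpha = 0.05 and the three example matrices are assumptions for illustration.

```python
def harris_score(M, alpha=0.05):
    """Harris response det(M) - alpha * trace(M)**2 for a 2x2 matrix M."""
    (a, b), (c, d) = M
    det = a * d - b * c
    trace = a + d
    return det - alpha * trace ** 2

corner = harris_score([[10.0, 0.0], [0.0, 10.0]])   # both eigenvalues large
edge = harris_score([[10.0, 0.0], [0.0, 0.1]])      # one large, one small
flat = harris_score([[0.1, 0.0], [0.0, 0.1]])       # both small
```

The score is large and positive for the corner, negative for the edge, and close to zero for the flat region.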

Is the Harris detector invariant to affine changes in pixel intensity?

Only partially (Yes to additive changes, no to scaling)

Why is the Harris detector invariant to additive changes in pixel intensity

Only the image derivatives are used, and a constant intensity offset disappears when differentiating.

How can we solve the problem that the Harris detector isn't invariant to image scaling?

We can compute the Harris score for several scales, and choose the largest. This is the canonical scale.

When will the Laplace operator have the maximum response for a binary circle?

When the zeros of the Laplace align perfectly with the circle edge.

Why is the Laplace pyramid used in blob detection?

To detect blobs at different scales.

Name some methods used for edge detection

Canny, Laplace, Sobel

What is the idea behind binary feature detectors

A set of points is sampled in a local neighborhood. These points are connected in pairs, and a score of 1 is assigned if a point has a larger value than the point it is paired with, else 0.

What is the M matrix in edge feature detection?

The M matrix describes changes if the local patch is slightly moved.

When are two images related by a homography?

If the images are captured from a planar scene

What is a canonical affine transformation?

A local patch normalization using affine transforms. This allows rotated planes etc. to be matched.

Describe the cross check test (Alternative to the ratio test)

Measures the projection both ways. Matches are accepted only if fa is the best match for fb AND fb is the best match for fa.
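
The cross check can be sketched on a small distance matrix, where dist[i][j] is the descriptor distance between feature i in image A and feature j in image B. The function name and the distance values are made up for illustration.

```python
def cross_check(dist):
    """Return (i, j) pairs that are each other's best match."""
    n_b = len(dist[0])
    # Best match in B for every feature in A (row-wise minimum)
    best_b_for_a = [min(range(n_b), key=row.__getitem__) for row in dist]
    # Best match in A for every feature in B (column-wise minimum)
    best_a_for_b = [min(range(len(dist)), key=lambda i: dist[i][j])
                    for j in range(n_b)]
    return [(i, j) for i, j in enumerate(best_b_for_a) if best_a_for_b[j] == i]

dist = [
    [0.1, 0.8, 0.9],
    [0.7, 0.2, 0.6],
    [0.3, 0.9, 0.5],
]
matches = cross_check(dist)
```

Feature 2 in A prefers feature 0 in B, but feature 0 in B prefers feature 0 in A, so that one-sided match is rejected.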

What is: Invariance and Covariance?

– Invariance: image is transformed and corner locations do not change
– Covariance: if we have two transformed versions of the same image, features should be detected in corresponding locations
