SIFT Flashcards
(51 cards)
What is a video in visual computing?
A sequence of still images (frames) shown over time to create the illusion of motion.
What challenges does object recognition in video face?
Motion, lighting changes, scale and rotation, clutter, and background changes.
What is an interest point in an image?
A pixel-level structure that is repeatable and distinctive, such as a corner or blob.
What are desirable properties of an interest point?
Repeatable, distinctive, stable under transformations, subpixel accurate, and well-represented by a descriptor.
What is a blob in image analysis?
A region with distinct intensity or texture that can be localized and measured at a certain scale.
What is a scale space in image processing?
A set of images smoothed at increasing levels of Gaussian blur, used to detect features at different scales.
What is the formula for scale space?
S(x, y, σ) = G(x, y, σ) * I(x, y), where * denotes convolution, G is a Gaussian kernel with standard deviation σ, and I is the input image.
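A minimal sketch of this formula in plain Python, assuming the image is a list of rows of floats. The helper names (gaussian_kernel, gaussian_blur, scale_space), the 3σ kernel truncation, and the edge-replication border handling are illustrative choices, not part of the card.

```python
import math

def gaussian_kernel(sigma):
    """1D Gaussian kernel, truncated at 3 sigma (assumption) and normalized to sum to 1."""
    radius = max(1, int(math.ceil(3 * sigma)))
    weights = [math.exp(-(k * k) / (2 * sigma * sigma))
               for k in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def convolve_rows(image, kernel):
    """Convolve every row with a symmetric kernel, replicating edge pixels."""
    radius = len(kernel) // 2
    width = len(image[0])
    out = []
    for row in image:
        new_row = []
        for x in range(width):
            acc = 0.0
            for k, w in enumerate(kernel):
                xx = min(max(x + k - radius, 0), width - 1)
                acc += w * row[xx]
            new_row.append(acc)
        out.append(new_row)
    return out

def transpose(image):
    return [list(col) for col in zip(*image)]

def gaussian_blur(image, sigma):
    """G(x, y, sigma) * I(x, y), using the separability of the 2D Gaussian."""
    kernel = gaussian_kernel(sigma)
    blurred_rows = convolve_rows(image, kernel)
    return transpose(convolve_rows(transpose(blurred_rows), kernel))

def scale_space(image, sigmas):
    """One blurred copy of the image per sigma: S(x, y, sigma)."""
    return [gaussian_blur(image, s) for s in sigmas]

# Toy input: a single bright pixel in the middle of a 9x9 image.
image = [[0.0] * 9 for _ in range(9)]
image[4][4] = 1.0

levels = scale_space(image, [1.0, 2.0, 4.0])
peaks = [level[4][4] for level in levels]  # centre value shrinks as sigma grows
```

Each entry of levels is the same image at a coarser scale. The centre value falls as σ increases because the bright pixel is spread over a wider neighbourhood.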
How are blobs detected in scale space?
By finding local maxima or minima across both spatial and scale dimensions.
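A rough sketch of that search, assuming the scale space is stored as a list of equally sized 2D lists (one per scale). The function name scale_space_extrema and the strict 26-neighbour comparison (8 neighbours in the same level plus 9 in each adjacent level) are assumptions made for illustration.

```python
def scale_space_extrema(stack):
    """Return (scale_index, y, x, kind) for interior points that are strict
    extrema among their 26 neighbours in space and scale."""
    found = []
    for s in range(1, len(stack) - 1):
        for y in range(1, len(stack[s]) - 1):
            for x in range(1, len(stack[s][y]) - 1):
                value = stack[s][y][x]
                neighbours = [
                    stack[s + ds][y + dy][x + dx]
                    for ds in (-1, 0, 1)
                    for dy in (-1, 0, 1)
                    for dx in (-1, 0, 1)
                    if (ds, dy, dx) != (0, 0, 0)
                ]
                if value > max(neighbours):
                    found.append((s, y, x, "max"))
                elif value < min(neighbours):
                    found.append((s, y, x, "min"))
    return found

# Toy stack: 3 scales of a 5x5 image with one bright and one dark point
# in the middle scale.
stack = [[[0.0] * 5 for _ in range(5)] for _ in range(3)]
stack[1][1][1] = 1.0
stack[1][3][3] = -1.0

extrema = scale_space_extrema(stack)
```

Only interior points are tested, since border points and the first and last scale lack a full set of neighbours.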
What is the Difference of Gaussians (DoG)?
An approximation to the scale-normalized Laplacian of Gaussian, computed as the difference between two Gaussian-blurred versions of the same image at nearby scales (σ and kσ).
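A small 1D sketch of the idea, assuming a signal stored as a list of floats. The names gaussian_blur_1d and difference_of_gaussians, the scale ratio k = 1.6, and the 3σ truncation are illustrative assumptions. The same subtraction applies to 2D images.

```python
import math

def gaussian_blur_1d(signal, sigma):
    """Blur a 1D signal with a normalized Gaussian, replicating the edges."""
    radius = int(math.ceil(3 * sigma))
    kernel = [math.exp(-(k * k) / (2 * sigma * sigma))
              for k in range(-radius, radius + 1)]
    total = sum(kernel)
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = min(max(i + j - radius, 0), n - 1)
            acc += w * signal[idx]
        out.append(acc / total)
    return out

def difference_of_gaussians(signal, sigma, k=1.6):
    """DoG response: blur at k*sigma minus blur at sigma (k is an assumed ratio)."""
    fine = gaussian_blur_1d(signal, sigma)
    coarse = gaussian_blur_1d(signal, k * sigma)
    return [c - f for c, f in zip(coarse, fine)]

# A bright bar of width 5 on a dark background acts as a 1D "blob".
signal = [0.0] * 20 + [1.0] * 5 + [0.0] * 20
dog = difference_of_gaussians(signal, sigma=2.0)

# A constant signal has no structure, so its DoG response is (numerically) zero.
flat = difference_of_gaussians([3.0] * 45, sigma=2.0)
```

With this sign convention the bright bar gives a negative response at its centre (index 22) and positive side lobes, while flat regions give no response. This band-pass behaviour is what makes DoG usable as a blob detector.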
Why is DoG used in SIFT?
It is computationally efficient and effective at detecting blobs at multiple scales.
What is SIFT?
Scale-Invariant Feature Transform: a method to detect and describe distinctive local features in an image that remain stable under changes in scale and rotation.
What are the main steps of SIFT?
Scale-space extrema (blob) detection, keypoint localization, orientation assignment, descriptor creation, and descriptor normalization.