Jon's Topics Flashcards
(41 cards)
Robust, Repeatable, Invariance and Constraints
Robust and Repeatable computer vision is achieved through engineered Invariance and applied Constraints. (We wants Robust and Repeatable, we achieve using Invariance and Constraints)
Feature Space
Abstract space defined by a feature extraction process. Transforming raw data into feature of some fixed number of elements (dimensionality of the space).
Distance Measures
Euclidean (L2) Distance: 2rt[SUMi=1->n (q_i-p_i)^2]
L1 Distance: SUMi=1->n ( | p_i-q_i | ). Where p,q = points
Supervised ML: Classification
Binary - Linear: Learn a hyperplane that separates two classes with a minimum error. Non-Linear: Can learn curved lines to fit data, lose generality by overfitting.
Multiclass - KNN
Unsupervised ML: Clustering
Aims to group data without any prior knowledge. K-Means: Guaranteed to find optimal results, doesn’t perform consistently. Trial and error to find best k value.
Image Features (4 types)
Global, Grid-based, Region-based, Local
Global Features
Extracted contents from entire image, outputs single Feature Vector. Image Histogram is common. Joint Colour Histogram usually considers all pixel colour components together.
Segmentation
Used for segmenting image into regions for Region based features. Using: Thresholding - Basic, Otsu, and Local or Adaptive Thresholding. K-Means
Otsu Thresholding
Otsu: Assumes image contains two classes of pixels (fore/background). Calculates optimum threshold to separate two classes with minimal intra-class variance.
Local or Adaptive Thresholding
Computes a different threshold for each pixel, based on values of neighbouring pixels. E.g. Mean adaptive thresh sets pixel to BG if it’s value is less than mean of it’s neighbours plus an offset.
Segmentation using K-Means
Simple method: Cluster colour vector of all pixels, assign each pixel based on closest cluster centroid. Could spatially encode FV to help base segments on relative position of pixels, to avoid loner different region pixels.
Connected Components
Type of segment in which all pixels are reachable to each other through a path of spatially adjacent pixels. Commonly 4-connectivity or 8-connectivity
Connected Component Labelling
Takes binary image and produces a set of connected components. Uses 2 pass algorithm
Simple Scalar Features
Area: #pixels in component Perimeter: Length around component Inner Border: Inner pixels forming edge Outer Border: Outer pixels of edge Compactness: Measure how tightly packed pixels are Dispersion: How spread out shape is
Moments
Describes Distribution of pixels in a shape. Standard 2D Cartesian Moments can be used as shape descriptors. Different shapes have different moments. Not invariant to scaling, translation, and rotation.
Central Moments + Normalised Central Moments + Hu Moments
Central: Translation invariant as moments computed about centroid of component.
Normalised Central: Translation and Scale Invariant.
Hu: 7 scale, rotation and invariant moments.
Chain Codes
Simple way of encoding boundary of an object. Walk around boundary and encode direction on each step as a number. Cyclically shift code so it forms smallest integer value. Invariant to start point.
Chain Codes Properties
Make rotation invariant by encoding differences in direction. Make scale invariant by resampling component to fixed size. Perimeter for 8-con chain code = #(Even numbers in CC) + Rt2 * #(Odd numbers in CC). Not good for shape matching due to noise, resampling and no good distance metric.
Fourier Descriptor
Encode shape info by decomposing boundary into small set of frequency components. 1: Define representation of the curve (boundary). 2: Expand representation using Fourier Theory.
Region Adjacency Graph
Build graph from set of connected components. Each node = component. Nodes connected if they share a border. Invariant to distortion, not to occlusion. Good for creating markers like QR Codes.
Point Distribution Models
Learning a low dimensional parametric model of how the shape of an object can vary. Shape represented by number of fixed 2D points at distinguishable locations on the object. Needs training set of images with same points marked.
Procrustes Analysis
Choose a reference shape, superimpose (align) all instances to current reference shape. Compute mean shape of the current set of images. If Euclidean distance between mean shape and reference shape is above a threshold, set reference shape to mean shape and repeat.
Local Interest Point
Good Local Interest Point is invariant to global & local lighting change, invariant to difference in camera’s location. Detect using corner or blob detection.
Corner Detector: Harris + Stephens
Search for corners through a small window. Corner found if shift in window by a small amount results in large change in intensity.