Week 3 - Segmentation & Registration Flashcards

(36 cards)

1
Q

Why is segmentation conducted?

A

Segmentation is used to assist completion of higher-level tasks, such as recognition, tracking, image database retrieval, feature quantification and registration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the main challenge with segmentation?

A

Divide the image into regions/segments, where each region presents a distinguished item. Each region should have similar importance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does Otsu’s Thresholding work?

A

Exhaustively search for the threshold that minimises the intra-class variance and maximises the inter-class variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the primary disadvantage of using Otsu’s Thresholding?

A

It’s not robust enough on its own for most real-world applications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What key assumption does the Otsu thresholding algorithm make about the data?

A

It assumes the data forms a bi-modal histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does Region Growing work for segmentation?

A
  • Start with one or more seed points
  • Iteratively check its neighbour points: If intensity difference between the neighbour point and the region is smaller than a threshold, assign the neighbour point to the segmented region.
  • Stop when there is no new assignment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the advantages of Region Growing for segmentation?

A
  • Enables multiple class segmentation
  • Parameter is easy to adjust
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the disadvantages of Region Growing for Segmentation?

A
  • Local region solution
  • Computationally expensive
  • Leakage along weak boundaries
  • Sensitive to seed points
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a Snake in regards to Segmentation?

A

It is a spline, which is a series of control points, with some function/s that govern the curve between the points. A snake is only interested in the location of the control points, but not the connection between them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does Active Contour, using Snake, work for segmentation?

A
  • Represent the object boundary as a parametric curve
  • A cost function is associated with the curve, so the boundary is found by optimising the cost function
  • The cost function is defined as the sum of the three terms
  • Iteratively update the contour points using something such as gradient descent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the mathematical equation that defines a snake in segmentation?

A

Esnake = (alpha * Einternal) + (beta * Eimage) + (gamma * constraint)
Where:
- Einternal = Contour smoothness, point spacing, etc…
- EImage = Image features e.g. lines, edges
- Econstraint = External constraint such as user interaction keypoints

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is the Snake term, called Einternal, defined?

A

Einteral = Econt + Ecurve
Where:
- Econt = Continuity i.e. control point distribution along the curve
- Ecurve = Curvature i.e. promote round curves where possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How is the ECont term defined for Snake in segmentation?

A

Econt = (Davg - ||Pi - P(i-1)||)^2
Where:
Davg is the average distance between adjacent points along the line
Pi is a point on the line, and P(i-1) is the point before that.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is the Ecurve term defined for Snake in segmentation?

A

Ecurve = (||P(i-1) - 2P(i) + P(i+1)||)^2
Where:
P stands for a point on the curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What happens if you remove the Davg term from the continuity measure for Snakes in Segmentation?

A

If you remove it, then the snake will become a closed loop that effectively wraps around the target object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How is a closed loop snake used for segmentation?

A

Closed snakes separate the areas on either side of the Snake. Inside of the shape the Snake creates is the foreground, and outside of the shape is the background.

17
Q

What are some drawbacks of Active Contour?

A
  • Node Distribution
  • Sharp Corners
  • Topology Changes
18
Q

How is Explicit Geometry defined?

A

Explicit Geometry - Parameterised boundaries

19
Q

How is Implicit Geometry defined?

A

Implicit Geometry - Boundaries given by zero level set

20
Q

What are some key properties of the Chan-Vese model?

A
  • No parameterisation required
  • Less sensitive to the contour initialisation and noise
  • Computationally efficient
  • Topological changes can be handled implicitly
  • Based on regional statistics rather than boundary information
21
Q

How does Graph Construction/Cut work in segmentation?

A
  • Consider the image as a graph, where it has edges, vertices, and costs between assigned values.
  • Find the optimal cut which produces the minimum cost
22
Q

What two terms make up an energy function in a graph cut problem in segmentation?

A

Data term (Unary) - It is a function derived from the observed data that measures the cost of assigning label to pixel p
Smoothness Term (Pairwise) - Measures the cost of assigning the labels to adjacent pixels p and q. It’s used to impose spatial smoothness.

23
Q

What does the smoothness term measure in graph cut problems in segmentation?

A
  • Check all pairs of neighbour pixels
  • Penalise adjacent pixels with different labels
  • Function penalises a lot for discontinuities between pixels of similar intensities
  • However, if pixels are very different, then the penalty is small
24
Q

What are the key steps of feature-based method using clustering?

A
  • Represent the characteristics of the local region for each pixel e.g. intensity, filtering, SIFT, HOG, etc…
  • Define similarity function e.g. Euclidean, Cosine, Manhattan, etc…
  • Region partition is handled as pixel classification problem using clustering analysis, such as K-Means.
25
What is the general algorithm for K-means Supervised Learning?
- Randomly select K points as initial centroids - Repeat: -- Assigning each point to its closest centroid -- Re-compute the centroid of each cluster - Until centroids/sets do not change
26
What are some advantages of using K-Means algorithm?
- Simple and efficient - Solution dependant on the initialisation
27
What are some disadvantages of using K-Means algorithm?
- Need to specify the number of clusters - Sensitive to outliers
28
What are the key steps when using Feature-based supervised learning methods?
- Represent the characteristics of the local region for each pixel e.g. intensity, filtering, HOG, etc... - Region partition is handled as pixel classification problem using supervised machine learning methods such as SVMs, Random Forest, etc... - Requires a training process with ground truth labels
29
What is a key drawback of using Feature-based Supervised Learning Methods?
It is application dependent.
30
Why is image registration useful?
- Information fusion - Information comparison - Transformation estimation - Statistical modelling and analysis based on large sets of aligned images
31
What are some sample applications of Image Registration?
- Medical e.g. pre and post treatment comparison - Remote sensing e.g. road map, satellite map - Augmented Reality e.g. aligning 3D virtual model to 2D images
32
What is the main aim for Image Registration?
To transform a source image to match with a target image
33
What are some key elements used for Image Registration?
- Geometric Transformations e.g. rigid, affine or deformable - Similarity Measurement e.g. point correspondence, intensity based - Parameter optimisation e.g. closed form solution, gradient descent
34
What is the general form for Geometric Transformation, using either Euclidean or Affine?
2D Transformations: Euclidean: 2 translations, 1 rotation Affine: 2 translations, 1 rotation, 2 scale 3D Transformations: Euclidean: 3 translations, 3 rotations Affine: 3 translations, 3 rotations, 3 scale Therefore: First values for each transformation type are always equal i.e. 2D = 2 translations, 1 rotation, and 3D = 3 translations, 3 rotations
35
What makes image registration problems easier in regards to point-based methods?
If correspondence points can be determined either manually or automatically, then the image registration problem becomes easier
36
How does the Iterative Closest Point Algorithm work?
- For each point in the source point cloud, match the closest point in the target point cloud e.g. Euclidean distance - Estimate the transformation, and use a point to point distance metric e.g. root mean square minimisation technique, which will best align each source point to its match found in the previous step - Transform the source points using the obtained transformation - Iterate the previous three steps until transformation parameters remain unchanged