Week 5 - Higher-Level Computer Vision Flashcards

(16 cards)

1
Q

What are the 4 main Recognition problems?

A
  • Recognition: Identifying the main object in an image
  • Detection: Find the location of all objects
  • Segmentation: Assign all pixels to objects
  • Pose: Find the location of the object parts
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the formal definition of the ‘Detection’ problem in Recognition problems?

A

Find the location of all objects in the scenes in terms of providing a bounding box

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Semantic Image Segmentation?

A

It’s the process of partitioning the image into ‘meaningful’ segments
You group pixels based on ‘common’ properties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Instance Image Segmentation used for?

A

If you need to differentiate different instances of the same object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What three main features are required for a good Object Recognition Model?

A
  • Data: Images containing objects from that class and images from all other classes
  • Feature Extraction: Work with features extracted from images
  • Machine Learning: From the features extracted, initiate and train a model that recognises this particular object class
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do HOG features work?

A
  • Divide image into a grid of cells e.g. 8x8
  • Compute edges and their orientation for every pixel location
  • Compute histogram of gradient orientations in each cell
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Bag of Features?

A

Bag of Features methods analyse the large set of very specific features generated by a training set of images and identify a small set of useful, more generic features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does Object Recognition work with Bag of Features?

A
  • Take a bunch of images: Extract features, build up a ‘dictionary’ of common features
  • Then, given a new image, extract features:
    – For each feature, find the closest visual word in the dictionary
    – Build a histogram to represent the image
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does Viola-Jones Recognition work in practice?

A
  • Slide a window across the image and evaluate a face model at every location
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the key ideas that were pulled from Viola-Jones Recognition?

A
  • Integral images for fast feature evaluation
  • Boosting for feature selection
  • Attentional cascade for fast rejection of non-face windows
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does the Integral Image work?

A

The integral image computes a value at each pixel that is the sum of the pixel values above and to the left of the source pixel inclusive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is Feature Extraction performed using Integral Images?

A

Features are extracted from sub-windows of a sample window, and each of the four feature types are scaled and shifted across all possible combinations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the formal definition of Boosting?

A

Boosting is a classification scheme that works by combining weak learners into a more accurate ensemble classifier, where a weak learner is defined as a learner that does only slightly better than random chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does Boosting work?

A
  • Need a training set of labelled examples
  • Start with all examples equally weighted
  • Learn a series of recognition rules
  • Re-weight examples so incorrect recognition by nth classifier makes that example more important to the n+1th
  • No single rule/classifier can separate complex objects from complex backgrounds, but a combination can
  • Weights are determined automatically
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the Classical Approach in Computer Vision?

A

Apply learned operations to user-defined features:
- Design/choose features
- Design/choose a classifier
- Train the classifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does Bag of Words reduce reliance on the user?

A

It clusters the results of applying the user-defined set of feature-detection operators to form a more generic visual vocabulary.