CVI FINAL EXAM DAY Flashcards

(57 cards)

1
Q

What is the key difference between RoI Pooling and RoI Align

A

RoI Pooling rounds coordinates causing misalignment while RoI Align uses interpolation to preserve spatial accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is SURF considered faster than SIFT

A

SURF uses approximations like box filters and integral images while SIFT uses precise gradients making it slower

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What kind of transformation does a Region Proposal Network RPN learn

A

It learns to transform anchor boxes into tighter object proposals by predicting shifts and scales

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is C3D not used for optical flow estimation

A

Because C3D performs video classification not pixelwise motion estimation so it does not output motion vectors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How does DeepLab use atrous convolution and ASPP for segmentation

A

It uses atrous convolution to enlarge receptive field and ASPP to capture multiscale context features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Fast RCNN

A

Deep learning for object detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Faster RCNN

A

Deep learning for object detection with learned region proposals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

RCNN

A

Deep learning for object detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

OmniMotion

A

Advanced motion estimation using video tracking across occlusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Lucas Kanade

A

Traditional optical flow estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Horn Schunck

A

Traditional optical flow estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

FlowNet

A

Deep learning for optical flow estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

VoxelMorph

A

Deep learning for image registration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does Otsus method actually optimise during threshold selection

A

It maximises inter class variance between foreground and background pixel distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Name one deep learning model commonly used for image registration

A

VoxelMorph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Prewitt

A

Traditional edge detection using gradient approximation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

AlexNet

A

Deep learning for image classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the difference between self supervised and unsupervised learning

A

Self supervised uses pseudo labels from data while unsupervised finds structure without labels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Robinson

A

Traditional edge detection using compass kernels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

C3D

A

Deep learning for video classification and action recognition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do skip connections in U Net help during segmentation

A

They pass high resolution features from encoder to decoder to preserve localisation and detail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

HED

A

Deep learning for edge detection using holistically nested networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

DoG

A

Traditional edge detection using Difference of Gaussians

24
Q

How does contrastive loss function mathematically encourage representation learning

A

It pulls similar pairs closer and pushes dissimilar pairs apart in embedding space using similarity scores

25
How are eigenfaces constructed using PCA
By computing eigenvectors from face dataset and representing new faces as weighted sums of these eigenfaces
26
When would you use mutual information instead of SSD for image registration
When the two images come from different modalities or have different intensity scales
27
What is the visual output difference between semantic and panoptic segmentation
Semantic labels each pixel with a class while panoptic also distinguishes between different object instances
28
UNet
Deep learning for semantic segmentation
29
Mask RCNN
Deep learning for instance segmentation
30
FlowNet2
Deep learning for optical flow estimation
31
YOLO
Deep learning for real time object detection
32
Sobel
Traditional edge detection using gradient approximation
33
Kirsch
Traditional edge detection using compass kernels
34
Canny
Traditional edge detection with gradient smoothing and hysteresis
35
LoG
Traditional edge detection using Laplacian of Gaussian
36
Rich Feature Hierarchies
Deep learning based edge detector built on top of CNNs
37
TimeSformer
Transformer based model for video understanding
38
DyeNet
Deep learning for video object segmentation using appearance and motion cues
39
Two Stream CNN
Deep learning for action recognition using RGB and optical flow as parallel inputs
40
How to calculate weights in CNN?
Kernel x kernel x colour channel number x filters
41
How to calculate bias in CNN?
Number of filters
42
What are two morphological operators and outline their purpose
Dilation – Expands the boundaries of foreground (white) regions. 🔹 Used to fill in small holes or connect nearby segmented regions. Erosion – Shrinks foreground regions by eroding boundaries. 🔹 Used to remove small noise or separate objects that are touching.
43
What region proposal method does RCNN use
Selective Search
44
How does RCNN extract features
It applies a CNN separately to each proposed region
45
What is the output of RCNN
Class label and bounding box for each region
46
What region proposal method does Fast RCNN use
Selective Search
47
How does Fast RCNN extract features
It applies a CNN to the entire image once and uses RoI Pooling on the feature map
48
What is the output of Fast RCNN
Class label and bounding box for each region
49
What region proposal method does Faster RCNN use
Region Proposal Network RPN
50
How does Faster RCNN extract features
It uses a CNN on the entire image and applies RoI Pooling to shared feature maps
51
What is the output of Faster RCNN
Class label and bounding box for each region
52
What region proposal method does Mask RCNN use
Region Proposal Network RPN
53
How does Mask RCNN extract features
It uses a CNN on the entire image and applies RoI Align to shared feature maps
54
What is the output of Mask RCNN
Class label bounding box and segmentation mask
55
What is a feature map
The output of a convolutional layer in a CNN: It’s a 3D tensor: [Height × Width × Channels]. Each channel detects a specific feature (edge, curve, texture) Shows how much of X pattern is in each area
56
What is spatial resolution
The size (in pixels) of the output feature map at any layer (decreases as you go deeper in a CNN) i.e. how big is the output grid
57
What is a feature hierarchy
The idea that lower layers detect simple features (edges, corners), while higher layers detect complex features (faces, wheels, textures) Makes CNNs good at generalising across task.