CVI FINAL EXAM DAY Flashcards
(57 cards)
What is the key difference between RoI Pooling and RoI Align
RoI Pooling rounds coordinates causing misalignment while RoI Align uses interpolation to preserve spatial accuracy
Why is SURF considered faster than SIFT
SURF uses approximations like box filters and integral images while SIFT uses precise gradients making it slower
What kind of transformation does a Region Proposal Network RPN learn
It learns to transform anchor boxes into tighter object proposals by predicting shifts and scales
Why is C3D not used for optical flow estimation
Because C3D performs video classification not pixelwise motion estimation so it does not output motion vectors
How does DeepLab use atrous convolution and ASPP for segmentation
It uses atrous convolution to enlarge receptive field and ASPP to capture multiscale context features
Fast RCNN
Deep learning for object detection
Faster RCNN
Deep learning for object detection with learned region proposals
RCNN
Deep learning for object detection
OmniMotion
Advanced motion estimation using video tracking across occlusions
Lucas Kanade
Traditional optical flow estimation
Horn Schunck
Traditional optical flow estimation
FlowNet
Deep learning for optical flow estimation
VoxelMorph
Deep learning for image registration
What does Otsus method actually optimise during threshold selection
It maximises inter class variance between foreground and background pixel distributions
Name one deep learning model commonly used for image registration
VoxelMorph
Prewitt
Traditional edge detection using gradient approximation
AlexNet
Deep learning for image classification
What is the difference between self supervised and unsupervised learning
Self supervised uses pseudo labels from data while unsupervised finds structure without labels
Robinson
Traditional edge detection using compass kernels
C3D
Deep learning for video classification and action recognition
How do skip connections in U Net help during segmentation
They pass high resolution features from encoder to decoder to preserve localisation and detail
HED
Deep learning for edge detection using holistically nested networks
DoG
Traditional edge detection using Difference of Gaussians
How does contrastive loss function mathematically encourage representation learning
It pulls similar pairs closer and pushes dissimilar pairs apart in embedding space using similarity scores