Autoencoders and Computer Vision Flashcards
What is Dimensionality Reduction?
Shrinking your data without losing its meaning.
What is the Curse of Dimensionality?
More features → data becomes sparse.
What is an Autoencoder?
A special kind of neural network that learns to compress then rebuild data.
What is the key idea behind an Autoencoder?
Learn a smart encoding of the input, then use that encoding to reconstruct the original.
What are the components of an Autoencoder?
- Encoder: Compresses input into smaller vector
- Latent Space (Code): The compressed form
- Decoder: Reconstructs the original from the code
- Loss: Measures how close output is to original (e.g., MSE)
What is the goal of an Autoencoder?
Minimise the difference between input and output.
Fill in the blank: The __________ is the compressed form in an Autoencoder.
Latent Space (Code)
What is the architecture of an Autoencoder?
Input size = 784, hidden size = 128, code size = 32.
What is a Denoising Autoencoder?
Trains the autoencoder to remove noise from input images.
What does the input and target look like for a Denoising Autoencoder?
- Input = Noisy image
- Target = Clean image
What is a Convolutional Autoencoder (CAE)?
Autoencoders for image data using Conv2D layers.
What is the role of the Encoder in a Convolutional Autoencoder?
It uses Conv2D layers and MaxPooling2D to compress the input.
What is the role of the Decoder in a Convolutional Autoencoder?
It reconstructs the image using Conv2D layers and upsampling methods.
True or False: Learnable upsampling in a Decoder leads to better performance than fixed upsampling.
True.
What are the applications of Autoencoders?
- Denoising
- Compression
- Image Colourisation
- Anomaly Detection
- Feature Extraction
Fill in the blank: In anomaly detection, a large __________ error indicates a likely anomaly.
reconstruction
What was the result of using a Convolutional Autoencoder on medical ultrasound images?
Successfully removed added annotations & noise.
What is the goal of a Computer Vision Pipeline?
Get machines to ‘see’ and understand images/videos.
What are the three levels of tasks in Computer Vision?
- Low-level
- Mid-level
- High-level
What are examples of low-level tasks in Computer Vision?
- Edge detection
- Texture analysis
- Color analysis
What are examples of mid-level tasks in Computer Vision?
- Segmentation
- Object tracking
What are examples of high-level tasks in Computer Vision?
- Object recognition
- Scene understanding
What is Image Segmentation?
Segmenting = Splitting an image into meaningful parts.
What are the types of Image Segmentation?
- Unsupervised Segmentation
- Supervised Segmentation
- Semantic Segmentation
- Instance Segmentation