CNN Architectures and Applications Flashcards
What is the primary problem with Fully Connected Networks (FCNs) regarding image processing?
Images are flattened into 1D vectors, losing spatial structure
This loss of spatial structure makes it difficult for FCNs to effectively understand visual data.
How do Convolutional Neural Networks (CNNs) address the issues of FCNs?
Preserve spatial locality and handle images more intelligently
CNNs maintain the spatial relationships in images, improving accuracy.
What is a Convolutional Neural Network (CNN)?
A deep learning model that works especially well with visual data like images
CNNs are specifically designed to process pixel data.
What is the key idea behind how CNNs function?
They learn patterns in an image using filters
Filters help in detecting features such as edges and textures.
What is the convolution operation in CNNs?
A small matrix called a kernel slides over the image, multiplying and summing to create a feature map
This operation extracts features from the input image.
What does the term ‘stride’ refer to in the context of CNNs?
How far the filter moves each step
Stride affects the size of the output feature map.
What are the two types of padding in CNNs?
‘valid’ (no padding) and ‘same’ (padding with zeros)
Padding is used to control the spatial dimensions of the output.
What is the function of the convolutional layer in a CNN architecture?
Extracts feature maps
Convolutional layers are crucial for identifying patterns in images.
What does the pooling layer do in a CNN?
Shrinks size while keeping important information
Pooling helps reduce the computational load and overfitting.
What is the purpose of the activation layer in a CNN?
Adds non-linearity (e.g., ReLU)
Activation functions enable CNNs to learn complex patterns.
What is the role of the fully connected layer in CNNs?
Final decision-making (e.g., classification)
Fully connected layers integrate features learned by previous layers.
How do CNNs learn?
By adjusting filters to minimize prediction error
Learning occurs through a process called backpropagation.
What is the basic process of CNN learning?
Input image goes through convolution + pooling, output passed to fully connected layers, loss function calculates error, backpropagation computes gradients, optimizer updates weights
This process iteratively improves the model’s predictions.
What is an example of the input and output of a convolutional layer?
Input: 32×32 image, Output: 30×30×16
The output dimensions depend on the kernel size and stride.
What are the total learnable parameters in a CNN example provided?
255,632
The number of parameters affects the model’s capacity to learn.
What is max pooling in CNNs?
Picks the biggest number in a region
Max pooling helps retain the most significant features.
What is average pooling in CNNs?
Takes the average of numbers in a region
Average pooling provides a smoother representation of features.
What are the benefits of pooling layers in CNNs?
- Reduces size
- Controls overfitting
- Speeds up training
Pooling layers are essential for efficient training and model performance.
What happens in the fully connected layer of a CNN?
All features combine to make a decision
This layer synthesizes information from all previous layers.
What do convolutional layers learn using?
Filters
Filters are crucial for feature extraction.
What is the purpose of pooling in CNNs?
Downsample while keeping key info
Pooling reduces dimensionality, making the model more efficient.
What does activation introduce in CNNs?
Non-linear twist (usually ReLU)
Non-linear activations allow CNNs to model complex functions.
What is the role of fully connected layers in CNNs?
Combine all for final output
Fully connected layers determine the final classification or regression output.
What is the function of filters (kernels) in convolutional layers?
Scan over input image and create feature maps
Filters are essential for identifying patterns in the data.