CNN Architectures and Applications Flashcards

Question

What is stride in the context of convolutional layers?

Answer 1

How far the filter moves at each step ## Footnote A larger stride produces a smaller output feature map.

Answer 2

No padding, smaller output ## Footnote Valid padding reduces the spatial dimensions of the feature map.

Answer 3

Keeps input/output sizes the same (zero-padding edges) ## Footnote With a stride of 1, same padding keeps the output the same spatial size as the input.
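
The stride and padding answers above can be combined into a single output-size calculation. The sketch below is illustrative only: the function name `conv_output_size` is made up for this example, and the "same" rule assumes the convention used by Keras/TensorFlow, where the output length is the input length divided by the stride, rounded up.

```python
import math

def conv_output_size(n, kernel, stride=1, padding="valid"):
    """Output length along one spatial axis (hypothetical helper)."""
    if padding == "valid":
        # No padding: the filter must fit entirely inside the input.
        return (n - kernel) // stride + 1
    if padding == "same":
        # Zero-padding is added so that only the stride shrinks the output.
        return math.ceil(n / stride)
    raise ValueError("padding must be 'valid' or 'same'")

print(conv_output_size(28, 3, stride=1, padding="valid"))  # 26
print(conv_output_size(28, 3, stride=1, padding="same"))   # 28
print(conv_output_size(28, 3, stride=2, padding="same"))   # 14
```

With a 28-pixel input and a 3-wide filter, valid padding loses two pixels, same padding loses none at stride 1, and a stride of 2 halves the size.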

Answer 4

Convolution and Activation (e.g., ReLU) ## Footnote This structure is fundamental for feature extraction.
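
To make the convolution-plus-activation pairing concrete, here is a minimal pure-Python sketch. It assumes stride 1, valid padding, a single channel, and no bias; the names `conv2d_valid` and `relu` and the sample values are invented for illustration. As in most deep learning libraries, the "convolution" is computed without flipping the kernel.

```python
def relu(x):
    # ReLU: negative responses are clipped to zero.
    return max(0.0, x)

def conv2d_valid(image, kernel):
    """Slide the kernel over the image (stride 1, no padding), then apply ReLU."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            # Weighted sum of the window under the kernel.
            s = sum(image[i + a][j + b] * kernel[a][b]
                    for a in range(kh) for b in range(kw))
            row.append(relu(s))
        out.append(row)
    return out

image = [
    [1, 2, 0],
    [0, 1, 3],
    [4, 0, 1],
]
# Horizontal difference filter: responds where a pixel exceeds its right neighbour.
kernel = [
    [1, -1],
    [0, 0],
]
feature_map = conv2d_valid(image, kernel)
print(feature_map)  # [[0.0, 2], [0.0, 0.0]]
```

Only the window where the left pixel is brighter than the right one survives the ReLU; the negative responses become zero.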

Answer 5

MaxPooling2D ## Footnote Max pooling keeps the strongest activation in each window, so prominent features survive downsampling.

Answer 6

Pool size: (2,2), stride: (2,2) ## Footnote This configuration is standard for reducing dimensions.
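
The pool size (2,2) with stride (2,2) configuration can be sketched in plain Python as follows. The function name `max_pool_2x2` and the sample feature map are assumptions made for this example; incomplete windows at the right or bottom edge are simply dropped here.

```python
def max_pool_2x2(feature_map):
    """Max pooling with pool size (2, 2) and stride (2, 2)."""
    h, w = len(feature_map), len(feature_map[0])
    pooled = []
    for i in range(0, h - 1, 2):          # step by the stride over rows
        row = []
        for j in range(0, w - 1, 2):      # step by the stride over columns
            window = (feature_map[i][j], feature_map[i][j + 1],
                      feature_map[i + 1][j], feature_map[i + 1][j + 1])
            row.append(max(window))       # keep the strongest activation
        pooled.append(row)
    return pooled

fm = [
    [1, 3, 2, 4],
    [5, 6, 1, 0],
    [7, 2, 9, 8],
    [0, 1, 3, 4],
]
print(max_pool_2x2(fm))  # [[6, 4], [7, 9]]
```

A 4×4 map becomes 2×2: each spatial dimension is halved, so the number of values drops by a factor of four.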

Answer 7

Flatten them and feed into final classifier ## Footnote Flattening transforms the multi-dimensional output into a single vector.

Answer 8

An early CNN architecture for digit recognition (MNIST) ## Footnote LeNet laid the groundwork for many modern CNN architectures.

Answer 9

Alternating Conv → ReLU → Pool ## Footnote This sequence is fundamental for effective feature extraction.

Answer 10

Shared weights ## Footnote Sharing weights reduces the number of parameters in the model.
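
A quick parameter count shows why weight sharing matters. The layer sizes below (a 28×28 grayscale input, 32 filters or units) are hypothetical numbers chosen for illustration, and the helper names are not from any library.

```python
def conv_params(kernel_h, kernel_w, in_channels, filters):
    # Each filter has one weight per kernel cell per input channel, plus one bias.
    # The same filter is reused at every spatial position.
    return (kernel_h * kernel_w * in_channels + 1) * filters

def dense_params(inputs, units):
    # Each unit has its own weight for every input, plus one bias.
    return (inputs + 1) * units

conv = conv_params(3, 3, in_channels=1, filters=32)
dense = dense_params(28 * 28, units=32)
print(conv)   # 320
print(dense)  # 25120
```

Under these assumptions the convolutional layer needs 320 parameters, while a fully connected layer with the same number of outputs per position-independent unit needs 25,120.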

Answer 11

20 epochs, Batch size: 128, Optimizer: Adam ## Footnote These are example settings; suitable values depend on the dataset and the model.

Answer 12

* Spatial Dropout
* Hyperparameter Tuning
## Footnote These techniques help prevent overfitting and improve model performance.

Answer 13

Vanishing Gradients or Exploding Gradients ## Footnote These issues can hinder the training process significantly.
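
A rough way to see both problems: backpropagation multiplies one factor per layer, so a factor consistently below or above 1 compounds with depth. The sketch below is a deliberately simplified model (a single constant factor per layer, a hypothetical depth of 50), not a description of any particular network.

```python
def gradient_scale(per_layer_factor, depth):
    """Size of the gradient after passing back through `depth` layers,
    assuming every layer scales it by the same factor (a simplification)."""
    return per_layer_factor ** depth

vanishing = gradient_scale(0.5, 50)  # shrinks toward zero
exploding = gradient_scale(1.5, 50)  # grows without bound

print(f"{vanishing:.2e}")
print(f"{exploding:.2e}")
```

With a factor of 0.5 the gradient reaching the first layer is on the order of 1e-15, so early layers barely learn; with 1.5 it is in the hundreds of millions, so updates become unstable.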

Answer 14

Makes training smoother, faster, and more stable ## Footnote Batch normalization helps in stabilizing the learning process.

Answer 15

Norm = (x - mean) / sqrt(var + ε)
Out = γ * Norm + β
## Footnote This formula standardizes the input for each mini-batch.
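
The formula can be traced on a small batch of scalar activations. This is a training-time sketch only: the function name `batch_norm` and the default values for γ, β, and ε are assumptions, and the running statistics that real layers keep for inference are left out.

```python
import math

def batch_norm(batch, gamma=1.0, beta=0.0, eps=1e-5):
    """Apply Norm = (x - mean) / sqrt(var + eps), then Out = gamma * Norm + beta."""
    mean = sum(batch) / len(batch)
    var = sum((x - mean) ** 2 for x in batch) / len(batch)  # biased (population) variance
    return [gamma * (x - mean) / math.sqrt(var + eps) + beta for x in batch]

out = batch_norm([1.0, 2.0, 3.0, 4.0])
out_mean = sum(out) / len(out)
out_var = sum((v - out_mean) ** 2 for v in out) / len(out)
print(out)
print(out_mean, out_var)
```

With γ = 1 and β = 0 the output has a mean of approximately 0 and a variance just under 1 (slightly less because of ε); γ and β then let the network rescale and shift that standardized signal.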

Answer 16

* Reduces sensitivity to weight init
* Slightly regularises model (less overfitting)
* Allows higher learning rates
## Footnote Batch normalization enhances the overall performance of the model.

Answer 17

Before the activation ## Footnote Placing batch normalization before activation functions can lead to better results.

Answer 18

Extract features ## Footnote Convolutional layers are vital for identifying key patterns in the input data.

Answer 19

Shrink spatial size ## Footnote Pooling layers help reduce computational complexity.

Answer 20

Make decisions ## Footnote Fully connected layers synthesize information from previous layers to output final predictions.

Answer 21

Non-linearity ## Footnote Non-linear activation functions are crucial for learning complex relationships.

Answer 22

Stabilises training ## Footnote Batch normalization helps in maintaining consistent learning dynamics.

Answer 23

Reusing a pretrained model for your own task ## Footnote Transfer learning leverages existing knowledge to improve learning efficiency.

Answer 24

* Saves time
* Works well with small datasets
* Leverages existing learned features
## Footnote Transfer learning is especially useful in scenarios with limited data.

Answer 25

Freeze layers and only retrain the final classifier ## Footnote This strategy helps retain learned features while adapting to new tasks.

Answer 26

Pretend you have more data by slightly changing existing images ## Footnote Data augmentation helps improve model robustness.

Answer 27

* Rotate
* Flip
* Zoom
* Brightness tweak
* Crop
## Footnote These techniques enhance the diversity of the training dataset.
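
Three of the listed transformations (flip, crop, brightness tweak) are simple enough to sketch on a tiny grayscale image stored as nested lists. The function names and pixel values are invented for illustration; real pipelines typically apply such transformations randomly with a library rather than by hand.

```python
def horizontal_flip(image):
    # Mirror each row left to right.
    return [row[::-1] for row in image]

def crop(image, top, left, height, width):
    # Keep a height x width window starting at (top, left).
    return [row[left:left + width] for row in image[top:top + height]]

def adjust_brightness(image, delta, lo=0, hi=255):
    # Add delta to every pixel and clamp to the valid range.
    return [[min(hi, max(lo, p + delta)) for p in row] for row in image]

image = [
    [10, 20, 30],
    [40, 50, 60],
    [70, 80, 250],
]
print(horizontal_flip(image))        # [[30, 20, 10], [60, 50, 40], [250, 80, 70]]
print(crop(image, 0, 1, 2, 2))       # [[20, 30], [50, 60]]
print(adjust_brightness(image, 10))  # [[20, 30, 40], [50, 60, 70], [80, 90, 255]]
```

Each call yields a new image with the same label, which is how augmentation stretches a small dataset.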

Answer 28

Randomly 'turn off' neurons ## Footnote Dropout helps prevent overfitting by ensuring that the model does not rely too heavily on any individual neuron.

Answer 29

* Overfitting
* Reliance on specific paths in the network
## Footnote Dropout encourages a more generalized model.

Answer 30

Dropout(0.5) → each neuron is dropped with probability 0.5 during training ## Footnote A rate of 0.5 is a common starting point; dropout is switched off at inference time.
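
The behaviour of a dropout rate of 0.5 can be sketched as below. This version uses "inverted" dropout, where surviving activations are scaled by 1 / (1 - rate) so their expected value is unchanged; that scaling, the function name `dropout`, and the `rng` parameter are assumptions of this sketch rather than something stated in the flashcards.

```python
import random

def dropout(activations, rate=0.5, training=True, rng=random):
    """Zero each activation with probability `rate` during training."""
    if not training:
        # At inference time every neuron is kept and nothing is rescaled.
        return list(activations)
    keep = 1.0 - rate
    # Survivors are scaled up so the expected sum stays the same.
    return [a / keep if rng.random() < keep else 0.0 for a in activations]

rng = random.Random(0)  # fixed seed so the run is repeatable
train_out = dropout([1.0] * 1000, rate=0.5, rng=rng)
dropped = sum(1 for v in train_out if v == 0.0)
print(dropped)  # close to 500, varies with the seed

print(dropout([1.0, 2.0, 3.0], training=False))  # [1.0, 2.0, 3.0]
```

Roughly half of the 1000 activations are zeroed on any given training pass, and a different random half is dropped each time.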

Answer 31

Uses only 3×3 convolutions ## Footnote VGG architecture is known for its simplicity and depth.

Answer 32

Won the ImageNet challenge (ILSVRC) in 2012 ## Footnote AlexNet marked a significant advancement in deep learning for image classification.

Answer 33

Uses skip connections (identity mappings) ## Footnote Skip connections help in training very deep networks by mitigating the vanishing gradient problem.
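
A skip connection adds the block's input back onto its output, so the block computes F(x) + x. The sketch below works on plain lists; `residual_block` and the toy transforms are hypothetical names, and the transform is assumed to preserve the input's length.

```python
def residual_block(x, transform):
    """Return transform(x) + x, element by element."""
    fx = transform(x)
    # The identity path carries x around the transform unchanged.
    return [f + xi for f, xi in zip(fx, x)]

def zero_transform(x):
    # Stand-in for a layer that has learned nothing useful yet.
    return [0.0 for _ in x]

def double(x):
    return [2.0 * v for v in x]

print(residual_block([1.0, 2.0], zero_transform))  # [1.0, 2.0]
print(residual_block([1.0, 2.0], double))          # [3.0, 6.0]
```

Even when the transform contributes nothing, the input passes through intact, which gives gradients a direct route back to earlier layers.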

Answer 34

ReLU: max(0, x); Fast and simple activation function ## Footnote ReLU is widely used due to its efficiency and effectiveness in learning.