ResNets Flashcards
(24 cards)
What problem do ResNets aim to solve?
Vanishing and exploding gradients, and the resulting degradation in training accuracy, in very deep neural networks.
What causes vanishing gradients in deep networks?
Repeated multiplication of small derivatives during backpropagation.
What is a common sign of gradient instability in deep models?
Abnormal gradient distributions, such as near-zero or spiked values.
What are three signs of unstable gradient flow?
Abnormal gradients, chaotic learning curves, and irregular layer outputs.
Why is ReLU preferred over sigmoid in deep networks?
ReLU's gradient is 1 for positive inputs, so it avoids the saturation that drives sigmoid gradients toward zero during backpropagation.
What is the core idea behind residual connections?
Instead of learning y = f(x), learn y = f(x) + x.
What is a residual block?
A network unit that adds its input to its output after a series of transformations.
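A minimal sketch of the y = f(x) + x idea, assuming PyTorch (the cards don't name a framework); ResidualUnit and its small two-layer f are illustrative names, not from the original paper:

```python
import torch
import torch.nn as nn

class ResidualUnit(nn.Module):
    def __init__(self, dim):
        super().__init__()
        # f(x): any learned transformation; here a small two-layer MLP for illustration.
        self.f = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))

    def forward(self, x):
        # y = f(x) + x: add the input back onto the transformation's output.
        return self.f(x) + x

x = torch.randn(8, 64)
y = ResidualUnit(64)(x)   # same shape as x: (8, 64)
```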
Why do residual connections help with training deep models?
They allow gradients to flow more easily through the network.
What does f(x) + x mean in a ResNet?
The output is the sum of the learned transformation and the original input.
What happens if f(x) learns nothing in a ResNet?
The identity connection ensures the network can still pass input forward.
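A tiny numeric illustration of this card (again assuming PyTorch): if the learned transformation contributes nothing, the addition reduces the block to the identity.

```python
import torch

x = torch.randn(4, 8)
f_x = torch.zeros_like(x)   # pretend f(x) learned nothing and outputs zeros
y = f_x + x
assert torch.equal(y, x)    # the input still passes through unchanged
```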
What analogy links ResNets and LSTMs?
Both preserve information over structure—LSTM across time, ResNet across depth.
How do residual connections relate to vanishing gradients?
They reduce the chance of vanishing gradients by providing an unimpeded gradient path.
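A short worked derivation of that gradient path (standard chain-rule algebra, not taken from the cards):

```latex
% Gradient through one residual block y = f(x) + x, by the chain rule:
\[
  \frac{\partial L}{\partial x}
  = \frac{\partial L}{\partial y}\left(\frac{\partial f(x)}{\partial x} + I\right)
  = \frac{\partial L}{\partial y}\,\frac{\partial f(x)}{\partial x}
  + \frac{\partial L}{\partial y}
\]
% Even if the first term is tiny (vanishing), the second term carries the
% upstream gradient through the identity path unattenuated.
```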
What are skip connections in ResNets not equivalent to?
Encoder-decoder skip paths like those in U-Net, which concatenate features across scales rather than adding an identity shortcut.
What does a typical ResNet block contain?
Two convolutional layers (each followed by batch normalization), a ReLU between them, and a skip connection that adds the block's input to the output before a final ReLU.
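A sketch of such a block, assuming PyTorch and stride-1, same-channel convolutions; BasicBlock and its layer names are illustrative:

```python
import torch
import torch.nn as nn

class BasicBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))   # first conv -> BN -> ReLU
        out = self.bn2(self.conv2(out))            # second conv -> BN
        out = out + x                              # skip connection: add the input
        return self.relu(out)                      # final ReLU after the addition

x = torch.randn(1, 64, 32, 32)
print(BasicBlock(64)(x).shape)  # torch.Size([1, 64, 32, 32])
```

When the block changes the spatial size or channel count, the shortcut typically uses a 1x1 convolution so the shapes still match before the addition.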
What is one advantage of using residual blocks in CNNs?
They allow very deep networks to be trained effectively.
What is ResNet-34?
A 34-layer residual network designed for ImageNet-level performance.
What architecture enabled the training of 100+ layer CNNs?
Residual Neural Networks (ResNets).
What is one limitation of ResNets?
Reduced interpretability due to multiple forward paths.
What can happen if residual blocks are poorly designed?
They may default to identity mappings and learn nothing useful.
What is a computational cost of using ResNets?
More parameters and longer training time from the added layers, even though the skip connections themselves add almost no parameters.
Why might debugging ResNets be difficult?
Because of the complexity introduced by residual pathways.
What do residual blocks encourage the network to learn?
Only the difference (residual) between input and desired output.
What paper introduced ResNets?
‘Deep Residual Learning for Image Recognition’ by He et al., 2015.
Why is deeper not always better in plain CNNs?
Deeper networks can suffer from degraded training due to gradient issues.