Fully-connected Neural Networks Flashcards

(14 cards)

1
Q

What is a feedforward (fully-connected) neural network?

A

A generalization of a single neuron: a sequence of layers where each node in layer L takes inputs from all nodes in layer L−1.

2
Q

Why are non-linear activation functions necessary in neural networks?

A

Without non-linearity, stacked layers collapse into one linear transformation, so depth would add no representational power.
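
A minimal sketch of this collapse (the layer sizes here are arbitrary): two stacked nn.Linear layers with no activation between them reduce to a single affine map.

    import torch
    import torch.nn as nn

    torch.manual_seed(0)
    f1 = nn.Linear(4, 8)
    f2 = nn.Linear(8, 3)
    x = torch.randn(2, 4)

    # Composing two affine maps is itself affine: W = W2 @ W1, b = W2 @ b1 + b2
    W = f2.weight @ f1.weight
    b = f2.weight @ f1.bias + f2.bias

    stacked = f2(f1(x))              # two linear "layers", no activation in between
    collapsed = x @ W.T + b          # one equivalent affine map
    print(torch.allclose(stacked, collapsed, atol=1e-6))   # True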

3
Q

How is layer computation expressed in matrix form for two layers?

A

Z¹ = B¹ + W¹ · Z⁰; Z² = B² + W² · Z¹, showing that each layer applies an affine transform to the previous layer's outputs.
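
A short numeric sketch of these two affine transforms (the layer sizes are arbitrary):

    import torch

    torch.manual_seed(0)
    Z0 = torch.randn(5)                 # layer-0 activations (the network inputs)
    W1, B1 = torch.randn(3, 5), torch.randn(3)
    W2, B2 = torch.randn(2, 3), torch.randn(2)

    Z1 = B1 + W1 @ Z0                   # Z¹ = B¹ + W¹ · Z⁰
    Z2 = B2 + W2 @ Z1                   # Z² = B² + W² · Z¹
    print(Z1.shape, Z2.shape)           # torch.Size([3]) torch.Size([2])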

4
Q

How do you create a linear layer in PyTorch and with what parameter initialization?

A

Use torch.nn.Linear(in_features, out_features); by default, weights and biases are initialized from a uniform distribution whose range depends on in_features.
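
A quick way to create the layer and inspect its default parameters, using the 784→10 sizes from the MNIST cards below:

    import torch.nn as nn

    layer = nn.Linear(in_features=784, out_features=10)
    print(layer.weight.shape)   # torch.Size([10, 784])
    print(layer.bias.shape)     # torch.Size([10])
    # the default init draws values from a uniform range that shrinks with in_features
    print(layer.weight.min().item(), layer.weight.max().item())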

5
Q

How do you flatten MNIST images of shape (1,28,28) for a batch of size 32?

A

Apply tensor.flatten(start_dim=1) to get a tensor shape of (32, 784) before feeding into a linear layer.
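
A minimal sketch with random data standing in for an MNIST batch:

    import torch

    batch = torch.randn(32, 1, 28, 28)      # a batch of 32 MNIST-shaped images
    flat = batch.flatten(start_dim=1)       # keep the batch dimension, flatten the rest
    print(flat.shape)                       # torch.Size([32, 784])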

6
Q

How is a single-layer network defined for 10-digit classification?

A

nn.Linear(in_features=784, out_features=10), as MNIST has 784 inputs and 10 output classes.

7
Q

What are the shapes of weight and bias parameters in a linear layer?

A

Weights: (out_features, in_features); biases: (out_features,).

8
Q

Provide the formulas for the sigmoid and tanh activation functions.

A

σ(z) = 1/(1 + e^{−z}); tanh(z) = (e^z − e^{−z})/(e^z + e^{−z}).
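
A small check that these formulas match PyTorch's built-in activations:

    import torch

    z = torch.linspace(-3, 3, 5)
    sigmoid = 1 / (1 + torch.exp(-z))
    tanh = (torch.exp(z) - torch.exp(-z)) / (torch.exp(z) + torch.exp(-z))

    print(torch.allclose(sigmoid, torch.sigmoid(z)))    # True
    print(torch.allclose(tanh, torch.tanh(z)))          # True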

9
Q

Why are sigmoid and tanh less used in hidden layers?

A

They saturate quickly (derivatives near zero), leading to vanishing gradients in deep networks.
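
A small demonstration of the saturation (printed values are approximate):

    import torch

    z = torch.tensor([-10.0, 0.0, 10.0], requires_grad=True)
    torch.sigmoid(z).sum().backward()
    print(z.grad)    # ~[4.5e-05, 0.25, 4.5e-05]: the gradient vanishes once |z| is large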

10
Q

What are the ReLU and Leaky ReLU activation formulas?

A

ReLU(x)=max(0,x); LReLU(x)=x if x≥0 else αx (α≈0.01).
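
A quick check with PyTorch's built-in versions (α is passed as negative_slope):

    import torch
    import torch.nn.functional as F

    x = torch.tensor([-2.0, -0.5, 0.0, 1.5])
    print(F.relu(x))                              # tensor([0.0000, 0.0000, 0.0000, 1.5000])
    print(F.leaky_relu(x, negative_slope=0.01))   # negative inputs scaled by α = 0.01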

11
Q

How do you define a custom PyTorch Module composed of submodules?

A

Subclass nn.Module, initialize submodule layers in __init__, and define the forward pass chaining them.
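
A minimal sketch; the class name is hypothetical and the layer sizes follow the MNIST cards above:

    import torch
    import torch.nn as nn

    class MLP(nn.Module):
        def __init__(self):
            super().__init__()
            self.fc1 = nn.Linear(784, 128)    # submodules are registered in __init__
            self.act = nn.ReLU()
            self.fc2 = nn.Linear(128, 10)

        def forward(self, x):
            # the forward pass chains the submodules
            return self.fc2(self.act(self.fc1(x)))

    model = MLP()
    print(model(torch.randn(32, 784)).shape)      # torch.Size([32, 10])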

12
Q

What is the softmax function formula for multi-class classification?

A

P(i) = e^{ŷ_i}/Σ_{c=1}^C e^{ŷ_c}, converting the logits ŷ into a probability distribution over the C classes.
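
A small check that the formula matches torch.softmax (the logits are chosen arbitrarily):

    import torch

    logits = torch.tensor([2.0, 1.0, 0.1])                # ŷ for C = 3 classes
    probs = torch.exp(logits) / torch.exp(logits).sum()   # e^{ŷ_i} / Σ e^{ŷ_c}
    print(probs.sum())                                    # tensor(1.) — a valid distribution
    print(torch.allclose(probs, torch.softmax(logits, dim=0)))   # True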

13
Q

What is the cross-entropy loss formula shown in the notebook?

A

L = -log(e^{ŷ_j}/Σ_{c=1}^C e^{ŷ_c}), where j is the true class index.
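
A small check that the formula matches torch.nn.functional.cross_entropy for a single example (logits and class index chosen arbitrarily):

    import torch
    import torch.nn.functional as F

    logits = torch.tensor([[2.0, 1.0, 0.1]])   # one example, C = 3 classes
    target = torch.tensor([0])                 # j, the index of the true class

    manual = -torch.log(torch.exp(logits[0, 0]) / torch.exp(logits[0]).sum())
    print(torch.allclose(manual, F.cross_entropy(logits, target)))   # True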

14
Q

How do you transfer model and data to a GPU in PyTorch?

A

Use tensor.to(device) or model.to(device) with device = torch.device('cuda') to move them to GPU memory.
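
A minimal sketch; the CPU fallback is an addition so the snippet also runs without a GPU:

    import torch
    import torch.nn as nn

    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

    model = nn.Linear(784, 10).to(device)   # moves the parameters to device memory
    x = torch.randn(32, 784).to(device)     # data must live on the same device as the model
    print(model(x).device)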
