Week 4 - ChatGPT Flashcards
(11 cards)
What limitation of single-layer perceptrons is addressed by multilayer neural networks?
Single-layer perceptrons can only learn linearly separable decision boundaries, so they cannot represent functions such as XOR; multilayer networks with nonlinear hidden units can approximate arbitrary nonlinear decision boundaries.
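To make the XOR counterexample concrete, here is a minimal numpy sketch (my own illustration, not part of the original cards): a 2-2-1 network with hand-picked step-unit weights computes XOR, which no single-layer perceptron can represent.

```python
# A 2-2-1 network solving XOR with hand-picked weights: the two hidden
# units compute OR and AND, and the output fires for OR-but-not-AND.
import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])

def step(a):
    return (a > 0).astype(int)

W_hidden = np.array([[1.0, 1.0],    # hidden unit 1: OR
                     [1.0, 1.0]])   # hidden unit 2: AND
b_hidden = np.array([-0.5, -1.5])   # thresholds that make them OR / AND

w_out = np.array([1.0, -2.0])       # output: OR minus twice AND
b_out = -0.5

h = step(X @ W_hidden.T + b_hidden)
y = step(h @ w_out + b_out)
print(y)  # [0 1 1 0], i.e. XOR, which is not linearly separable
```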
What is the universal approximation theorem in neural networks?
A feedforward network with a single hidden layer of nonlinear units can approximate any continuous function on a compact domain to arbitrary accuracy, given sufficiently many hidden neurons and suitable weights.
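Stated more formally (one common version of the theorem, associated with Cybenko and Hornik; the notation below is my own):

```latex
% For any continuous f on a compact set K and any eps > 0, there exist
% N, coefficients alpha_i, weight vectors w_i, and biases b_i such that
\[
F(x) = \sum_{i=1}^{N} \alpha_i \, \sigma\!\left(w_i^{\top} x + b_i\right)
\quad\text{satisfies}\quad
\sup_{x \in K} \left| F(x) - f(x) \right| < \varepsilon ,
\]
% where sigma is a suitable nonconstant activation such as the sigmoid.
```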
Why are differentiable activation functions needed in multilayer neural networks?
Because backpropagation computes the gradient of the cost with respect to each weight via the chain rule, which requires the derivative of the activation function; a hard-threshold unit has zero derivative almost everywhere, so gradient descent gets no signal through it.
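For example, with the sigmoid activation used in classic backprop, the derivative f′(net) that appears in the error terms below has a simple closed form (a small sketch of my own):

```python
# Sigmoid and its derivative: f'(net) = f(net) * (1 - f(net)).
# A step function, by contrast, has derivative 0 almost everywhere,
# so it would pass no gradient back through the network.
import numpy as np

def sigmoid(net):
    return 1.0 / (1.0 + np.exp(-net))

def sigmoid_prime(net):
    s = sigmoid(net)
    return s * (1.0 - s)

print(sigmoid_prime(0.0))  # 0.25, the sigmoid's maximum slope
```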
How does backpropagation compute the weight updates in a neural network?
By using the chain rule to propagate the error backward from the output layer through the hidden layers and updating weights using gradient descent.
What is the general update rule for weights using backpropagation?
w ← w − η * ∂J/∂w, where J is the cost function and η is the learning rate.
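As a toy illustration of this rule (my own example with a hypothetical one-dimensional cost J(w) = (w − 3)², whose gradient is 2(w − 3)):

```python
# Repeatedly applying w <- w - eta * dJ/dw walks w to the minimiser of J.
eta = 0.1   # learning rate
w = 0.0     # initial weight
for _ in range(100):
    grad_J = 2.0 * (w - 3.0)    # dJ/dw for J(w) = (w - 3)^2
    w = w - eta * grad_J        # the update rule from the card
print(w)    # approximately 3.0, the minimiser of J
```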
What is the error term for an output unit in backpropagation?
δ_k = (t_k − z_k) * f′(net_k), where t_k is the target and z_k is the output.
What is the error term for a hidden unit in backpropagation?
δ_j = f′(net_j) * Σ_k (w_kj * δ_k), where δ_k are the error terms of the output units and w_kj is the weight from hidden unit j to output unit k.
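Combining the last three cards, here is a minimal numpy sketch of one stochastic backprop step for a single-hidden-layer sigmoid network (my own illustration of the formulas; the function and variable names are mine, and bias terms are added for completeness):

```python
# One backprop step implementing the error terms from the cards:
#   delta_k = (t_k - z_k) * f'(net_k)               (output units)
#   delta_j = f'(net_j) * sum_k w_kj * delta_k      (hidden units)
# The updates use a plus sign because delta already carries the negative
# sign of dJ/dnet, so w += eta * delta * input is gradient descent on J.
import numpy as np

def f(net):                     # sigmoid activation
    return 1.0 / (1.0 + np.exp(-net))

def f_prime(net):               # its derivative: f(net) * (1 - f(net))
    s = f(net)
    return s * (1.0 - s)

def backprop_step(x, t, W1, b1, W2, b2, eta=0.5):
    # Forward pass.
    net_j = W1 @ x + b1         # hidden pre-activations
    y = f(net_j)                # hidden outputs
    net_k = W2 @ y + b2         # output pre-activations
    z = f(net_k)                # network outputs

    # Backward pass: error terms per the two formulas above.
    delta_k = (t - z) * f_prime(net_k)
    delta_j = f_prime(net_j) * (W2.T @ delta_k)

    # Gradient-descent updates (outer products give the per-weight gradients).
    W2 += eta * np.outer(delta_k, y)
    b2 += eta * delta_k
    W1 += eta * np.outer(delta_j, x)
    b1 += eta * delta_j
    return W1, b1, W2, b2
```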
What is the main purpose of using stochastic (online) backpropagation?
To update weights after each training example, which often leads to faster convergence and better generalisation than batch updates.
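A sketch of the online regime in code (reusing the hypothetical f and backprop_step from the previous sketch; the data, architecture, and epoch count are arbitrary choices of mine):

```python
# Online (stochastic) updates: the weights move after EVERY example,
# rather than once per full pass as in batch training.
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
T = np.array([[0.], [1.], [1.], [0.]])            # XOR targets again
W1, b1 = rng.normal(size=(4, 2)), np.zeros(4)     # 2 inputs -> 4 hidden
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)     # 4 hidden -> 1 output

for epoch in range(2000):
    for i in rng.permutation(len(X)):             # shuffled, one example at a time
        W1, b1, W2, b2 = backprop_step(X[i], T[i], W1, b1, W2, b2)

z = f(W2 @ f(W1 @ X.T + b1[:, None]) + b2[:, None])
print(z.round(2))   # typically close to [[0. 1. 1. 0.]], depending on the init
```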
What problem does early stopping help prevent during training?
It helps prevent overfitting by stopping training when validation error starts to increase, even if training error is still decreasing.
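The usual implementation keeps the best-so-far weights and stops once validation error has failed to improve for a set number of epochs; a skeleton of my own (train_one_epoch and validation_error are hypothetical placeholders for your own routines):

```python
import copy

def train_with_early_stopping(model, patience=10, max_epochs=1000):
    best_err, best_model, waited = float("inf"), None, 0
    for epoch in range(max_epochs):
        train_one_epoch(model)            # hypothetical: one pass over training data
        err = validation_error(model)     # hypothetical: error on held-out data
        if err < best_err:
            best_err = err
            best_model = copy.deepcopy(model)   # remember the best weights
            waited = 0
        else:
            waited += 1
            if waited >= patience:        # validation error stopped improving
                break                     # stop before overfitting gets worse
    return best_model                     # weights from the best validation epoch
```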
How is a Radial Basis Function (RBF) network different from a Multilayer Perceptron (MLP)?
An RBF network's hidden units respond according to the distance between the input and a centre (a radial activation), and the network typically has a single hidden layer; an MLP's units compute weighted sums of their inputs passed through nonlinear activations, layer by layer.
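The usual choice of radial function is a Gaussian centred at c_j (notation mine):

```latex
% Gaussian basis function: the response of hidden unit j depends only on
% the distance between the input x and the unit's centre c_j.
\[
\phi_j(x) = \exp\!\left( -\, \frac{\lVert x - c_j \rVert^{2}}{2 \sigma_j^{2}} \right),
\qquad
y(x) = \sum_{j=1}^{M} w_j \, \phi_j(x) .
\]
```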
What are the two main training phases of an RBF network?
First, determine the centres (and widths) of the basis functions in an unsupervised phase, typically by clustering the inputs; then learn the output weights in a supervised phase, typically by linear least squares, since the output is linear in those weights.
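A minimal sketch of the two phases (my own illustration, assuming scikit-learn's KMeans for the clustering phase and the Gaussian basis above; the names and defaults are mine):

```python
import numpy as np
from sklearn.cluster import KMeans

def train_rbf(X, t, n_centres=10, sigma=1.0):
    # Phase 1 (unsupervised): place the basis-function centres by clustering.
    centres = KMeans(n_clusters=n_centres, n_init=10).fit(X).cluster_centers_

    # Phase 2 (supervised): the output is linear in the weights, so solve
    # a linear least-squares problem on the basis-function outputs.
    d2 = ((X[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
    Phi = np.exp(-d2 / (2.0 * sigma ** 2))   # design matrix, one phi_j per column
    w, *_ = np.linalg.lstsq(Phi, t, rcond=None)
    return centres, w
```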