Recurrent Neural Networks Flashcards
(25 cards)
What is the key idea behind Recurrent Neural Networks (RNNs)?
They maintain a hidden state to remember information across time steps.
What type of data are RNNs designed for?
Sequential or time-series data.
Why is a feedforward network unsuitable for time-dependent inputs?
It treats all inputs as independent and ignores temporal order.
How does an RNN incorporate memory?
By passing a hidden state from one timestep to the next.
What does the formula aₜ = f(Wxₜ + Uaₜ₋₁ + b) represent?
The update rule for the RNN hidden state.
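A minimal NumPy sketch of this update rule with f = tanh (the dimensions, random weights, and toy sequence are illustrative assumptions, not from the lecture):

```python
import numpy as np

def rnn_step(x_t, a_prev, W, U, b):
    """One hidden-state update: a_t = tanh(W x_t + U a_{t-1} + b)."""
    return np.tanh(W @ x_t + U @ a_prev + b)

# Illustrative sizes: 3-dimensional inputs, 4-dimensional hidden state.
rng = np.random.default_rng(0)
W = rng.standard_normal((4, 3))   # input-to-hidden weights
U = rng.standard_normal((4, 4))   # hidden-to-hidden (recurrent) weights
b = np.zeros(4)                   # bias

a = np.zeros(4)                   # initial hidden state, set to zero
for x_t in rng.standard_normal((5, 3)):  # a toy sequence of 5 input vectors
    a = rnn_step(x_t, a, W, U, b)        # the same W, U, b are reused at every timestep
```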
What is parameter sharing in RNNs?
Using the same weights at every timestep to process inputs consistently.
What is the benefit of parameter sharing?
Reduces the number of parameters and improves generalisation.
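A rough illustration of the saving, assuming the toy sizes from the sketch above: the shared cell's parameter count is independent of sequence length, while a feedforward layer reading the flattened sequence grows with it.

```python
input_size, hidden_size, seq_len = 3, 4, 100   # illustrative sizes

# Shared RNN cell: W (hidden x input), U (hidden x hidden), b (hidden),
# reused at all 100 timesteps.
rnn_params = hidden_size * input_size + hidden_size * hidden_size + hidden_size

# Feedforward layer over the flattened sequence: its weight matrix
# grows linearly with the sequence length.
ff_params = hidden_size * (input_size * seq_len) + hidden_size

print(rnn_params)  # 32
print(ff_params)   # 1204
```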
What activation function is commonly used in basic RNNs?
Tanh or ReLU.
What does ‘many-to-one’ RNN architecture mean?
A sequence of inputs produces a single output.
What does ‘many-to-many’ RNN architecture mean?
A sequence of inputs produces a sequence of outputs.
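A sketch of both output modes, continuing the NumPy example above (it reuses rnn_step, W, U, b, and rng; the readout weights V and c are extra assumptions added for illustration):

```python
V = rng.standard_normal((2, 4))   # assumed hidden-to-output readout weights
c = np.zeros(2)

def run_rnn(xs, many_to_many=False):
    """Run the cell over a sequence and return either one output or one per step."""
    a = np.zeros(4)
    outputs = []
    for x_t in xs:
        a = rnn_step(x_t, a, W, U, b)
        outputs.append(V @ a + c)     # an output is available at every timestep
    return outputs if many_to_many else outputs[-1]

xs = rng.standard_normal((5, 3))
y_last = run_rnn(xs)                    # many-to-one: a single output for the whole sequence
y_seq = run_rnn(xs, many_to_many=True)  # many-to-many: one output per input
```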
What is the main limitation of standard RNNs during training?
They suffer from vanishing and exploding gradient problems.
What causes the vanishing gradient problem in RNNs?
Repeated multiplication of gradients by factors smaller than 1 in magnitude during backpropagation through time, which shrinks them exponentially over many timesteps.
What causes the exploding gradient problem in RNNs?
Repeated multiplication of gradients by factors larger than 1 in magnitude, which makes them grow exponentially into very large gradients.
What is the impact of vanishing gradients on learning?
Prevents learning of long-term dependencies.
What is the impact of exploding gradients?
Leads to unstable updates and divergence during training.
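A toy demonstration of both problems: the gradient reaching early timesteps is roughly a product of one factor per step, so factors even slightly below or above 1 vanish or explode over a long sequence (the values 0.9, 1.1, and 50 steps below are illustrative assumptions).

```python
for factor in (0.9, 1.1):
    grad = 1.0
    for _ in range(50):       # 50 steps of repeated multiplication during backpropagation
        grad *= factor
    print(factor, grad)       # 0.9 -> ~0.005 (vanishing), 1.1 -> ~117.4 (exploding)
```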
What does backpropagation through time (BPTT) do?
Unrolls the network across time and computes gradients through each step.
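A minimal sketch of BPTT on a scalar linear RNN, aₜ = w·aₜ₋₁ + xₜ with loss L = a_T (a deliberately simplified stand-in for the lecture's network):

```python
def bptt_scalar(xs, w):
    """Unroll a_t = w*a_{t-1} + x_t, then backpropagate dL/dw for loss L = a_T."""
    # Forward pass: unroll the recurrence and keep every hidden state.
    a = [0.0]                          # initial hidden state, set to zero
    for x in xs:
        a.append(w * a[-1] + x)

    # Backward pass: walk the unrolled graph from the last timestep to the first.
    grad_w, grad_a = 0.0, 1.0          # dL/da_T = 1
    for t in range(len(xs), 0, -1):
        grad_w += grad_a * a[t - 1]    # step t's contribution to dL/dw
        grad_a *= w                    # dL/da_{t-1} = dL/da_t * w
    return grad_w

print(bptt_scalar([7.0, 7.0, 7.0], 0.5))  # 14.0, matching d/dw of 7*w**2 + 7*w + 7 at w = 0.5
```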
In an RNN, what role does the weight wₐ play in memory?
It controls how much past information is carried forward.
What happens if the memory weight wₐ is 0.5 over 2 steps?
The signal quickly diminishes (e.g., 10.5 with input 7).
What happens if the memory weight wₐ is 2.0 over 2 steps?
The signal explodes (e.g., 168 with input 7).
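The exact recurrence behind the values 10.5 and 168 is not given on the cards, so the sketch below only illustrates the underlying effect: whatever signal is stored in the hidden state is rescaled by wₐ at every step, so it fades for wₐ = 0.5 and grows for wₐ = 2.0.

```python
signal = 7.0                      # the example input from the cards
for w_a in (0.5, 2.0):
    carried = signal
    for _ in range(2):            # carry the remembered signal forward for 2 steps
        carried *= w_a            # each step rescales the stored signal by w_a
    print(w_a, carried)           # 0.5 -> 1.75 (fades), 2.0 -> 28.0 (grows)
```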
What kind of real-world task was used to motivate RNNs in this lecture?
Rainfall prediction using radar image sequences.
What is the key advantage of RNNs over feedforward networks for sequences?
They model temporal dependencies through recurrent connections.
What is the initial hidden state a₋₁ usually set to?
Zero.
Why is unrolling the RNN necessary for training?
To apply gradient-based optimisation across all timesteps.
How does an RNN process inputs over time?
Sequentially, one timestep at a time, updating the hidden state.