Recurrent Neural Networks Flashcards
(12 cards)
What kind of data are RNNs designed to process?
Sequential data such as speech, music, text (sentences), DNA sequences, and time series.
Why do we need recurrence (feedback loops) in neural networks for sequences?
To remember past inputs and share learned weights across time steps, which lets the network handle variable-length inputs and use context.
What is a vanilla RNN’s key characteristic in its computational graph?
It contains feedback loops (recurrent connections) that carry the hidden state from one time step to the next; unrolled over time, the same weights are reused at every step.
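A minimal sketch of one recurrent step, assuming a tanh nonlinearity; the weight names (W_xh, W_hh, b_h) are illustrative, not part of the original card.

```python
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    """One vanilla RNN step: the previous hidden state is fed back (the loop)."""
    return np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)

rng = np.random.default_rng(0)
input_dim, hidden_dim, seq_len = 3, 4, 5
W_xh = rng.normal(size=(input_dim, hidden_dim)) * 0.1
W_hh = rng.normal(size=(hidden_dim, hidden_dim)) * 0.1
b_h = np.zeros(hidden_dim)

h = np.zeros(hidden_dim)                      # initial hidden state
for x_t in rng.normal(size=(seq_len, input_dim)):
    h = rnn_step(x_t, h, W_xh, W_hh, b_h)     # same weights reused at every step
print(h.shape)                                # (4,)
```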
Name four typical RNN input/output structures (by sequence length).
Many-to-one (e.g., sentiment classification), one-to-many (e.g., image captioning), many-to-many (e.g., machine translation), and synchronized many-to-many (e.g., per-frame video classification).
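A rough sketch of how one recurrence supports different input/output structures, depending on which hidden states are read out; the helper name run_rnn is hypothetical.

```python
import numpy as np

def run_rnn(xs, W_xh, W_hh, b_h):
    """Return the hidden state at every time step."""
    h, hs = np.zeros(W_hh.shape[0]), []
    for x_t in xs:
        h = np.tanh(x_t @ W_xh + h @ W_hh + b_h)
        hs.append(h)
    return np.stack(hs)

rng = np.random.default_rng(1)
xs = rng.normal(size=(6, 3))                  # sequence of 6 inputs
W_xh = rng.normal(size=(3, 4)) * 0.1
W_hh = rng.normal(size=(4, 4)) * 0.1
hs = run_rnn(xs, W_xh, W_hh, np.zeros(4))

many_to_one = hs[-1]    # e.g. sentiment: read only the last hidden state
many_to_many = hs       # e.g. per-frame labels: read every hidden state
```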
What is the vanishing gradient problem in RNNs?
Gradients diminish exponentially over long sequences, making it hard for vanilla RNNs to learn long-term dependencies.
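A small numerical illustration, assuming a linear recurrence for simplicity: the gradient that flows back through T steps is repeatedly multiplied by the recurrent Jacobian, so its norm shrinks (or explodes) roughly geometrically with T.

```python
import numpy as np

rng = np.random.default_rng(0)
hidden_dim, T = 8, 50
W_hh = rng.normal(size=(hidden_dim, hidden_dim)) * 0.3   # small recurrent weights

grad = np.ones(hidden_dim)            # pretend gradient arriving at step T
norms = []
for _ in range(T):
    grad = W_hh.T @ grad              # backprop through one (linear) step
    norms.append(np.linalg.norm(grad))

print(norms[0], norms[-1])            # norm decays roughly geometrically
```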
What are the two gates in a Gated Recurrent Unit (GRU)?
The update gate (how much of the previous state to keep versus replace) and the reset gate (how much of the previous state to use when forming the candidate state).
How does a GRU update its hidden state?
The update gate interpolates between the previous hidden state and a candidate state; the candidate is computed from the current input and the previous state after it has been scaled by the reset gate.
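A minimal sketch of one GRU step under one common convention (z is the update gate, r the reset gate); the weight names are illustrative.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x_t, h_prev, Wz, Uz, Wr, Ur, Wh, Uh):
    z = sigmoid(x_t @ Wz + h_prev @ Uz)               # update gate
    r = sigmoid(x_t @ Wr + h_prev @ Ur)               # reset gate
    h_tilde = np.tanh(x_t @ Wh + (r * h_prev) @ Uh)   # candidate state
    return (1 - z) * h_prev + z * h_tilde             # interpolate old vs. candidate

rng = np.random.default_rng(0)
d_in, d_h = 3, 4
Ws = [rng.normal(size=(d_in, d_h)) for _ in range(3)]
Us = [rng.normal(size=(d_h, d_h)) for _ in range(3)]
h = gru_step(rng.normal(size=d_in), np.zeros(d_h),
             Ws[0], Us[0], Ws[1], Us[1], Ws[2], Us[2])
print(h.shape)   # (4,)
```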
What are the three gates in an LSTM cell?
Input gate, forget gate, and output gate, which control writing to, erasing from, and reading out of the cell state, respectively.
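A minimal sketch of one LSTM step, assuming the standard formulation with a separate cell state c; stacking the four transforms into one weight matrix is an implementation choice, and the names are illustrative.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """W: (d_in, 4*d_h), U: (d_h, 4*d_h), b: (4*d_h,), blocks stacked as [i, f, o, g]."""
    d_h = h_prev.shape[0]
    pre = x_t @ W + h_prev @ U + b
    i, f, o = (sigmoid(pre[k * d_h:(k + 1) * d_h]) for k in range(3))
    g = np.tanh(pre[3 * d_h:])         # candidate cell update
    c = f * c_prev + i * g             # forget old memory, write new content
    h = o * np.tanh(c)                 # read from the cell state
    return h, c

rng = np.random.default_rng(0)
d_in, d_h = 3, 4
W = rng.normal(size=(d_in, 4 * d_h))
U = rng.normal(size=(d_h, 4 * d_h))
h, c = lstm_step(rng.normal(size=d_in), np.zeros(d_h), np.zeros(d_h), W, U, np.zeros(4 * d_h))
print(h.shape, c.shape)   # (4,) (4,)
```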
Why are LSTMs more complex to train than GRUs or vanilla RNNs?
They have more gates and parameters, increasing computational cost and requiring more data to learn effectively.
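A back-of-the-envelope parameter count makes the "more gates, more parameters" point concrete: a vanilla RNN has one input-plus-recurrent transform, a GRU has three (two gates plus the candidate), and an LSTM has four (three gates plus the candidate). The dimensions below are illustrative.

```python
def rnn_like_params(d_in, d_h, n_blocks):
    """Weights + biases for n_blocks copies of an (x, h) -> h transform."""
    return n_blocks * (d_h * (d_in + d_h) + d_h)

d_in, d_h = 128, 256
print("vanilla RNN:", rnn_like_params(d_in, d_h, 1))   # 1 transform
print("GRU:        ", rnn_like_params(d_in, d_h, 3))   # 2 gates + candidate
print("LSTM:       ", rnn_like_params(d_in, d_h, 4))   # 3 gates + candidate
```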
Compare vanilla RNN, GRU, and LSTM in terms of training difficulty and effectiveness.
Training difficulty: RNN < GRU < LSTM; effectiveness: RNN < GRU ≈ LSTM.
Give real-world examples where LSTMs have been successfully applied.
Google Translate for speech translation, Facebook's daily automatic translations, and Apple's QuickType and Siri text prediction.
What major architecture supplanted RNNs for sequence tasks and why?
Transformers: self-attention lets them process all positions of a sequence in parallel and capture long-range dependencies directly, avoiding the step-by-step recurrence in which gradients vanish.
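A minimal sketch of single-head scaled dot-product self-attention, the mechanism that lets every position attend to every other position in one parallel matrix product; the projection names are illustrative.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """All positions attend to all positions at once (no recurrence)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])            # (seq_len, seq_len)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)     # row-wise softmax
    return weights @ V

rng = np.random.default_rng(0)
seq_len, d_model = 5, 8
X = rng.normal(size=(seq_len, d_model))
Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)   # (5, 8)
```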