Chapter 15 Flashcards
(23 cards)
What type of data are RNNs designed to process?
Sequential or time-series data.
What is a recurrent neuron?
A neuron that, at each time step, receives the current input plus its own output from the previous time step, so past outputs influence future ones.
How is an RNN trained?
Using backpropagation through time (BPTT), unrolling the network across time steps.
What is the “unrolling” of an RNN?
Representing the same RNN layer at multiple time steps to visualize flow across time.
What are the two main weight matrices in a recurrent neuron?
One for the current input x(t) (Wx) and one for the previous time step's output y(t−1) (Wy).
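A minimal NumPy sketch of this recurrence (the layer sizes and the tanh activation here are illustrative assumptions, not prescribed by the card):

```python
import numpy as np

# Toy recurrent layer: 3 input features, 2 recurrent units (sizes are arbitrary).
rng = np.random.default_rng(42)
Wx = rng.normal(size=(3, 2))   # weights for the current input x(t)
Wy = rng.normal(size=(2, 2))   # weights for the previous output y(t-1)
b = np.zeros(2)

def step(x_t, y_prev):
    # y(t) = tanh(x(t) @ Wx + y(t-1) @ Wy + b)
    return np.tanh(x_t @ Wx + y_prev @ Wy + b)

y = np.zeros(2)                        # initial state
for x_t in rng.normal(size=(5, 3)):    # 5 time steps of made-up inputs
    y = step(x_t, y)
print(y)
```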
What is a memory cell in RNNs?
A structure that preserves state over time, helping the network retain information.
What is the sequence-to-sequence architecture?
A model that takes a sequence as input and produces a sequence as output (e.g., time-series forecasting).
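A hedged Keras sketch of a sequence-to-sequence forecaster; layer sizes and the SimpleRNN/TimeDistributed choices are assumptions for illustration:

```python
import tensorflow as tf

# Sequence-to-sequence: one prediction per time step.
# Windows of any length, 1 input feature, 1 forecast value per step.
model = tf.keras.Sequential([
    tf.keras.layers.SimpleRNN(20, return_sequences=True, input_shape=[None, 1]),
    tf.keras.layers.SimpleRNN(20, return_sequences=True),
    tf.keras.layers.TimeDistributed(tf.keras.layers.Dense(1)),
])
model.compile(loss="mse", optimizer="adam")
```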
What is a sequence-to-vector model?
A model that takes a sequence input and produces a single output (e.g., sentiment analysis).
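A minimal Keras sketch, assuming a single-feature input sequence and one output value:

```python
import tensorflow as tf

# Sequence-to-vector: read the whole sequence, output a single value
# (return_sequences defaults to False, so only the last output is kept).
model = tf.keras.Sequential([
    tf.keras.layers.SimpleRNN(20, return_sequences=True, input_shape=[None, 1]),
    tf.keras.layers.SimpleRNN(20),   # only the final time step's output
    tf.keras.layers.Dense(1),        # e.g., a sentiment score or the next value
])
```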
What is a vector-to-sequence model?
A model that takes a single input and generates a sequence (e.g., image captioning).
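One possible Keras sketch, assuming the single input is a 50-dimensional vector (say, an image embedding) repeated over 10 time steps; all sizes are made up:

```python
import tensorflow as tf

# Vector-to-sequence: repeat one input vector across time steps,
# then let an RNN emit one output per step.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=[50]),           # one input vector
    tf.keras.layers.RepeatVector(10),            # feed it at each of 10 time steps
    tf.keras.layers.LSTM(32, return_sequences=True),
    tf.keras.layers.TimeDistributed(
        tf.keras.layers.Dense(1000, activation="softmax")),  # e.g., word probabilities
])
```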
What is an encoder-decoder model in NLP?
A model that encodes an input sequence to a vector and decodes it into an output sequence (e.g., translation).
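A rough functional-API sketch of an encoder-decoder; vocabulary sizes and dimensions are made up, and real translation models typically add attention on top:

```python
import tensorflow as tf

encoder_in = tf.keras.layers.Input(shape=[None], dtype=tf.int32)  # source token IDs
decoder_in = tf.keras.layers.Input(shape=[None], dtype=tf.int32)  # target token IDs

# Encoder: compress the input sequence into its final LSTM states.
enc_emb = tf.keras.layers.Embedding(10000, 128)(encoder_in)
_, state_h, state_c = tf.keras.layers.LSTM(256, return_state=True)(enc_emb)

# Decoder: generate the output sequence starting from the encoder's states.
dec_emb = tf.keras.layers.Embedding(8000, 128)(decoder_in)
dec_out = tf.keras.layers.LSTM(256, return_sequences=True)(
    dec_emb, initial_state=[state_h, state_c])
probas = tf.keras.layers.Dense(8000, activation="softmax")(dec_out)

model = tf.keras.Model(inputs=[encoder_in, decoder_in], outputs=[probas])
model.compile(loss="sparse_categorical_crossentropy", optimizer="adam")
```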
What is the difference between naive forecasting and deep learning for time series?
Naive forecasting simply repeats the last observed value (or the value from one period earlier); deep models learn temporal patterns and usually forecast more accurately.
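A tiny NumPy illustration of the naive baseline on a synthetic series (the series itself is made up):

```python
import numpy as np

# Naive forecast: the prediction for t+1 is simply the value at t.
series = np.sin(np.arange(100) / 5) + np.random.randn(100) * 0.1
naive_pred = series[:-1]
targets = series[1:]
naive_mae = np.mean(np.abs(targets - naive_pred))
print(f"naive MAE: {naive_mae:.3f}")   # any useful deep model should beat this baseline
```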
What is MSE and why is it used?
Mean Squared Error, the average of the squared differences between predictions and targets; it is the usual loss function for time-series forecasting and other regression tasks.
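A one-line sketch of the formula in NumPy:

```python
import numpy as np

def mse(y_true, y_pred):
    # Mean Squared Error: average of squared prediction errors.
    return np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2)

print(mse([1.0, 2.0, 3.0], [1.1, 1.9, 3.2]))  # ≈ 0.02
```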
What is the drawback of predicting one step at a time in time series forecasting?
Error accumulation over successive steps.
What is the advantage of predicting multiple steps at once in time series forecasting?
Errors do not accumulate across successive predictions, and the extra output terms provide more gradient signal, making training more stable.
What is the main issue when handling long sequences in RNNs?
Unstable gradients and memory loss of earlier inputs.
What is batch normalization and how does it help RNNs?
Normalizing activations using statistics computed across the batch; it can stabilize and speed up training, but it is hard to apply effectively across time steps in RNNs.
What is layer normalization in RNNs?
Normalizing across the feature dimension of each sample; it works the same way at every time step, making it better suited to RNNs than batch normalization.
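A minimal sketch of a custom RNN cell that applies layer normalization after each step, in the spirit of the chapter's custom-cell approach (the class name and sizes are illustrative):

```python
import tensorflow as tf

class LNSimpleRNNCell(tf.keras.layers.Layer):
    def __init__(self, units, **kwargs):
        super().__init__(**kwargs)
        self.state_size = units
        self.output_size = units
        self.cell = tf.keras.layers.SimpleRNNCell(units, activation=None)
        self.layer_norm = tf.keras.layers.LayerNormalization()
        self.activation = tf.keras.activations.get("tanh")

    def call(self, inputs, states):
        # Run the plain recurrent step, then normalize before the activation.
        outputs, new_states = self.cell(inputs, states)
        norm_outputs = self.activation(self.layer_norm(outputs))
        return norm_outputs, [norm_outputs]

model = tf.keras.Sequential([
    tf.keras.layers.RNN(LNSimpleRNNCell(20), return_sequences=True,
                        input_shape=[None, 1]),
    tf.keras.layers.Dense(1),
])
```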
What is the short-term memory problem in RNNs?
RNNs forget inputs after many time steps due to vanishing gradients.
What is an LSTM cell?
A memory cell with gates (forget, input, output) that maintains long-term memory in sequences.
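A hedged Keras sketch stacking LSTM layers for forecasting (sizes are arbitrary):

```python
import tensorflow as tf

# LSTM layers handle long sequences better than SimpleRNN thanks to their gates.
model = tf.keras.Sequential([
    tf.keras.layers.LSTM(32, return_sequences=True, input_shape=[None, 1]),
    tf.keras.layers.LSTM(32),
    tf.keras.layers.Dense(1),
])
model.compile(loss="mse", optimizer="adam")
# A tf.keras.layers.GRU layer can be dropped in the same way (see the next card).
```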
What is a GRU cell?
A simplified variant of the LSTM that merges the cell state and hidden state and uses fewer gates (an update gate and a reset gate).
How do 1-D convolutional layers work in sequence modeling?
They apply filters across time steps to detect temporal patterns.
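A short sketch using Conv1D to detect local patterns and shorten the sequence before recurrent layers; kernel size, stride, and filter counts are assumptions:

```python
import tensorflow as tf

# The convolution slides 20 filters of width 4 over the time axis and
# downsamples by a factor of 2 (so targets must be downsampled to match).
model = tf.keras.Sequential([
    tf.keras.layers.Conv1D(filters=20, kernel_size=4, strides=2, padding="valid",
                           input_shape=[None, 1]),
    tf.keras.layers.GRU(20, return_sequences=True),
    tf.keras.layers.TimeDistributed(tf.keras.layers.Dense(1)),
])
```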
What is WaveNet and what does it do?
A deep neural network for sequence generation using dilated convolutions to capture long-range dependencies.
What is the role of dilation in WaveNet?
It allows the network to look back further in time without increasing the number of layers.
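A simplified WaveNet-style stack in Keras, with causal convolutions and doubling dilation rates; filter counts and depth are illustrative:

```python
import tensorflow as tf

# Doubling dilation rates (1, 2, 4, 8, repeated) let deeper layers see
# exponentially further back in time with the same small kernel.
model = tf.keras.Sequential()
model.add(tf.keras.layers.Input(shape=[None, 1]))
for rate in (1, 2, 4, 8) * 2:
    model.add(tf.keras.layers.Conv1D(filters=20, kernel_size=2, padding="causal",
                                     activation="relu", dilation_rate=rate))
model.add(tf.keras.layers.Conv1D(filters=1, kernel_size=1))  # one output per time step
model.compile(loss="mse", optimizer="adam")
```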