Chapter 16 Flashcards
(21 cards)
Why are RNNs useful for Natural Language Processing (NLP)?
They process sequences, making them suitable for tasks involving word or character order.
What are common NLP tasks?
Text generation, sentiment analysis, and machine translation.
What is a character-level RNN?
An RNN that predicts the next character in a sequence based on previous characters.
What dataset was used for training in the character RNN example?
The complete works of William Shakespeare.
How are characters encoded for training in RNNs?
Using one-hot encoding or integer encoding.
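Both encodings can be sketched in a few lines of NumPy. The mini-corpus below is a placeholder; the chapter's example uses the full Shakespeare text.

```python
import numpy as np

text = "to be or not to be"  # hypothetical mini-corpus for illustration

# Integer encoding: map each distinct character to an ID.
chars = sorted(set(text))
char_to_id = {c: i for i, c in enumerate(chars)}
encoded = np.array([char_to_id[c] for c in text])

# One-hot encoding: each ID becomes a vector with a single 1.
one_hot = np.eye(len(chars))[encoded]

print(encoded[:5])    # first five character IDs
print(one_hot.shape)  # (sequence length, vocabulary size)
```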
What is ‘truncated backpropagation through time’?
A technique to train RNNs on shorter sequences instead of full-length texts.
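In practice this means slicing the long text into fixed-length windows, with the target shifted one step ahead of the input. A minimal sketch (the function name is illustrative, not a library API):

```python
def make_windows(seq, window):
    # Split a long sequence into (input, target) pairs of length `window`,
    # where the target is the input shifted one step ahead.
    pairs = []
    for i in range(0, len(seq) - window):
        pairs.append((seq[i:i + window], seq[i + 1:i + 1 + window]))
    return pairs

pairs = make_windows(list("hello world"), window=4)
print(pairs[0])  # (['h', 'e', 'l', 'l'], ['e', 'l', 'l', 'o'])
```

Gradients then flow back only through each window, not the full text.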
What does the ‘temperature’ parameter control in text generation?
The randomness of the generated text; lower values make it more deterministic.
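The mechanism is simple: divide the logits by the temperature before the softmax. A NumPy sketch (function name and logits are illustrative):

```python
import numpy as np

def sample_with_temperature(logits, temperature, rng):
    # Low temperature sharpens the distribution (more deterministic);
    # high temperature flattens it (more random).
    scaled = np.asarray(logits, dtype=float) / temperature
    scaled -= scaled.max()  # numerical stability
    probs = np.exp(scaled) / np.exp(scaled).sum()
    return rng.choice(len(probs), p=probs)

rng = np.random.default_rng(0)
logits = [2.0, 1.0, 0.1]
# With a very low temperature, the most likely index wins almost always.
print(sample_with_temperature(logits, 0.01, rng))  # → 0
```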
What is the difference between stateless and stateful RNNs?
Stateless resets state after each batch; stateful preserves state between batches.
Why use a stateful RNN?
To learn patterns longer than one training window, by carrying the hidden state from each batch into the next (which requires batches to contain contiguous, non-shuffled sequences).
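The difference can be illustrated without Keras, using a toy recurrent update with fixed weights (a real RNN layer learns its weights; this only shows the state handling):

```python
import numpy as np

def run_batches(batches, stateful):
    # Toy recurrent update h = tanh(0.5*h + x), fixed weights for illustration.
    h = 0.0
    outputs = []
    for batch in batches:
        if not stateful:
            h = 0.0  # stateless: reset hidden state at every batch
        for x in batch:
            h = np.tanh(0.5 * h + x)
        outputs.append(h)
    return outputs

batches = [[1.0, 1.0], [1.0, 1.0]]
print(run_batches(batches, stateful=False))  # same value twice: no memory across batches
print(run_batches(batches, stateful=True))   # second value differs: state carried over
```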
What is sentiment analysis?
Classifying text based on emotional tone, e.g., positive or negative reviews.
What dataset is used for sentiment analysis in the lecture?
The IMDb movie review dataset.
What is an embedding layer?
A layer that maps each word ID to a dense vector capturing semantic similarity.
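At its core, an embedding layer is just a trainable lookup table with one row per word ID. A NumPy sketch (random, untrained vectors for illustration):

```python
import numpy as np

vocab_size, embed_dim = 10, 4
rng = np.random.default_rng(42)
# One trainable row of `embed_dim` floats per word ID.
embedding_matrix = rng.normal(size=(vocab_size, embed_dim))

word_ids = np.array([3, 1, 3])        # a toy sentence of word IDs
vectors = embedding_matrix[word_ids]  # lookup: shape (3, embed_dim)

print(vectors.shape)                        # (3, 4)
print(np.allclose(vectors[0], vectors[2]))  # True: same ID, same vector
```

During training, the rows are adjusted so that words used in similar contexts end up with nearby vectors.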
What does ‘mask_zero=True’ do in an embedding layer?
It makes the layer emit a mask so that downstream layers ignore padding tokens (ID 0).
What is an encoder-decoder architecture used for?
Translation and sequence generation tasks.
Why is the input reversed in encoder-decoder models?
So the beginning of the sentence is fed to the encoder last, placing it closest to the first words the decoder must produce and shortening that dependency path.
What is a GRU cell?
A simplified version of LSTM with fewer gates and a single hidden state.
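A single GRU step can be written out in NumPy. This is a sketch with random weights; gate conventions vary slightly between formulations (here: update gate `z` blends old state and candidate, reset gate `r` gates the old state inside the candidate):

```python
import numpy as np

def gru_step(x, h, Wz, Wr, Wh, Uz, Ur, Uh):
    sigmoid = lambda a: 1.0 / (1.0 + np.exp(-a))
    z = sigmoid(Wz @ x + Uz @ h)              # update gate
    r = sigmoid(Wr @ x + Ur @ h)              # reset gate
    h_tilde = np.tanh(Wh @ x + Uh @ (r * h))  # candidate state
    return (1 - z) * h + z * h_tilde          # single hidden state (no separate cell state)

rng = np.random.default_rng(0)
n = 3  # hidden size == input size, for simplicity
Wz, Wr, Wh, Uz, Ur, Uh = (rng.normal(size=(n, n)) for _ in range(6))

h = np.zeros(n)
h = gru_step(rng.normal(size=n), h, Wz, Wr, Wh, Uz, Ur, Uh)
print(h.shape)  # (3,)
```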
How does an RNN generate long text?
By generating one character at a time and feeding the output back as input.
What is the TimeDistributed layer used for?
To apply the same Dense layer across each time step.
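The effect is easy to show in NumPy: one shared weight matrix applied at every time step, which is what `TimeDistributed(Dense(units))` does (weights below are random, for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
batch, steps, features, units = 2, 5, 8, 3
x = rng.normal(size=(batch, steps, features))

# A single (features, units) weight matrix shared across all time steps.
W = rng.normal(size=(features, units))
b = np.zeros(units)
y = x @ W + b  # broadcasting applies the same Dense to each step

print(y.shape)  # (2, 5, 3)
```

Note that in recent Keras versions a `Dense` layer already operates on the last axis of a 3D input, so it behaves like `TimeDistributed(Dense)` there.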
Why does character-level RNN have limited context?
Because it is trained on short windows (e.g., 100 characters), so it cannot learn patterns longer than its training window.
How can you improve a Char-RNN?
Use deeper networks, tune temperature, or increase training data.
Why is dropout used in RNNs?
To prevent overfitting by randomly deactivating neurons during training.
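The mechanics of (inverted) dropout in a NumPy sketch; note that Keras recurrent layers expose both `dropout` (on inputs) and `recurrent_dropout` (on the hidden state):

```python
import numpy as np

def dropout(x, rate, rng):
    # Training time: zero each activation with probability `rate`, and scale
    # survivors by 1/(1-rate) so the expected activation is unchanged,
    # meaning nothing needs rescaling at inference time.
    mask = rng.random(x.shape) >= rate
    return x * mask / (1.0 - rate)

rng = np.random.default_rng(0)
x = np.ones(10000)
y = dropout(x, rate=0.5, rng=rng)
print(y.mean())  # close to 1.0: expectation preserved
```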