LSTM Flashcards

(25 cards)

1
Q

What problem do LSTMs solve in RNNs?

A

They mitigate vanishing and exploding gradients over long sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does LSTM stand for?

A

Long Short-Term Memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the main innovation in LSTM architecture?

A

It introduces gates that control memory flow through time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two types of memory in LSTM?

A

Short-term (hidden state) and long-term (cell state).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the three gates in an LSTM?

A

Forget gate, input gate, and output gate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of the forget gate in an LSTM?

A

To control how much of the previous cell state to retain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What activation function is used in LSTM gates?

A

Sigmoid.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the range of sigmoid outputs in LSTM gates?

A

Between 0 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What happens if the forget gate outputs 0?

A

The memory is completely forgotten.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the input gate control in an LSTM?

A

How much new information is added to the cell state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of the candidate memory \u007E Cₜ?

A

It represents new information to be added to the cell state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is tanh used for the candidate memory update?

A

To allow both positive and negative memory contributions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What equation updates the cell state in an LSTM?

A

Cₜ = fₜ·Cₜ₋₁ + iₜ·\u007ECₜ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the output gate determine in an LSTM?

A

How much of the cell state is passed to the hidden state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is the LSTM hidden state computed?

A

hₜ = oₜ · tanh(Cₜ)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is tanh used on the cell state for output?

A

To allow rich activations for the hidden state.

17
Q

How does the LSTM cell state help gradient flow?

A

It allows additive updates, preserving gradients over time.

18
Q

What does the LSTM use to control memory flow?

A

Gates that filter information at each timestep.

19
Q

How does the LSTM prevent vanishing gradients?

A

By maintaining long-term memory through the cell state.

20
Q

What is backpropagation through time (BPTT)?

A

The algorithm used to compute gradients across timesteps in RNNs and LSTMs.

21
Q

What is the trade-off of using LSTMs over RNNs?

A

LSTMs are more powerful but computationally more expensive.

22
Q

What architecture allows deep LSTM learning?

A

Stacked or multi-layer LSTMs.

23
Q

What is one example of a real-world LSTM use case?

A

Predicting financial time series like FTSE350.

24
Q

What kind of data are LSTMs especially good at?

A

Sequential or time-dependent data.

25
What allows LSTMs to model long-term dependencies?
Controlled memory updates using gated architecture.