Week 10 - Kalman Filter, Recurrent Neural Networks Flashcards

(20 cards)

1
Q

Define Kalman Filter

A

It is similar to an HMM, but the Kalman filter is for a continuous hidden state (Zt).

2
Q

What does each hidden state (๐’๐’•) depend on?

A

It depends on the one before it in time, Zt-1, for all t = 1, 2, …, T.

3
Q

What do the observations depend on in a Kalman filter?

A

They depend only on the associated hidden state (Zt).

4
Q
A
5
Q

What is Kalman filter often used for?

A

It is often used for target tracking or motion smoothing with noisy observations, with applications in automated airplane or ship guidance.

6
Q

What is the linear-Gaussian recurrence relation?

A

The linear-Gaussian recurrence relation is the simplest such model. Its form is Zt = a·Zt-1 + εt, where εt is zero-mean Gaussian noise.

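A minimal Python sketch of these ideas, assuming a scalar linear-Gaussian model; the coefficient a, the noise scales, and the sequence length are made-up illustration values, not from the slides. It simulates the recurrence and then runs the standard scalar Kalman filter predict/update recursion on the noisy observations.

import numpy as np

# Scalar linear-Gaussian model (all numbers are illustration values):
#   hidden state:  Z_t = a * Z_{t-1} + process noise,   noise ~ N(0, sigma_z^2)
#   observation:   X_t = Z_t + measurement noise,       noise ~ N(0, sigma_x^2)
rng = np.random.default_rng(0)
a, sigma_z, sigma_x, T = 0.9, 0.5, 1.0, 50

z = np.zeros(T)
x = np.zeros(T)
for t in range(1, T):
    z[t] = a * z[t - 1] + rng.normal(0.0, sigma_z)   # Z_t depends only on Z_{t-1}
    x[t] = z[t] + rng.normal(0.0, sigma_x)           # X_t depends only on Z_t

# Scalar Kalman filter recursion (predict, then update) to track Z_t from noisy X_t.
mu, var = 0.0, 1.0                                   # initial Gaussian belief about Z_0
for t in range(1, T):
    mu_pred = a * mu                                 # predict: push the belief through the recurrence
    var_pred = a**2 * var + sigma_z**2
    k = var_pred / (var_pred + sigma_x**2)           # Kalman gain
    mu = mu_pred + k * (x[t] - mu_pred)              # update: fold in the noisy observation
    var = (1.0 - k) * var_pred
print(f"final true Z_T = {z[-1]:.2f}, filtered estimate = {mu:.2f}")
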
7
Q

What is the main focus of reinforcement learning (RL) in the context of decision-making?

A

RL focuses on finding optimal policies to maximize rewards by making decisions based on observed effects of those choices, rather than predicting sequences from observed data.

8
Q

How can dynamic programming be applied in reinforcement learning problems?

A

Dynamic programming is used in reinforcement learning when the problem is modeled as a Markov Decision Process (MDP), a special kind of sequential graphical model.

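A minimal value-iteration sketch in Python, showing dynamic programming applied to an MDP. The 3-state, 2-action MDP below (transition probabilities, rewards, and discount factor) is entirely made up for illustration and is not from the lecture.

import numpy as np

P = np.array([                      # P[a, s, s'] = probability of moving s -> s' under action a
    [[0.8, 0.2, 0.0],
     [0.0, 0.8, 0.2],
     [0.0, 0.0, 1.0]],
    [[0.2, 0.8, 0.0],
     [0.2, 0.0, 0.8],
     [0.0, 0.1, 0.9]],
])
R = np.array([0.0, 1.0, 10.0])      # deterministic reward R(S_t) for each state
gamma = 0.9                         # discount factor

V = np.zeros(3)
for _ in range(500):
    # Bellman backup: V(s) = R(s) + gamma * max_a sum_{s'} P(s'|s,a) * V(s')
    Q = R[:, None] + gamma * np.einsum("ast,t->sa", P, V)
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new

policy = Q.argmax(axis=1)           # greedy action in each state
print("state values:", V_new, "policy:", policy)
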
9
Q

What components define the progression in a Markov Decision Process (MDP)?

A

An MDP consists of a sequence of input actions Zt that lead to observed states St, where each state has an associated deterministic reward R(St).

10
Q

How are rewards and utility defined in a Markov Decision Process?

A

Each state St has a deterministic reward R(St). The utility U of a sequence of states (S1, …, St) is also a deterministic function of these states.

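A tiny sketch assuming the common additive, optionally discounted, form U(S1, …, St) = sum_k gamma^k · R(Sk); the card only says U is a deterministic function of the states, so this exact form and the reward table below are assumptions.

def utility(states, R, gamma=1.0):
    # U(S_1, ..., S_t) as a deterministic function of the visited states
    return sum(gamma**k * R(s) for k, s in enumerate(states))

reward = {"A": 0.0, "B": 1.0, "C": 10.0}          # made-up deterministic rewards R(S_t)
print(utility(["A", "B", "B", "C"], R=reward.get, gamma=0.9))
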
11
Q

What limitation do standard deep networks have when dealing with sequential data?

A

Standard deep networks do not take the ordering of data into account.

12
Q

How do Recurrent Neural Networks (RNNs) differ from standard deep networks?

A

An RNN treats the hidden layer from the previous stage as an additional input to the current stage, enabling it to process sequential data.

13
Q
A
14
Q

What role does the hidden state Zt-1 play in RNNs?

A

The value of the previous hidden state Zt-1 is included in computing the current hidden state Zt, allowing the network to retain memory of past inputs.

15
Q

How do RNNs handle the order of inputs?

A

RNNs explicitly incorporate order dependence into their structure, allowing them to model temporal or sequential relationships in data.

16
Q

What is an example of an RNN application?

A

Stock prediction

17
Q

What is the goal of using an RNN for stock prediction?

A

The goal is to predict the next day's stock price given the prices from the previous N days (e.g., N = 3).

18
Q

What are the inputs and key parameters in an RNN for stock prediction?

A

Inputs are daily stock prices (plus a bias). The parameters include input weights w0, w1, recurrent weight u1, and output weights v0, v1, which are shared across time steps.

19
Q

How are the hidden and output values computed in a stock prediction RNN?

A

Each hidden state zt = f(w0 + w1·xt + u1·zt-1), and the output yt = f(v0 + v1·zt), using the same weights across all time steps.
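
A small Python sketch of this forward pass, using the parameter names w0, w1, u1, v0, v1 from the cards; the numeric weight values, the choice f = tanh, and the normalized input prices are illustrative assumptions.

import numpy as np

def rnn_forward(prices, w0, w1, u1, v0, v1, f=np.tanh):
    z_prev = 0.0                                 # z_0: initial hidden state
    outputs = []
    for x_t in prices:
        z_t = f(w0 + w1 * x_t + u1 * z_prev)     # z_t = f(w0 + w1*x_t + u1*z_{t-1})
        y_t = f(v0 + v1 * z_t)                   # output y_t = f(v0 + v1*z_t)
        outputs.append(y_t)
        z_prev = z_t                             # the previous hidden state feeds the next step
    return outputs

prices = [0.012, 0.025, -0.008]                  # N = 3 previous days (normalized returns)
preds = rnn_forward(prices, w0=0.1, w1=0.5, u1=0.3, v0=0.0, v1=1.0)
print(preds[-1])                                 # prediction after seeing all N days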

20
Q

What is a major challenge when training RNNs and what are the solutions?

A

RNNs suffer from the vanishing or exploding gradient problem. Solutions include LSTM networks and Transformers.
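
A small numeric illustration of why these gradients vanish or explode: backpropagating through many time steps repeatedly multiplies the gradient by the recurrent weight (times an activation derivative, ignored here), so the product decays toward zero when that factor is below 1 and blows up when it is above 1. The weight values below are arbitrary.

T = 50
for u1 in (0.5, 1.5):                 # made-up recurrent weights for illustration
    grad = 1.0
    for _ in range(T):
        grad *= u1                    # one multiplication per unrolled time step
    print(f"u1 = {u1}: gradient factor after {T} steps is about {grad:.3g}")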