L10 - Reinforcement Learning Flashcards
(6 cards)
What is Reinforcement Learning?
Reinforcement Learning (RL) is a type of machine learning where an agent learns to make decisions by interacting with an environment. The agent’s goal is to maximize cumulative rewards over time by learning which actions yield the best outcomes.
Describe the relationship between an Agent and an Environment
An Agent can observe an environemnt.
An agent can perform Actions which have an effect in the Environment.
What is the RL Q-function?
In Reinforcement Learning RL the Agent tries to maximise the total reward.
The Q-function captures the expected total reward for an action and so together with a Policy helps the Agent choose which action to take.
What is the RL Policy?
The rules an Agent should follow to determin its actions. Eg maximise reward.
What is a DQN?
Deep Q Network
A Deel Neural Network that uses a Q-function to make decisions.