EXAM3 Flashcards
(64 cards)
What does Seq2Seq stand for?
Sequence to Sequence
True or False: Seq2Seq models are primarily used for tasks involving sequential data.
True
What is the primary function of attention mechanisms in Seq2Seq models?
To allow the model to focus on different parts of the input sequence when generating the output sequence.
Fill in the blank: In a Seq2Seq model, the _______ network encodes the input sequence.
encoder
What type of neural network is commonly used in Seq2Seq architectures?
Recurrent Neural Network (RNN)
Name one advantage of using attention mechanisms in Seq2Seq models.
They improve the model’s ability to handle long input sequences, because the decoder no longer depends on a single fixed-length context vector.
In the context of reinforcement learning, what does MDP stand for?
Markov Decision Process
True or False: An MDP is defined by a set of states, actions, transition probabilities, and rewards.
True
What are the four main components of a Markov Decision Process?
States, Actions, Transition Probabilities, Rewards
Fill in the blank: In reinforcement learning, the goal is to learn a policy that maximizes the _______.
cumulative reward
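Worked example (a sketch, not part of the original card): cumulative reward is commonly computed as a discounted sum of per-step rewards. The discount factor `gamma` and the function name `discounted_return` are assumptions added for illustration. Setting `gamma=1.0` gives the plain undiscounted sum.

```python
def discounted_return(rewards, gamma=0.9):
    """Return sum of rewards[t] * gamma**t, accumulated from the last step back."""
    total = 0.0
    for reward in reversed(rewards):
        # Each earlier step adds its own reward plus the discounted future total.
        total = reward + gamma * total
    return total


# Three steps of reward 1 with gamma = 0.5: 1 + 0.5 + 0.25
print(discounted_return([1, 1, 1], gamma=0.5))
```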
What does the term ‘policy’ refer to in reinforcement learning?
A strategy that defines the action to take in each state.
What is the difference between a deterministic policy and a stochastic policy?
A deterministic policy always selects the same action for a given state, while a stochastic policy selects actions according to a probability distribution.
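Worked example (a sketch under stated assumptions): the two kinds of policy can be contrasted on a toy problem where the state is a single number and the actions are "left" and "right". The state encoding, the action names, and the 0.8/0.2 probabilities are all hypothetical.

```python
import random

ACTIONS = ["left", "right"]


def deterministic_policy(state):
    # The same state always maps to the same action.
    return "right" if state >= 0 else "left"


def stochastic_policy(state, rng):
    # The state only fixes a probability distribution; the action is sampled from it.
    p_right = 0.8 if state >= 0 else 0.2
    return rng.choices(ACTIONS, weights=[1.0 - p_right, p_right])[0]


rng = random.Random(0)
print([deterministic_policy(3) for _ in range(5)])    # always the same action
print([stochastic_policy(3, rng) for _ in range(5)])  # can vary between calls
```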
What is a reward function in the context of MDPs?
A function that assigns a scalar reward to each state, state-action pair, or transition, signaling to the agent how desirable that outcome is.
True or False: The Bellman equation is used to calculate the value of states in an MDP.
True
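Worked example (a sketch, not the only way to use the equation): value iteration repeatedly applies the Bellman optimality update V(s) = max over a of sum over s' of P(s'|s,a) * (R + gamma * V(s')) until the values stop changing. The two-state MDP below, its rewards, and the discount factor `gamma` are hypothetical.

```python
# Hypothetical MDP: P[state][action] is a list of (probability, next_state, reward).
P = {
    "A": {"stay": [(1.0, "A", 0.0)], "go": [(1.0, "B", 1.0)]},
    "B": {"stay": [(1.0, "B", 2.0)], "go": [(1.0, "A", 0.0)]},
}


def value_iteration(P, gamma=0.5, tol=1e-10):
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            # Bellman optimality update: best expected reward plus discounted next value.
            best = max(
                sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)
                for outcomes in P[s].values()
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            return V


print(value_iteration(P))
```

With these numbers the values settle near V(A) = 3 and V(B) = 4: staying in B earns 2 per step, which discounts to 2 / (1 - 0.5) = 4, and going from A to B earns 1 + 0.5 * 4 = 3.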
Fill in the blank: The _______ in an MDP describes the probability of transitioning from one state to another given a specific action.
transition probability
What is reinforcement learning?
A type of machine learning where an agent learns to make decisions by taking actions in an environment to maximize cumulative rewards.
What is the purpose of ‘prompting techniques’ in machine learning?
To guide the model’s responses through the wording, format, or examples supplied in its input, without changing the model’s weights.
Fill in the blank: Prompting techniques are often used in _______ models to enhance performance.
language
True or False: Prompting can involve providing examples to the model before asking it to generate a response.
True
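Worked example (a sketch): providing examples before the actual question is often called few-shot prompting. The helper below only assembles the prompt text; the function name and the "Input:"/"Output:" labels are assumptions, and no particular model or API is implied.

```python
def build_few_shot_prompt(examples, query):
    """Place solved examples before the new query so the model can imitate them."""
    lines = []
    for source, target in examples:
        lines.append(f"Input: {source}")
        lines.append(f"Output: {target}")
    # The final "Output:" is left open for the model to complete.
    lines.append(f"Input: {query}")
    lines.append("Output:")
    return "\n".join(lines)


print(build_few_shot_prompt([("cat", "chat"), ("dog", "chien")], "bird"))
```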
What is a common application of Seq2Seq models?
Machine translation
What does ‘attention score’ represent in attention mechanisms?
The importance of each input token to the current output token being generated.
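Worked example (a sketch assuming dot-product scoring, which is one common choice among several): raw scores compare the decoder's current query vector with each input token's key vector, a softmax turns them into weights that sum to 1, and the weights mix the value vectors into a context vector. The function names and the toy vectors are hypothetical.

```python
import math


def attention_weights(query, keys):
    # Raw score for each input token: dot product between the query and that token's key.
    scores = [sum(q * k for q, k in zip(query, key)) for key in keys]
    # Softmax (shifted by the max for numerical stability) normalizes scores to weights.
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def context_vector(weights, values):
    # Weighted average of the value vectors.
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]


keys = [[1.0, 0.0], [0.0, 1.0]]
weights = attention_weights([1.0, 0.0], keys)
print(weights)                        # the first token gets the larger weight
print(context_vector(weights, keys))  # keys reused as values for brevity
```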
How does self-attention differ from traditional attention mechanisms?
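In traditional (encoder-decoder) attention, the decoder attends to the encoder’s representation of a separate input sequence. In self-attention, each position attends to all positions of the same sequence, including itself.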
What is the role of the decoder in a Seq2Seq model?
To generate the output sequence from the encoded input representation.
What technique is often used to prevent overfitting in Seq2Seq models?
Dropout
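Worked example (a sketch of "inverted" dropout, the variant most frameworks use): during training each activation is zeroed with probability `p` and the survivors are scaled by 1 / (1 - p) so the expected value is unchanged; at evaluation time the input passes through untouched. The function signature is hypothetical and assumes `p` is less than 1.

```python
import random


def dropout(activations, p=0.5, training=True, rng=random):
    # No-op outside training, or when nothing should be dropped.
    if not training or p == 0.0:
        return list(activations)
    keep = 1.0 - p
    # Keep each unit with probability `keep` and rescale it; otherwise zero it.
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


print(dropout([1.0, 2.0, 3.0, 4.0], p=0.5, rng=random.Random(0)))
print(dropout([1.0, 2.0, 3.0, 4.0], training=False))
```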