09 - Dialogue Systems Flashcards

Question 1

Q

What are some every day examples of dialogue systems?

Answer

A

Siri, Alexa, Cortana etc.

Question 2

Q

In recent years researchers have gone away from the ‘personal assistants’ approach and focused more on?

Answer

A

Large language models and chat-based apps such as ChatGPT.

Question 3

Q

What describes a chit-chat based system?

Answer

A

No goal. It is not task-oriented. It is just to chat, and it is very open-ended and has a broad domain. The object is to be indistinguishable from humans (turing test).

Question 4

Q

What describes a task oriented dialogue system?

Answer

A

Take action in the real world. They are goal oriented and wants to complete tasks. Natural language (conversational) is a nice-to-have.

Question 5

Q

Chit-chat uses more which kinds of models?

Answer

A

Seq-2-seq models

Question 6

Q

Goal-oriented models uses which models?

Answer

A

End-2-end models. Sometimes single domain models.

Question 7

Q

What are some challenges in dialogue systems? Mention atleast 3.

Answer

A

Variability in natural language.
Robustness.
Recall/precision trade-off
Meaningful representations
Common sense and knowledge of the world
Ability to learn
Transparency

Question 8

Q

Modular task-oriented systems can be divided into for example?

Answer

A

Predefined domains. Restaurant, taxi, movie etc.

Question 9

Q

A task-oriented system needs to be able to quickly identify what, when it is asked a question?

Answer

A

The domain and the intent of the question.

Question 10

Q

What is ‘slot filling’?

Answer

A

It is the process where a model, given a question, extracts the correct features for the ‘search’ or ‘goal’ such as: “Find me a good thai restaurant” –> Restaurant –> Good ratings –> Thai.

Question 11

Q

We sometimes have text classification models that are trained on transcribed text from spoken language. Why is this a challenge?

Answer

A

Because the transcribed text can be mismatched compared to when we write. Difference in speech and writing.

Question 12

Q

What is Dialogue State Tracking (DST)?

Answer

A

It refers to the process of monitoring and updating the current state of the dialogue during a conversation. It involves keeping track of relevant information, such as user goals, user preferences, and system actions, to maintain a coherent and accurate representation of the ongoing dialogue.

Question 13

Q

What is the benefit of using reinforcement learning in dialogue systems?

Answer

A

We introduce rewards. These can be both short term and long term and thus a model might learn to postpone immediate reward for a larger reward later on.

Question 14

Q

What is the MultiWOZ dataset?

Answer

A

It is a dataset that was created to facilitate research and development of dialogue systems that can handle multi-domain conversations.

Question 15

Q

Metrics in natural language generation that are based on word overlaps are?

Answer

A

BLEU, METEOR, ROGUE

Question 16

Q

Metrics in natural language generation that are based on word embeddings are?

Answer

A

BERTScore

Question 17

Q

BLEU uses which scores?

Answer

A

N-gram precision, which is a score between 0 and 1.

Question 18

Q

How does BERTScore work?

Answer

A

BERTScore is a metric that computes the similarity between two sentences by using contextualized word embeddings from BERT and aligning them based on cosine similarity to capture semantic similarity.

Question 19

Q

End-to-end dialogue systems are becoming more popular. What are they?

Answer

A

Yet again, it is a system that takes in an input and models an output without relying on intermediate processes or stages.

Question 20

Q

What is the ‘Proximal Policy Optimization’ (PPO)?

Answer

A

In Proximal Policy Optimization (PPO), a language model acts as a policy that generates text based on a given prompt, and the model’s parameters are updated to maximize reward metrics by combining a preference model, a constraint on policy shift, and computing advantage for each state-action pair.

Question 21

Q

What makes the ‘Alpaca’ models so unique?

Answer

A

They are trained on input that is generated as output from chatGPT model.

Question 22

Q

There are in this course 5 components of a prompt. These are?

Answer

A

Input data
Exemplars
Instruction
Indicators
Context

Question 23

Q

For smaller models, if results are poor, what might we do?

Answer

A

Fine-tune the model itself, since fine-tuning the prompt might be ineffective.

Question 24

Q

For larger models (in dialogue systems), if results are poor, what might we do?

Answer

A

Fine-tune the prompt with advanced techniques such as chain-of-thought or least-to-most.

Question 25

Q

What is the key idea in ‘Knowledge augmentation’?

Answer

A

The key idea is to find knowledge in the form of relevant textual information that is provided as context in the prompt.

Question 26

Q

What is unique about conversational question answering?

Answer

A

Tasks like searching the web or recommendations are done through a conversational approach. It feels like talking to a very knowledgable person.

Question 27

Q

What does Sparrow propose as a conversational question anwering model?

Answer

A

An end-2-end approach using Large Language Models (LLM’s)

Question 28

Q

Sparrow is trained fine-tuning a LLM with reinforcement learning from human feedback. True or false?

Question 29

Q

How is the Sparrow model finetuned using human feedback?

Answer

A

Using per-turn response preference (finish non-complete sentence) and adversarial probing (make rule, identify violations).

Question 30

Q

Sparrow ‘introduces’ extra participants into the dialog. These are called search query and search result. What are these?

Answer

A

The search query is the string that queries the search and the result is the feedback from the query - the answer from google for example.

Question 31

Q

When we compare chit-chat models to task-oriented models which of the two is it more normal to incorporate LLM’s?

Answer

A

In the chit-chat models.

Question 32

Q

Why is it difficult to incorporate LLM into task-oriented models?

Answer

A

Because task-oriented models have explicit goals and therefore have a more moduralized pipeline for interpretability and controllability.

Question 33

Q

What is the ‘Few-Shot Bot (FSB)’?

Answer

A

A model that combines few-shot prompting with a skill selector.
it retrieves information depending on the skill and uses a LLM to generate an answer conditioned on dialogue history.

Question 34

Q

Recent trends in this topic of Dialogue Systems include?

Answer

A

Empathy in dialogue systems and multimodal (think writing AND pictures) dialogue systems.

Brainscape's Knowledge GenomeTM

09 - Dialogue Systems Flashcards

Brainscape's Knowledge Genome^TM