09 - Dialogue Systems Flashcards

1
Q

What are some every day examples of dialogue systems?

A

Siri, Alexa, Cortana etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In recent years researchers have gone away from the ‘personal assistants’ approach and focused more on?

A

Large language models and chat-based apps such as ChatGPT.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What describes a chit-chat based system?

A

No goal. It is not task-oriented. It is just to chat, and it is very open-ended and has a broad domain. The object is to be indistinguishable from humans (turing test).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What describes a task oriented dialogue system?

A

Take action in the real world. They are goal oriented and wants to complete tasks. Natural language (conversational) is a nice-to-have.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Chit-chat uses more which kinds of models?

A

Seq-2-seq models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Goal-oriented models uses which models?

A

End-2-end models. Sometimes single domain models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some challenges in dialogue systems? Mention atleast 3.

A

Variability in natural language.
Robustness.
Recall/precision trade-off
Meaningful representations
Common sense and knowledge of the world
Ability to learn
Transparency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Modular task-oriented systems can be divided into for example?

A

Predefined domains. Restaurant, taxi, movie etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A task-oriented system needs to be able to quickly identify what, when it is asked a question?

A

The domain and the intent of the question.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is ‘slot filling’?

A

It is the process where a model, given a question, extracts the correct features for the ‘search’ or ‘goal’ such as: “Find me a good thai restaurant” –> Restaurant –> Good ratings –> Thai.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

We sometimes have text classification models that are trained on transcribed text from spoken language. Why is this a challenge?

A

Because the transcribed text can be mismatched compared to when we write. Difference in speech and writing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Dialogue State Tracking (DST)?

A

It refers to the process of monitoring and updating the current state of the dialogue during a conversation. It involves keeping track of relevant information, such as user goals, user preferences, and system actions, to maintain a coherent and accurate representation of the ongoing dialogue.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the benefit of using reinforcement learning in dialogue systems?

A

We introduce rewards. These can be both short term and long term and thus a model might learn to postpone immediate reward for a larger reward later on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the MultiWOZ dataset?

A

It is a dataset that was created to facilitate research and development of dialogue systems that can handle multi-domain conversations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Metrics in natural language generation that are based on word overlaps are?

A

BLEU, METEOR, ROGUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Metrics in natural language generation that are based on word embeddings are?

A

BERTScore

17
Q

BLEU uses which scores?

A

N-gram precision, which is a score between 0 and 1.

18
Q

How does BERTScore work?

A

BERTScore is a metric that computes the similarity between two sentences by using contextualized word embeddings from BERT and aligning them based on cosine similarity to capture semantic similarity.

19
Q

End-to-end dialogue systems are becoming more popular. What are they?

A

Yet again, it is a system that takes in an input and models an output without relying on intermediate processes or stages.

20
Q

What is the ‘Proximal Policy Optimization’ (PPO)?

A

In Proximal Policy Optimization (PPO), a language model acts as a policy that generates text based on a given prompt, and the model’s parameters are updated to maximize reward metrics by combining a preference model, a constraint on policy shift, and computing advantage for each state-action pair.

21
Q

What makes the ‘Alpaca’ models so unique?

A

They are trained on input that is generated as output from chatGPT model.

22
Q

There are in this course 5 components of a prompt. These are?

A
  1. Input data
  2. Exemplars
  3. Instruction
  4. Indicators
  5. Context
23
Q

For smaller models, if results are poor, what might we do?

A

Fine-tune the model itself, since fine-tuning the prompt might be ineffective.

24
Q

For larger models (in dialogue systems), if results are poor, what might we do?

A

Fine-tune the prompt with advanced techniques such as chain-of-thought or least-to-most.

25
Q

What is the key idea in ‘Knowledge augmentation’?

A

The key idea is to find knowledge in the form of relevant textual information that is provided as context in the prompt.

26
Q

What is unique about conversational question answering?

A

Tasks like searching the web or recommendations are done through a conversational approach. It feels like talking to a very knowledgable person.

27
Q

What does Sparrow propose as a conversational question anwering model?

A

An end-2-end approach using Large Language Models (LLM’s)

28
Q

Sparrow is trained fine-tuning a LLM with reinforcement learning from human feedback. True or false?

A

True

29
Q

How is the Sparrow model finetuned using human feedback?

A

Using per-turn response preference (finish non-complete sentence) and adversarial probing (make rule, identify violations).

30
Q

Sparrow ‘introduces’ extra participants into the dialog. These are called search query and search result. What are these?

A

The search query is the string that queries the search and the result is the feedback from the query - the answer from google for example.

31
Q

When we compare chit-chat models to task-oriented models which of the two is it more normal to incorporate LLM’s?

A

In the chit-chat models.

32
Q

Why is it difficult to incorporate LLM into task-oriented models?

A

Because task-oriented models have explicit goals and therefore have a more moduralized pipeline for interpretability and controllability.

33
Q

What is the ‘Few-Shot Bot (FSB)’?

A

A model that combines few-shot prompting with a skill selector.
it retrieves information depending on the skill and uses a LLM to generate an answer conditioned on dialogue history.

34
Q

Recent trends in this topic of Dialogue Systems include?

A

Empathy in dialogue systems and multimodal (think writing AND pictures) dialogue systems.