dialogue state trackingspeech.ee.ntu.edu.tw › ~tlkagk › courses › dlhlp20 › dstqa...
TRANSCRIPT
Dialogue State Tracking (DST)
• It is the core of task-oriented dialogue.
• State-of-the-art approaches consider DST as QA.
https://dstc9.dstc.community/
It was called Dialog State Tracking Challenge (DSTC) before 2017.
Task-oriented Dialogue
NLU
PolicyNLG
ASR
TTS
User Input
SystemOutput
State Tracker
state
Input: several sequences, Output: a sequence
End-to-end
Dialogue History
Dialogue State Tracking
Summarization of what has happened in the dialogue
(for policy module to make decision)
Dialogue State Tracking
State
Dialogue State Tracking (DST)
Dialogue State
hotel price range: moderate
A set of (key, value)
key value
Key: Defined before developing your system
Value: choose from a set of predefined options
slotdomain
null
cheapexpensivedon’t care
Dialogue State Tracking (DST)
Dialogue State
hotel price range: moderate
A set of (key, value)
key value
Key: Defined before developing your system
Value: choose from a set of options
slotdomain
restaurant price range:
hotel internet: yes / no
moderate
Same options
Dialogue State Tracking (DST)
𝑆1
𝑆2
𝑈1
𝑈2
𝑆𝑇
𝑈𝑇
……
a se
qu
ence
of
sen
ten
ces
What the user and agent have said
𝑆: agent, 𝑈: user
Key 𝑘 include domain and slot
𝑘1:
𝑘2:
𝑘3:
𝑣11, 𝑣2
1, … , 𝑣𝑛11
𝑣12, 𝑣2
2, … , 𝑣𝑛22
𝑣13, 𝑣2
3, … , 𝑣𝑛33
Select the correct value for each key
What is WOZ?
• One way of system test / data collection
https://dl.acm.org/doi/pdf/10.1145/2163.358100
Artificial AI(AAI)
Challenge - Infinite values
• Some slots have almost infinite possible values (e.g. leave time, phone number)
Knowledge source question
answer
Module for Source Module for QuestionAttention
Module for Answer
Dialogue History
𝑆1, 𝑈1, 𝑆2, 𝑈2, … , 𝑆𝑇 , 𝑈𝑇
What is the [SLOT NAME]?
What is the phone number?
Span in dialogue history
Challenge – New Service Having training data New service
Question: What is the destination of train?
Question: What is the destination of taxi?
[Zhou, et al., NeurIPS Workshop’19]
DST QA If a user booked a restaurant, then the destination of the taxi is likely to be that restaurant.
The (domain, slot) pairs are not independent.
Reference
• Shuyang Gao, Abhishek Sethi, Sanchit Agarwal, Tagyoung Chung, Dilek Hakkani-Tur, Dialog State Tracking: A Neural Reading Comprehension Approach, SIGDIAL, 2019
• Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, Richard Socher, A Simple Language Model for Task-Oriented Dialogue, arXiv, 2020
• Chien-Sheng Wu, Andrea Madotto, Ehsan Hosseini-Asl, Caiming Xiong, Richard Socher, Pascale Fung, Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems, ACL, 2019
• Li Zhou, Kevin Small, Multi-domain Dialogue State Tracking as Dynamic Knowledge Graph Enhanced Question Answering, NeurIPS Workshop, 2019
• Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta, Pranav Khaitan, Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset, AAAI, 2020
Reference
• Qi Zhu, Kaili Huang, Zheng Zhang, Xiaoyan Zhu, Minlie Huang, CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset, TACL, 2020
• Paweł Budzianowski, Tsung-Hsien Wen, Bo-Hsiang Tseng, Iñigo Casanueva, Stefan Ultes, Osman Ramadan, Milica Gašić, MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling, EMNLP, 2018
One slide for this course
Model
ModelModel
Model
Model class Model class
ASR TTS
VC, Source Separation
Speaker Verification BERT, Multi-BERTAudio BERT
Text “Style” Transfer
Vocoder
One Sequence Multiple Sequences
One Class
Sentiment ClassificationStance Detection
Veracity PredictionIntent Classification
Dialogue Policy
NLISearch Engine
Relation Extraction
Class for each Token
POS taggingWord segmentation
Extractive SummarizationSlotting Filling
NER
Copyfrom Input
Extractive QA
GeneralSequence
Abstractive SummarizationTranslation
Grammar CorrectionNLG
General QAChatbot
State TrackerTask Oriented Dialogue
Other? Parsing, Coreference Resolution
NAT