Alexa Conversations: An Extensible Data-driven Approach for Building Task-oriented Dialogue Systems

NAACL 2021 System Demonstrations Track Traditional goal-oriented dialogue systems rely on various components such as natural language understanding, dialogue state tracking, policy learning and response generation. Training each component requires annotations which are hard to obtain for every new d...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Acharya, Anish, Adhikari, Suranjit, Agarwal, Sanchit, Auvray, Vincent, Belgamwar, Nehal, Biswas, Arijit, Chandra, Shubhra, Chung, Tagyoung, Fazel-Zarandi, Maryam, Gabriel, Raefer, Gao, Shuyang, Goel, Rahul, Hakkani-Tur, Dilek, Jezabek, Jan, Jha, Abhay, Kao, Jiun-Yu, Krishnan, Prakash, Ku, Peter, Goyal, Anuj, Lin, Chien-Wei, Liu, Qing, Mandal, Arindam, Metallinou, Angeliki, Naik, Vishal, Pan, Yi, Paul, Shachi, Perera, Vittorio, Sethi, Abhishek, Shen, Minmin, Strom, Nikko, Wang, Eddie
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:NAACL 2021 System Demonstrations Track Traditional goal-oriented dialogue systems rely on various components such as natural language understanding, dialogue state tracking, policy learning and response generation. Training each component requires annotations which are hard to obtain for every new domain, limiting scalability of such systems. Similarly, rule-based dialogue systems require extensive writing and maintenance of rules and do not scale either. End-to-End dialogue systems, on the other hand, do not require module-specific annotations but need a large amount of data for training. To overcome these problems, in this demo, we present Alexa Conversations, a new approach for building goal-oriented dialogue systems that is scalable, extensible as well as data efficient. The components of this system are trained in a data-driven manner, but instead of collecting annotated conversations for training, we generate them using a novel dialogue simulator based on a few seed dialogues and specifications of APIs and entities provided by the developer. Our approach provides out-of-the-box support for natural conversational phenomena like entity sharing across turns or users changing their mind during conversation without requiring developers to provide any such dialogue flows. We exemplify our approach using a simple pizza ordering task and showcase its value in reducing the developer burden for creating a robust experience. Finally, we evaluate our system using a typical movie ticket booking task and show that the dialogue simulator is an essential component of the system that leads to over $50\%$ improvement in turn-level action signature prediction accuracy.
DOI:10.48550/arxiv.2104.09088