Hierarchical Reinforcement Learning With Guidance for Multi-Domain Dialogue Policy

Achieving high performance in a multi-domain dialogue system with low computation is undoubtedly challenging. Previous works applying an end-to-end approach have been very successful. However, the computational cost remains a major issue since the large-sized language model using GPT-2 is required....

Bibliographic details
Published in:	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, Vol. 31, p. 748-761
Main authors:	Rohmatillah, Mahdin; Chien, Jen-Tzung
Format:	Article
Language:	English
Subjects:	Cloning; Costs; Dialogue system; Domains; guidance learning; hierarchical reinforcement learning; Human performance; Interactive computer systems; Machine learning; Optimization; Pipelines; policy optimization; Reinforcement learning; Representations; Task analysis; Training; Transformers
Online access:	Full text