Data Synthesis for Domain Development of Natural Language Understanding for Assistant Systems


Bibliographic details
Main authors: Moran, Brian; Levin, Theodore Frank; Difranco, Daniel; Kolmykov-Zotov, Alexander; Desai, Shrey
Format: Patent
Language: English
Description
Abstract: In one embodiment, a method includes: receiving a request to train a natural-language understanding (NLU) model for a new domain; accessing a context-free grammar associated with the new domain, wherein the context-free grammar defines production rules with respect to ontology tokens associated with the new domain and utterance tokens for generating natural-language strings in the new domain; generating utterance-frame pairs by traversing a hierarchical grammar tree associated with the context-free grammar according to the production rules, wherein each utterance-frame pair comprises an utterance and a corresponding frame, and each frame comprises ontology tokens associated with the new domain and utterance tokens corresponding to one or more of the ontology tokens of the frame; and training the NLU model based on the utterance-frame pairs.
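
Illustrative sketch (not taken from the patent): the generation step in the abstract can be pictured as exhaustively expanding a small context-free grammar whose terminals are either ontology tokens or utterance tokens. Everything below is an assumption made for illustration: the toy "alarm" grammar, the bracketed notation `[IN:...` / `[SL:...` / `]` for ontology tokens, and the helper names `expand` and `generate_pairs`. The sketch also assumes a non-recursive grammar so that the expansion terminates. The utterance is formed by keeping only the utterance tokens, while the frame keeps both kinds of tokens.

```python
from itertools import product

# Hypothetical toy grammar for an "alarm" domain (an assumption, not from the patent).
# Keys are non-terminals; each maps to a list of alternative productions,
# and each production is a sequence of non-terminals and terminal tokens.
GRAMMAR = {
    "ROOT": [["[IN:CREATE_ALARM", "VERB", "alarm", "TIME_SLOT", "]"]],
    "VERB": [["set", "an"], ["create", "an"]],
    "TIME_SLOT": [["for", "[SL:TIME", "TIME", "]"]],
    "TIME": [["7", "am"], ["noon"]],
}


def is_ontology_token(symbol):
    """Assumed convention: ontology tokens are '[LABEL' openers or a closing ']'."""
    return symbol.startswith("[") or symbol == "]"


def expand(symbol, grammar):
    """Return every terminal token sequence derivable from `symbol`.

    Walks the grammar tree depth-first. Assumes the grammar has no recursive
    rules; a recursive grammar would need a depth limit or sampling instead.
    """
    if symbol not in grammar:
        return [[symbol]]  # terminal: a single ontology or utterance token
    results = []
    for production in grammar[symbol]:
        child_expansions = [expand(child, grammar) for child in production]
        # Combine one expansion per child, in left-to-right order.
        for combo in product(*child_expansions):
            results.append([token for part in combo for token in part])
    return results


def generate_pairs(grammar, start="ROOT"):
    """Build (utterance, frame) pairs from every derivation of `start`."""
    pairs = []
    for tokens in expand(start, grammar):
        utterance = " ".join(t for t in tokens if not is_ontology_token(t))
        frame = " ".join(tokens)  # ontology tokens plus utterance tokens
        pairs.append((utterance, frame))
    return pairs


pairs = generate_pairs(GRAMMAR)
for utterance, frame in pairs:
    print(f"{utterance!r} -> {frame!r}")
```

With this toy grammar the two `VERB` alternatives and the two `TIME` alternatives yield four pairs, for example the utterance `set an alarm for 7 am` paired with the frame `[IN:CREATE_ALARM set an alarm for [SL:TIME 7 am ] ]`. Pairs of this kind would then serve as supervised training data for the NLU model; the training step itself is outside the scope of this sketch.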