Learning a Multi-Domain Curriculum for Neural Machine Translation
Most data selection research in machine translation focuses on improving a single domain. We perform data selection for multiple domains at once. This is achieved by carefully introducing instance-level domain-relevance features and automatically constructing a training curriculum to gradually conce...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Most data selection research in machine translation focuses on improving a
single domain. We perform data selection for multiple domains at once. This is
achieved by carefully introducing instance-level domain-relevance features and
automatically constructing a training curriculum to gradually concentrate on
multi-domain relevant and noise-reduced data batches. Both the choice of
features and the use of curriculum are crucial for balancing and improving all
domains, including out-of-domain. In large-scale experiments, the multi-domain
curriculum simultaneously reaches or outperforms the individual performance and
brings solid gains over no-curriculum training. |
---|---|
DOI: | 10.48550/arxiv.1908.10940 |