Epi-Curriculum: Episodic Curriculum Learning for Low-Resource Domain Adaptation in Neural Machine Translation

Neural machine translation (NMT) models have achieved comparable results to human translation with a large number of parallel corpora available. However, their performance remains poor when translating on new domains with a limited number of data. Recent studies either only show the model's rob...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on artificial intelligence 2024-12, Vol.5 (12), p.6095-6108
Hauptverfasser: Chen, Keyu, Zhuang, Di, Li, Mingchen, Morris Chang, J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Neural machine translation (NMT) models have achieved comparable results to human translation with a large number of parallel corpora available. However, their performance remains poor when translating on new domains with a limited number of data. Recent studies either only show the model's robustness to domain shift or the superiority in adapting to new domains with a limited number of data. A solution for addressing both the model's robustness and adaptability is underexplored. In this article, we present a novel approach Epi-Curriculum to address low-resource domain adaptation (DA), which contains a new episodic training framework along with a denoised curriculum learning. Our episodic training framework enhances the model's robustness to domain shift by episodically exposing the encoder/decoder to an inexperienced decoder/encoder. The denoised curriculum learning filters the noised data and further improves the model's adaptability by gradually guiding the learning process from easy to more difficult tasks. Extensive experiments have been conducted on English-German (En-De), English-Romanian (En-Ro), and English-French (En-Fr) translation tasks. Our results show that: 1) Epi-Curriculum outperforms the baseline on unseen and seen domains by 2.28 and 3.64 BLEU score on En-De task, and 3.32 and 2.23 on En-Ro task; and 2) our episodic training framework outperforms the recent popular metalearning framework in terms of robustness to domain shift and achieves comparable adaptability to new domains.
ISSN:2691-4581
2691-4581
DOI:10.1109/TAI.2024.3396125