Extremely Low-resource Multilingual Neural Machine Translation for Indic Mizo Language
Published in: | International journal of information technology (Singapore. Online) 2023-12, Vol.15 (8), p.4275-4282 |
---|---|
Main authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Online access: | Full text |
Abstract: | Machine translation requires a vast amount of parallel data in order to generate high-quality translations. Since many Indian languages lack sufficient resources, enhancing translation performance for these language pairs can have a significant impact. This study aims to address the issue of low-resource neural machine translation between Mizo and English by utilizing other Indian languages in a multilingual framework. We explore the use of multilingual techniques for 13 pairs of Indic languages in both many-to-one and one-to-many setups, proposing a method for Multilingual Neural Machine Translation to enhance the translation quality of the low-resource Mizo language. We assess the effectiveness of ensemble decoding and transliteration into the Roman script through qualitative and quantitative approaches. The empirical findings demonstrate that incorporating transliteration and utilizing ensemble decoding with checkpoint ensembles leads to improved translation quality. |
---|---|
ISSN: | 2511-2104 2511-2112 |
DOI: | 10.1007/s41870-023-01480-8 |