Extremely Low-resource Multilingual Neural Machine Translation for Indic Mizo Language

Machine translation requires a vast amount of parallel data in order to generate high-quality translations. Since many Indian languages lack sufficient resources, enhancing translation performance for these language pairs can have a significant impact. This study aims to address the issue of low-res...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of information technology (Singapore. Online) 2023-12, Vol.15 (8), p.4275-4282
Hauptverfasser: Lalrempuii, Candy, Soni, Badal
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Machine translation requires a vast amount of parallel data in order to generate high-quality translations. Since many Indian languages lack sufficient resources, enhancing translation performance for these language pairs can have a significant impact. This study aims to address the issue of low-resource neural machine translation between Mizo and English by utilizing other Indian languages in a multilingual framework. We explore the use of multilingual techniques for 13 pairs of Indic languages in both many-to-one and one-to-many setups, proposing a method for Multilingual Neural Machine Translation to enhance the translation quality of the low-resource Mizo language. We assess the effectiveness of ensemble decoding and transliteration into the Roman script through qualitative and quantitative approaches. The empirical findings demonstrate that incorporating transliteration and utilizing ensemble decoding with checkpoint ensembles leads to improved translation quality.
ISSN:2511-2104
2511-2112
DOI:10.1007/s41870-023-01480-8