Optimal translation of English to Bahasa Indonesia using statistical machine translation system

Finding optimal translation is not an easy task as it requires in-depth knowledge of the language to re-encode the meaning into the target language. This paper explores the translation process of a statistical machine translation system from English to Bahasa Indonesia by considering four weight var...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Mantoro, T., Asian, J., Octavian, R., Ayu, M. A.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Finding optimal translation is not an easy task as it requires in-depth knowledge of the language to re-encode the meaning into the target language. This paper explores the translation process of a statistical machine translation system from English to Bahasa Indonesia by considering four weight variables i.e. translation model, language model, distortion (reordering) and for word penalty. This translation approach does not require in-depth knowledge of the linguistic properties of the languages. The well-behaved aligned parallel corpus as the training data for the machine translation is used to increase the BLEU and NIST scores in getting better quality translations. One way to enhance the corpus is by increasing the number of words and/or sentences it contains. In this study, the better evaluation score is achieved when we alter the weights of some translation parameters. Our study shows that both the weights and well-behaved aligned parallel corpus play significant roles in improving the translation quality which presented by higher score of NIST and BLEU. Our results show that this approach has better performance than a popular Rule Based Machine Translation (RBMT) system.
DOI:10.1109/ICT4M.2013.6518918