VBD-MT Chinese-Vietnamese Translation Systems for VLSP 2022
We present our systems participated in the VLSP 2022 machine translation shared task. In the shared task this year, we participated in both translation tasks, i.e., Chinese-Vietnamese and Vietnamese-Chinese translations. We build our systems based on the neural-based Transformer model with the power...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | We present our systems participated in the VLSP 2022 machine translation
shared task. In the shared task this year, we participated in both translation
tasks, i.e., Chinese-Vietnamese and Vietnamese-Chinese translations. We build
our systems based on the neural-based Transformer model with the powerful
multilingual denoising pre-trained model mBART. The systems are enhanced by a
sampling method for backtranslation, which leverage large scale available
monolingual data. Additionally, several other methods are applied to improve
the translation quality including ensembling and postprocessing. We achieve
38.9 BLEU on ChineseVietnamese and 38.0 BLEU on VietnameseChinese on the public
test sets, which outperform several strong baselines. |
---|---|
DOI: | 10.48550/arxiv.2308.07601 |