Interactive Machine Translation for the Language Modernization and Spelling Normalization of Historical Documents

[EN] Historical documents are an important part of our cultural heritage. Among other task related to their processing, it is important to modernize their language in order to make them accessible to a broader audience and to achieve an orthography consistency to reduce the linguistic variation inhe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Domingo-Ballester, Miguel, Casacuberta Nolla, Francisco
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[EN] Historical documents are an important part of our cultural heritage. Among other task related to their processing, it is important to modernize their language in order to make them accessible to a broader audience and to achieve an orthography consistency to reduce the linguistic variation inherent in them. Language modernization and spelling normalization have those goals in mind. However, they still have a long way to go. Thus, in order to help scholars generate error-free modernizations/normalizations when the quality is essential, we propose an interactive framework based on interactive machine translation. In this work, we deployed two different interactive protocols into these tasks. We evaluated our proposal under simulated environments, observing significant reductions of the human effort. The research leading to these results has received funding from Generalitat Valenciana under project PROMETEO/2019/121 and from ValgrAI (Valencian Graduate School and Research Network for Artificial Intelligence). We gratefully acknowledge Andres Trapiello and Ediciones Destino for granting us permission to use their book in our research. Domingo-Ballester, M.; Casacuberta Nolla, F. (2023). Interactive Machine Translation for the Language Modernization and Spelling Normalization of Historical Documents. Pattern Analysis and Applications. 26(4):1601-1614. https://doi.org/10.1007/s10044-023-01164-w