OCR error correction for Vietnamese handwritten text using neural machine translation

OCR post-processing is an important step for improving the quality of OCR output texts. Long short-term memory (LSTM) is a deep learning model, which has wide-range applications in many domains like time series prediction, natural language processing and speech recognition. In this paper, we propose...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Nguyen, D. Q., Le, A. D., Phan, M. N., Kromer, P., Zelinka, I.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Coders Competition Encoders-Decoders Error correction Error correction & detection Handwriting recognition Machine learning Machine translation Natural language processing Optical character recognition Post-processing Speech recognition Texts
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	OCR post-processing is an important step for improving the quality of OCR output texts. Long short-term memory (LSTM) is a deep learning model, which has wide-range applications in many domains like time series prediction, natural language processing and speech recognition. In this paper, we propose an OCR error correction model using neural machine translation with bidirectional LSTM networks at syllable level. Vietnamese OCR text dataset for the model evaluation is outputted from an OCR engine based on the attention-based encoder-decoder (AED) model taking input of handwritten text in the benchmark database of the ICFHR 2018 Vietnamese online handwritten text recognition competition. The experimental results show that the proposed model helps decrease the word error rate in the OCR output texts of the above AED model by about 2%. The model performance is also discussed and compared to the other baseline methods in the competition.
ISSN:	0094-243X 1551-7616
DOI:	10.1063/5.0066679