Error correction scheme augmented with statistical and lexical learning capability, for Japanese OCR

This paper proposes an improved error-correction scheme based on word-pair grammar augmented by statistical and lexical learning capability derived from manual error correction history. This learns corrections by hand to update probability score on the confusion matrix and memorizes new strings of c...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Kazunori, M.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper proposes an improved error-correction scheme based on word-pair grammar augmented by statistical and lexical learning capability derived from manual error correction history. This learns corrections by hand to update probability score on the confusion matrix and memorizes new strings of characters gained after error-correction as to use them as a parameter of likelihood calculation for automatic error correction. The small evaluation of this algorithm has revealed, word-pair parsing based error correction improves about 2.5% of recognition rate, confusion matrix based statistical learning improves about 1.0%, and character string learning incorporated with statistical learning improves about 1.4% to output the recognition rate more than 99%, while OCR bare recognition rate is about 95%.
ISSN:1051-4651
2831-7475
DOI:10.1109/ICPR.1996.547627