Effects of classifier structures and training regimes on integrated segmentation and recognition of handwritten numeral strings

In integrated segmentation and recognition of character strings, the underlying classifier is trained to be resistant to noncharacters. We evaluate the performance of state-of-the-art pattern classifiers of this kind. First, we build a baseline numeral string recognition system with simple but effec...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence 2004-11, Vol.26 (11), p.1395
Hauptverfasser:	Liu, Cheng-Lin, Sako, Hiroshi, Fujisawa, Hiromichi
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Artificial Intelligence Automatic Data Processing - methods Cluster Analysis Computer Graphics Computer Simulation Handwriting Image Enhancement - methods Image Interpretation, Computer-Assisted - methods Information Storage and Retrieval - methods Numerical Analysis, Computer-Assisted Pattern Recognition, Automated Reading Reproducibility of Results Sensitivity and Specificity Signal Processing, Computer-Assisted Subtraction Technique User-Computer Interface
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In integrated segmentation and recognition of character strings, the underlying classifier is trained to be resistant to noncharacters. We evaluate the performance of state-of-the-art pattern classifiers of this kind. First, we build a baseline numeral string recognition system with simple but effective presegmentation. The classification scores of the candidate patterns generated by presegmentation are combined to evaluate the segmentation paths and the optimal path is found using the beam search strategy. Three neural classifiers, two discriminative density models, and two support vector classifiers are evaluated. Each classifier has some variations depending on the training strategy: maximum likelihood, discriminative learning both with and without noncharacter samples. The string recognition performances are evaluated on the numeral string images of the NIST Special Database 19 and the zipcode images of the CEDAR CDROM-1. The results show that noncharacter training is crucial for neural classifiers and support vector classifiers, whereas, for the discriminative density models, the regularization of parameters is important. The string recognition results compare favorably to the best ones reported in the literature though we totally ignored the geometric context. The best results were obtained using a support vector classifier, but the neural classifiers and discriminative density models show better trade-off between accuracy and computational overhead.
ISSN:	0162-8828
DOI:	10.1109/TPAMI.2004.104