A ranking-based feature selection approach for handwritten character recognition

•We present a method for feature selection for OCR.•We combine feature ranking based technique with a greedy search strategy.•We considered effective and widely used feature sets in handwriting recognition.•We found that a reduced feature set can be selected without sacrificing performance. Feature...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition letters 2019-04, Vol.121, p.77-86
Hauptverfasser: Cilia, Nicole Dalia, De Stefano, Claudio, Fontanella, Francesco, Scotto di Freca, Alessandra
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•We present a method for feature selection for OCR.•We combine feature ranking based technique with a greedy search strategy.•We considered effective and widely used feature sets in handwriting recognition.•We found that a reduced feature set can be selected without sacrificing performance. Feature selection is generally considered a very important step in any pattern recognition process. Its aim is that of reducing the computational cost of the classification task, in an attempt to increase, or not to reduce, the classification performance. In the framework of handwriting recognition, the large variability of the handwriting of different writers makes the selection of appropriate feature sets even more complex and have been widely investigated. Although promising, the results achieved so far present several limitations, that include, among others, the computational complexity, the dependence on the adopted classifiers and the difficulty in evaluating the interactions among features. In this study, we tried to overcome some of the above drawbacks by adopting a feature-ranking-based technique: we considered different univariate measures to produce a feature ranking and we proposed a greedy search approach for choosing the feature subset able to maximize the classification results. In the experiments, we considered one of the most effective and widely used set of features in handwriting recognition to verify whether our approach allows us to obtain good classification results by selecting a reduced set of features. The experimental results, obtained by using standard real word databases of handwritten characters, confirmed the effectiveness of our proposal.
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2018.04.007