Handwritten Script Identification from Text Lines
Proc. of 7th International Conference on Advances in Communication, Network and Computing (CNC), 2016 In a multilingual country like India where 12 different official scripts are in use, automatic identification of handwritten script facilitates many important applications such as automatic transcri...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Proc. of 7th International Conference on Advances in
Communication, Network and Computing (CNC), 2016 In a multilingual country like India where 12 different official scripts are
in use, automatic identification of handwritten script facilitates many
important applications such as automatic transcription of multilingual
documents, searching for documents on the web/digital archives containing a
particular script and for the selection of script specific Optical Character
Recognition (OCR) system in a multilingual environment. In this paper, we
propose a robust method towards identifying scripts from the handwritten
documents at text line-level. The recognition is based upon features extracted
using Chain Code Histogram (CCH) and Discrete Fourier Transform (DFT). The
proposed method is experimented on 800 handwritten text lines written in seven
Indic scripts namely, Gujarati, Kannada, Malayalam, Oriya, Tamil, Telugu, Urdu
along with Roman script and yielded an average identification rate of 95.14%
using Support Vector Machine (SVM) classifier. |
---|---|
DOI: | 10.48550/arxiv.2009.07433 |