SCUT-COUCH TextlineNU: An Unconstrained Online Handwritten Chinese Text Lines Dataset
An unconstrained online handwritten Chinese text lines dataset, SCUT-COUCH Textline_NU, a subset of SCUT-COUCH [1] [2], is built to facilitate the research of unconstrained online Chinese text recognition. Texts for hand copying are sampled from China Daily corpus with a stratified random manner. Th...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | An unconstrained online handwritten Chinese text lines dataset, SCUT-COUCH Textline_NU, a subset of SCUT-COUCH [1] [2], is built to facilitate the research of unconstrained online Chinese text recognition. Texts for hand copying are sampled from China Daily corpus with a stratified random manner. The current vision of SCUT-COUCH Textline_NU has 8,809 text lines (4,813 lines are collected by touch screen LCD and 3,996 by digital pen) and 159,866 characters in total that are written by more than 157 participants. To demonstrate that the dataset is practical, an over-segmentation, dynamic programming and semantic model based algorithm was presented for segmenting and recognizing the unconstrained online Chinese text lines. In preliminary experiments on the dataset, the proposed algorithm recognition achieves a baseline accuracy of 56.41%. |
---|---|
DOI: | 10.1109/ICFHR.2010.123 |