Segmentation of Arabic Handwritten Documents into Text Lines using Watershed Transform
Published in: | International journal of interactive multimedia and artificial intelligence 2017-12, Vol.4 (6), p.96 |
---|---|
Main authors: | , , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Abstract: | A crucial task in character recognition systems is the segmentation of a document into text lines, especially when the document is handwritten. For non-Latin scripts such as Arabic, the challenge is greater: in addition to the variability of handwriting, the presence of diacritical points and the high number of ascender and descender characters further complicate segmentation. To address this complexity, and even to turn the difficulty into an advantage given that Arabic is semi-cursive in nature, a method based on the Watershed Transform is proposed. Tested on [21], the method achieves a segmentation rate of 93% for a matching score of 95%. KEYWORDS: Text Line Segmentation, Arabic Script, Handwritten Character Recognition, Connected Component Analysis, Projection Profile, Watershed Transform. |
---|---|
ISSN: | 1989-1660 |
DOI: | 10.9781/ijimai.2017.08.002 |