Segmentation of Arabic Handwritten Documents into Text Lines using Watershed Transform

A crucial task in character recognition systems is the segmentation of the document into text lines and especially if it is handwritten. When dealing with non-Latin document such as Arabic, the challenge becomes greater since in addition to the variability of writing, the presence of diacritical poi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of interactive multimedia and artificial intelligence 2017-12, Vol.4 (6), p.96
Hauptverfasser: Souhar, A, Boulid, Y, Ameur, ElB, Ouagague, Mly. M
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A crucial task in character recognition systems is the segmentation of the document into text lines and especially if it is handwritten. When dealing with non-Latin document such as Arabic, the challenge becomes greater since in addition to the variability of writing, the presence of diacritical points and the high number of ascender and descender characters complicates more the process of the segmentation. To remedy with this complexity and even to make this difficulty an advantage since the focus is on the Arabic language which is semi-cursive in nature, a method based on the Watershed Transform technique is proposed. Tested on > [21] a segmentation rate of 93% for a 95% of matching score is achieved. KEYWORDS Text Line Segmentation, Arabic Script, Handwritten Character Recognition, Connected Component Analysis, Projection Profile, Watershed Transform.
ISSN:1989-1660
1989-1660
DOI:10.9781/ijimai.2017.08.002