An efficient preprocessing block for the middle-age Persian manuscripts

Bibliographic details
Main authors: Alirezaee, S., Aghaeinia, H., Faez, K., Rashidzadeh, R.
Format: Conference proceedings
Language: English
Description
Summary: This paper proposes a preprocessing block for middle-age Persian documents. The main idea is based on mathematical morphology, connected components, and clustering. The proposed algorithm can simultaneously remove noise and segment a manuscript into its basic components, i.e. lines, words, and characters. The proposed strategy has been tested on 200 pages of middle-age Persian manuscripts. The success of the k-means algorithm on page-to-line segmentation was also used as a criterion for performance evaluation on the test data. The results show that the proposed algorithm achieves 98.12% accuracy on page-to-line segmentation.
ISSN: 0840-7789, 2576-7046
DOI: 10.1109/CCECE.2005.1557418
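
The summary above names connected components and k-means clustering as ingredients of page-to-line segmentation, but this record does not give the authors' actual procedure. The following is therefore only a minimal sketch of the general idea, under stated assumptions: the page is a small binary grid (1 = ink), noise removal is approximated by discarding tiny components rather than by mathematical morphology, the number of text lines is known in advance, and all function names (connected_components, kmeans_1d, segment_lines) are hypothetical.

```python
from collections import deque

def connected_components(img):
    """Return the 8-connected foreground (1) components as lists of (row, col)."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for r in range(h):
        for c in range(w):
            if img[r][c] and not seen[r][c]:
                seen[r][c] = True
                queue, pixels = deque([(r, c)]), []
                while queue:  # breadth-first flood fill
                    y, x = queue.popleft()
                    pixels.append((y, x))
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            ny, nx = y + dy, x + dx
                            if (0 <= ny < h and 0 <= nx < w
                                    and img[ny][nx] and not seen[ny][nx]):
                                seen[ny][nx] = True
                                queue.append((ny, nx))
                comps.append(pixels)
    return comps

def kmeans_1d(values, k, iters=50):
    """Plain Lloyd's k-means on scalars; centres are seeded evenly over the range."""
    lo, hi = min(values), max(values)
    centres = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iters):
        # Assignment step: nearest centre for every value.
        labels = [min(range(k), key=lambda j: abs(v - centres[j])) for v in values]
        # Update step: mean of each cluster (an empty cluster keeps its centre).
        new = []
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            new.append(sum(members) / len(members) if members else centres[j])
        if new == centres:
            break
        centres = new
    return labels

def segment_lines(img, n_lines, min_pixels=2):
    """Group components into text lines by clustering their centroid rows.

    Assumption: components smaller than min_pixels are treated as noise.
    """
    comps = [p for p in connected_components(img) if len(p) >= min_pixels]
    rows = [sum(y for y, _ in p) / len(p) for p in comps]
    labels = kmeans_1d(rows, n_lines)
    lines = [[] for _ in range(n_lines)]
    for comp, label in zip(comps, labels):
        lines[label].append(comp)
    return lines

# Toy page: two "words" on each of two text lines, plus one isolated speck.
page = [
    [1, 1, 0, 1, 1, 0, 0, 0],
    [1, 1, 0, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 1, 1, 0, 0],
    [1, 1, 1, 0, 1, 1, 0, 0],
]
lines = segment_lines(page, n_lines=2)
```

On this toy page the speck is dropped by the size filter and the remaining four components are split into an upper and a lower line. Clustering only the vertical centroid is a deliberate simplification; real manuscripts with skewed or overlapping lines would need more than this one feature.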