An efficient preprocessing block for the middle-age Persian manuscripts
In this paper, a preprocessing block for the middle-age Persian documents is proposed. The main idea is based on the mathematical morphology, connected components and clustering. The proposed algorithm is capable to simultaneously remove the noise and segment the manuscript to its basic components i...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In this paper, a preprocessing block for the middle-age Persian documents is proposed. The main idea is based on the mathematical morphology, connected components and clustering. The proposed algorithm is capable to simultaneously remove the noise and segment the manuscript to its basic components i.e. lines, words and characters. The proposed strategy has been tested on 200 page of the middle-age Persian. We have also used the success of the k-means algorithm on page to line segmentation as a criterion for the performance evaluation on the test data. The results show the proposed algorithm has 98.12% accuracy on page to line segmentation |
---|---|
ISSN: | 0840-7789 2576-7046 |
DOI: | 10.1109/CCECE.2005.1557418 |