METHOD AND SYSTEM FOR PREPROCESSING IMAGE FOR OPTICAL CHARACTER RECOGNITION

PROBLEM TO BE SOLVED: To provide a method and a system for preprocessing an image including one or more of Arabic text and non-text items for Optical Character Recognition (OCR).SOLUTION: The method includes determining a plurality of components associated with the Arabic text and the non-text items...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: MOHAMED SULEIMAN KHORSHLD, HUSSEIN KHALID ALI O'MALLEY
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:PROBLEM TO BE SOLVED: To provide a method and a system for preprocessing an image including one or more of Arabic text and non-text items for Optical Character Recognition (OCR).SOLUTION: The method includes determining a plurality of components associated with the Arabic text and the non-text items. The component includes a set of connected pixels. A first set of characteristic parameters is then calculated for the plurality of components. The plurality of components are subsequently merged based on the first set of characteristic parameters to form one or more sub-words and/or one or more words.