METHOD AND SYSTEM FOR PREPROCESSING IMAGE FOR OPTICAL CHARACTER RECOGNITION
Main authors: | , |
---|---|
Format: | Patent |
Language: | eng |
Summary: | PROBLEM TO BE SOLVED: To provide a method and a system for preprocessing an image including one or more of Arabic text and non-text items for Optical Character Recognition (OCR). SOLUTION: The method includes determining a plurality of components associated with the Arabic text and the non-text items. Each component includes a set of connected pixels. A first set of characteristic parameters is then calculated for the plurality of components. The plurality of components are subsequently merged based on the first set of characteristic parameters to form one or more sub-words and/or one or more words. |
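
The three steps named in the summary (finding connected-pixel components, calculating characteristic parameters, merging components) can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say which characteristic parameters are used or how merging is decided. Here the parameters are assumed to be bounding boxes, and the merge rule is assumed to be a horizontal-gap threshold. The function names `connected_components`, `bounding_box`, and `merge_by_gap`, and the parameter `max_gap`, are hypothetical.

```python
from collections import deque


def connected_components(image):
    """Group foreground pixels (truthy values) into 8-connected components.

    `image` is a list of equal-length rows. Returns a list of components,
    each a list of (row, col) pixel coordinates.
    """
    rows = len(image)
    cols = len(image[0]) if image else 0
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if not image[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            pixels = []
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(pixels)
    return components


def bounding_box(pixels):
    """Assumed characteristic parameter: (top, left, bottom, right), inclusive."""
    ys = [p[0] for p in pixels]
    xs = [p[1] for p in pixels]
    return (min(ys), min(xs), max(ys), max(xs))


def merge_by_gap(boxes, max_gap):
    """Assumed merge rule: join boxes separated by at most `max_gap` blank columns."""
    groups = []
    for box in sorted(boxes, key=lambda b: b[1]):  # scan by left edge
        if groups and box[1] - groups[-1][3] - 1 <= max_gap:
            top, left, bottom, right = groups[-1]
            groups[-1] = (min(top, box[0]), left,
                          max(bottom, box[2]), max(right, box[3]))
        else:
            groups.append(box)
    return groups


# Usage: a tiny binary image with three separate blobs.
image = [
    [1, 1, 0, 1, 0, 0, 0, 0, 1, 1],
    [0, 1, 0, 1, 0, 0, 0, 0, 0, 1],
]
boxes = [bounding_box(p) for p in connected_components(image)]
sub_words = merge_by_gap(boxes, max_gap=1)  # small gaps join into sub-words
words = merge_by_gap(boxes, max_gap=4)      # larger gaps join into words
```

A gap threshold is only one plausible criterion. A real Arabic preprocessor would likely also need to attach dots and diacritics that sit above or below a letter body, which a purely horizontal rule does not model.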