An Arabic OCR Approach Using Levenshtein Distance and CNNs
Arabic Handwritten Character Recognition (AHCR) systems encounter various challenges arising from the unique characteristics of the Arabic language and the limited availability of public databases. Consequently, numerous research endeavors have aimed to enhance the recognition accuracy of AHCR. In t...
Gespeichert in:
Veröffentlicht in: | Ingénierie des systèmes d'Information 2024-02, Vol.29 (1), p.9-17 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Arabic Handwritten Character Recognition (AHCR) systems encounter various challenges arising from the unique characteristics of the Arabic language and the limited availability of public databases. Consequently, numerous research endeavors have aimed to enhance the recognition accuracy of AHCR. In this study, we propose a solution inspired by the intricate functions of the human visual cortex and hippocampus. Our proposed system employs a segmentation method to break down Arabic characters, and a Convolutional Neural Network (CNN) is then utilized for character recognition. To recognize entire words, we employ the Levenshtein distance with a personalized database containing an extensive collection of Arabic words. Experimental results demonstrate that our system yields a word error rate ranging from 1% to 25%, contingent upon the number of accurately recognized characters. |
---|---|
ISSN: | 1633-1311 2116-7125 |
DOI: | 10.18280/isi.290102 |