An Arabic OCR Approach Using Levenshtein Distance and CNNs

Arabic Handwritten Character Recognition (AHCR) systems encounter various challenges arising from the unique characteristics of the Arabic language and the limited availability of public databases. Consequently, numerous research endeavors have aimed to enhance the recognition accuracy of AHCR. In t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Ingénierie des systèmes d'Information 2024-02, Vol.29 (1), p.9-17
Hauptverfasser: Fakhet, Walid, El Khediri, Salim, Zidi, Salah
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Arabic Handwritten Character Recognition (AHCR) systems encounter various challenges arising from the unique characteristics of the Arabic language and the limited availability of public databases. Consequently, numerous research endeavors have aimed to enhance the recognition accuracy of AHCR. In this study, we propose a solution inspired by the intricate functions of the human visual cortex and hippocampus. Our proposed system employs a segmentation method to break down Arabic characters, and a Convolutional Neural Network (CNN) is then utilized for character recognition. To recognize entire words, we employ the Levenshtein distance with a personalized database containing an extensive collection of Arabic words. Experimental results demonstrate that our system yields a word error rate ranging from 1% to 25%, contingent upon the number of accurately recognized characters.
ISSN:1633-1311
2116-7125
DOI:10.18280/isi.290102