A Stochastic Arabic Diacritizer Based on a Hybrid of Factorized and Unfactorized Textual Features

This paper introduces a large-scale dual-mode stochastic system to automatically diacritize raw Arabic text. The first of these modes determines the most likely diacritics by choosing the sequence of full-form Arabic word diacritizations with maximum marginal probability via A* lattice search and lo...

Bibliographic details
Published in: IEEE Transactions on Audio, Speech, and Language Processing, 2011-01, Vol. 19 (1), p. 166-175
Main authors: Rashwan, Mohsen A A; Al-Badrashiny, Mohamed A S A A; Attia, Mohamed; Abdou, Sherif M; Rafea, Ahmed
Format: Article
Language: English