A multi-agent system for pos-tagging vocalized Arabic texts

In this paper, we address the problem of Part-Of-Speech (POS) tagging of Arabic texts with vowel marks. After the description of the specificities of Arabic language and the induced difficulties on the task of POS-tagging, we propose an approach that combines several methods (stochastic and rule-bas...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International arab journal of information technology 2007, Vol.4 (4), p.322-329
Hauptverfasser: al-Zuraybi, Chiraz Bin Uthman, Bin Ahmad, Muhammad, Torjmen, Aroua
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper, we address the problem of Part-Of-Speech (POS) tagging of Arabic texts with vowel marks. After the description of the specificities of Arabic language and the induced difficulties on the task of POS-tagging, we propose an approach that combines several methods (stochastic and rule-based). For the implementation of these methods and the global POS-tagging system, we adopted a multi-agent architecture. In which, five tagger agents work in parallel, each one applies its own method, in order to propose for each word in a sentence the suitable tag among those proposed by the morphological analyzer. The tagger agents cooperate together and with the unknown words solver agent to resolve unknown words. A voting agent decides in the end, which tag to affect to each word. Finally, we present the experimental protocol we used to evaluate the system carried out in this work and the obtained results that we consider very satisfactory.
ISSN:1683-3198
1683-3198