An Arabic question classification method based on new taxonomy and continuous distributed representation of words

Bibliographic details
Published in: Journal of King Saud University. Computer and information sciences 2021-02, Vol.33 (2), p.218-224
Main authors: Hamza, Alami; En-Nahnahi, Noureddine; Zidani, Khalid Alaoui; El Alaoui Ouatik, Said
Format: Article
Language: eng
Online access: Full text
Description
Abstract: Because search engines cannot retrieve a precise answer to a given question, research teams have built question answering systems (QAS). These systems provide exact answers to questions formulated in natural language. Question classification is a crucial task for a QAS, since identifying the correct answer type improves the system's performance. The question taxonomy plays an important role in question classification. A broad range of taxonomies has been proposed, but most were not designed for Arabic questions. The contribution of the paper is twofold. First, we build a taxonomy for open-domain Arabic questions. Second, we propose an efficient method for classifying Arabic questions. The method has two stages: first, we compute a representation of each question from the continuous distributed representation of words, which captures syntactic and semantic relations between words. Then, we apply a machine learning approach to classify questions into seven types (categories). We carried out several experiments and compared the proposed method with several state-of-the-art Arabic question classification methods. Experimental results show that the proposed method achieves an accuracy of 90%.
ISSN:1319-1578
2213-1248
DOI:10.1016/j.jksuci.2019.01.001
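
Illustrative sketch: the abstract describes a two-stage pipeline (represent each question from word vectors, then classify it) without naming the composition function or the learning algorithm. The Python below is a minimal sketch under loud assumptions, not the authors' method: the question vector is taken to be the average of its word vectors, a nearest-centroid rule with cosine similarity stands in for the unspecified machine learning approach, and the embeddings, tokens (English, for readability), and the three labels `HUMAN`, `LOCATION`, `TIME` are all hypothetical placeholders rather than the paper's seven categories.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical 3-dimensional word vectors; real systems would load
# pretrained embeddings with hundreds of dimensions.
EMBEDDINGS = {
    "who": [1.0, 0.0, 0.0],
    "wrote": [0.8, 0.1, 0.1],
    "where": [0.0, 1.0, 0.0],
    "city": [0.1, 0.9, 0.0],
    "when": [0.0, 0.0, 1.0],
    "year": [0.0, 0.1, 0.9],
}


def question_vector(tokens, embeddings):
    """Stage 1 (assumed): average the vectors of the tokens that have one."""
    dim = len(next(iter(embeddings.values())))
    known = [embeddings[t] for t in tokens if t in embeddings]
    if not known:
        return [0.0] * dim  # no known word: fall back to the zero vector
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity; defined as 0.0 when either vector is all zeros."""
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(x * x for x in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def train_centroids(examples, embeddings):
    """Stage 2 (stand-in classifier): one mean question vector per label."""
    grouped = defaultdict(list)
    for tokens, label in examples:
        grouped[label].append(question_vector(tokens, embeddings))
    return {
        label: [sum(col) / len(vectors) for col in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def classify(tokens, centroids, embeddings):
    """Return the label whose centroid is most similar to the question."""
    q = question_vector(tokens, embeddings)
    return max(centroids, key=lambda label: cosine(q, centroids[label]))


# Tiny hypothetical training set: (tokenized question, label).
TRAIN = [
    (["who", "wrote"], "HUMAN"),
    (["where", "city"], "LOCATION"),
    (["when", "year"], "TIME"),
]
CENTROIDS = train_centroids(TRAIN, EMBEDDINGS)

if __name__ == "__main__":
    print(classify(["where", "is", "the", "city"], CENTROIDS, EMBEDDINGS))
```

Averaging is only one way to turn word vectors into a question vector, and the nearest-centroid rule could be swapped for any supervised classifier; the abstract leaves both choices open.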