Support Vector Machine Text Classification System: Using Ant Colony Optimization Based Feature Subset Selection

Feature subset selection (FSS) is an important step for effective text classification systems. In this work, we have implemented a support vector machine (SVM) text classifier for Arabic articles. Moreover, we have implemented a novel FSS method based on Ant Colony Optimization (ACO) and Chi-square...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Mesleh, A.M., Kanaan, G.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Feature subset selection (FSS) is an important step for effective text classification systems. In this work, we have implemented a support vector machine (SVM) text classifier for Arabic articles. Moreover, we have implemented a novel FSS method based on Ant Colony Optimization (ACO) and Chi-square statistic. The proposed ACO-Based FSS method adapted Chi-square statistic as heuristic information and the effectiveness of the SVM classifier as a guide to improve the selection of features for each category. Compared to the six state-of-the-art FSS methods, our ACO Based-FSS algorithm achieved better TC effectiveness. Evaluation used an in-house Arabic text classification corpus that consists of 1445 documents independently classified into nine categories. The experimental results were presented in terms of macro-averaging precision, macro-averaging recall and macro-averaging F 1 measures.
DOI:10.1109/ICCES.2008.4772984