A GA-based feature selection and ensemble learning for high-dimensional datasets

When dealing with high-dimensional datasets with fewer samples, feature selection and ensemble learning are two effective strategies. In this paper, we focus our attention on genetic algorithm based feature selection for ensemble learning. We use an improved GA algorithm (IGA) to reduce the dimensio...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Pei-Yong Xia, Xiang-Qian Ding, Bai-Ning Jiang
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Ada-Boost Bagging Boosting Computational complexity Computer science Cybernetics Ensemble learning Feature Selection Filters Genetic algorithms Machine learning Oceans Pattern recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	When dealing with high-dimensional datasets with fewer samples, feature selection and ensemble learning are two effective strategies. In this paper, we focus our attention on genetic algorithm based feature selection for ensemble learning. We use an improved GA algorithm (IGA) to reduce the dimensionality of the feature space, and then evaluate using bagging and Ada-Boost constructed by the reduced features. Experimental results on several UCI datasets demonstrate that the improved GA-based feature selection algorithm (IGAFS) is often able to obtain a better feature subset when compared with the standard GA-based feature selection algorithm (SGAFS). Our experiments also indicate that ensemble learning using IGAFS is more accuracy than employing SGAFS and the whole feature space in general conditions.
ISSN:	2160-133X
DOI:	10.1109/ICMLC.2009.5212542