Estimation of Discriminative Feature Subset Using Community Modularity

Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k -features and applying community modularity to select highly informative features as a g...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Scientific reports 2016-04, Vol.6 (1), p.25040-25040, Article 25040
Hauptverfasser:	Zhao, Guodong, Liu, Sanming
Format:	Artikel
Sprache:	eng
Schlagworte:	631/114/1305 639/705/117 Accuracy Algorithms Classification Data mining Data processing Feature selection Humanities and Social Sciences Learning algorithms Machine learning Methods multidisciplinary Probability distribution Science Variables
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k -features and applying community modularity to select highly informative features as a group. However, these features may not be relevant as an individual. Furthermore, relevant in-dependency rather than irrelevant redundancy among the selected features is effectively measured with the community modularity Q value of the sample graph in the k -features. An efficient FS method called k -features sample graph feature selection is presented. A key property of this approach is that the discriminative cues of a feature subset with the maximum relevant in-dependency among features can be accurately determined. This community modularity-based method is then verified with the theory of k-means cluster. Compared with other state-of-the-art methods, the proposed approach is more effective, as verified by the results of several experiments.
ISSN:	2045-2322 2045-2322
DOI:	10.1038/srep25040