Feature selection using bare-bones particle swarm optimization with mutual information

•Proposing a novel PSO-based feature selection algorithm with mutual information.•Presenting an effective swarm initialization strategy based on label correlation.•Designing two local search operators, the supplementary and deletion operators.•Giving an adaptive flip mutation to help particles jump...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition 2021-04, Vol.112, p.107804, Article 107804
Hauptverfasser: Song, Xian-fang, Zhang, Yong, Gong, Dun-wei, Sun, Xiao-yan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Proposing a novel PSO-based feature selection algorithm with mutual information.•Presenting an effective swarm initialization strategy based on label correlation.•Designing two local search operators, the supplementary and deletion operators.•Giving an adaptive flip mutation to help particles jump out of local extremum. Feature selection (FS) is an important data processing method in pattern recognition and data mining. Due to not considering characteristics of the FS problem itself, traditional particle update mechanisms and swarm initialization strategies adopted in most particle swarm optimization (PSO) limit their performance on dealing with high-dimensional FS problems. Focused on it, this paper proposes a novel feature selection algorithm based on bare bones PSO (BBPSO) with mutual information. Firstly, an effective swarm initialization strategy based on label correlation is developed, making full use of the correlation between features and class labels to accelerate the convergence of swarm. Then, in order to enhance the exploitation performance of the algorithm, two local search operators, i.e., the supplementary operator and the deletion operator, are developed based on feature relevance-redundancy. Furthermore, an adaptive flip mutation operator is designed to help particles jump out of local optimal solutions. We apply the proposed algorithm to typical datasets based on the K-Nearest Neighbor classifier (K-NN), and compare it with eleven state-of-the-art algorithms, SFS, PTA, SGA, BPSO, PSO(4-2), HPSO-LS, Binary BPSO, NaFA, IBFA, KPLS-mRMR and SMBA-CSFS. The experimental results show that the proposed algorithm can achieve a feature subset with better performance, and is a highly competitive FS algorithm.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2020.107804