IMCStacking: Cost-sensitive stacking learning with feature inverse mapping for imbalanced problems

Stacking related methods develop rapidly recent years. However, few Stacking based ensemble methods are designed for imbalanced problems. In this paper, a novel Feature Inverse Mapping based Cost-sensitive Stacking learning (IMCStacking) is proposed to solve the problems encountered in imbalanced cl...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Knowledge-based systems 2018-06, Vol.150, p.27-37
Hauptverfasser: Cao, Chenjie, Wang, Zhe
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Stacking related methods develop rapidly recent years. However, few Stacking based ensemble methods are designed for imbalanced problems. In this paper, a novel Feature Inverse Mapping based Cost-sensitive Stacking learning (IMCStacking) is proposed to solve the problems encountered in imbalanced classification. In IMCStacking, we integrate the cost-sensitive Logistic Regression as the final classifier to regard different costs to majority and minority samples. Furthermore, a quick and effective feature inverse mapping technique is applied to IMCStacking to maximize the utilization of the cross-validation process during the Stacking ensemble. This trick can make the proposed method learn better classification thresholds for imbalanced problems. As the result, IMCStacking implements the cost-sensitive strategy on both data level and feature level to overcome the imbalances. Moreover, both linear and forest based approaches work as base classifiers in IMCStacking to guarantee enough generalization. Finally, comprehensive comparison experiments about training times and mean accuracy (M-ACC) on typical imbalanced datasets from KEEL demonstrate both the effectiveness and efficiency of the proposed IMCStacking.
ISSN:0950-7051
1872-7409
DOI:10.1016/j.knosys.2018.02.031