Investigating the impact of selection criteria in dynamic ensemble selection methods

Detailed description

Bibliographic details
Published in: Expert systems with applications, 2018-09, Vol. 106, p. 141-153
Main authors: Lustosa Filho, Jose Augusto S.; Canuto, Anne M.P.; Santiago, Regivan Hugo Nunes
Format: Article
Language: English
Online access: Full text
Description
Summary:
• Present three dynamic ensemble selection (DES) methods.
• Evaluate the impact of proximity measures in DES methods.
• Perform an empirical analysis with one-step and two-step DES methods.
• Obtain the best results of the empirical analysis with two-step DES methods.
• Obtain similar results of the empirical analysis with the use of proximity measures.

Ensembles of classifiers are composed of parallel-organized components (individual classifiers) whose outputs are combined by a combination method that provides the final output of the ensemble. In this context, a Dynamic Ensemble System (DES) is an ensemble-based system in which a different ensemble structure is defined for each test pattern by selecting a subset of classifiers from an initial pool. Any criterion can be used during the selection process of a DES, the most important being accuracy and distance. Distance measures assess the distance between classifier outputs within a validation set; the main examples are diversity and similarity. In this paper, we investigate the impact of selection criteria in DES methods. More specifically, we focus on the use of different distance measures (diversity and similarity) as selection criteria. To do this, an empirical analysis was conducted using six different DES methods (three existing methods and three proposed in this paper) and 20 different classification datasets. Our findings indicate that a distance measure improves the overall performance of state-of-the-art ensemble generation methods.
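
The selection idea described in the summary can be illustrated with a small sketch. This is not one of the six methods evaluated in the paper; it is a hypothetical two-step selector written only to show how an accuracy criterion and a distance criterion (here, diversity measured as pairwise disagreement) might be chained. The function names, the parameters `k`, `n_accurate` and `n_final`, the use of nearest validation samples as the local region, and the toy data are all assumptions made for illustration.

```python
from collections import Counter
from math import dist


def select_ensemble(val_X, val_y, val_preds, x, k=3, n_accurate=3, n_final=2):
    """Hypothetical two-step dynamic selection for one test pattern x.

    val_preds[c][i] is the label classifier c predicted for validation
    sample i. Returns the indices of the selected classifiers.
    """
    # Local region (assumption): the k validation samples nearest to x.
    region = sorted(range(len(val_X)), key=lambda i: dist(val_X[i], x))[:k]

    # Step 1 (accuracy): keep the classifiers that are most accurate locally.
    def local_accuracy(c):
        return sum(val_preds[c][i] == val_y[i] for i in region) / len(region)

    candidates = sorted(range(len(val_preds)), key=local_accuracy, reverse=True)
    candidates = candidates[:n_accurate]
    if len(candidates) < 2:
        return candidates  # diversity is undefined for a single classifier

    # Step 2 (distance): rank candidates by mean pairwise disagreement
    # with the other candidates, a simple diversity measure.
    def disagreement(a, b):
        return sum(val_preds[a][i] != val_preds[b][i] for i in region) / len(region)

    def diversity(c):
        others = [o for o in candidates if o != c]
        return sum(disagreement(c, o) for o in others) / len(others)

    return sorted(candidates, key=diversity, reverse=True)[:n_final]


def majority_vote(labels):
    """Combine the selected classifiers' outputs by plurality."""
    return Counter(labels).most_common(1)[0][0]


# Toy data (made up): five 1-D validation samples and four classifiers.
val_X = [(0.0,), (0.1,), (0.2,), (5.0,), (5.1,)]
val_y = [0, 0, 0, 1, 1]
val_preds = [
    [0, 0, 0, 1, 1],  # classifier 0
    [0, 0, 1, 1, 1],  # classifier 1
    [1, 1, 1, 1, 1],  # classifier 2
    [0, 1, 0, 0, 0],  # classifier 3
]
selected = select_ensemble(val_X, val_y, val_preds, x=(0.05,))
```

On this toy data the sketch selects classifiers 1 and 3: the accuracy step discards classifier 2, and the diversity step then prefers the two candidates that disagree most with the rest, even though classifier 0 is the most accurate locally. That trade-off between the two criteria is the kind of effect the paper's empirical analysis examines.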
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2018.04.002