FT4cip: A new functional tree for classification in class imbalance problems

Decision trees (DTs) are popular classifiers partly due to their reasonably good classification performance, their ease of interpretation, and their widespread use in ensembles. To improve the classification performance of individual DTs, researchers have used linear combinations of features in inne...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Knowledge-based systems 2022-09, Vol.252, p.109294, Article 109294
Hauptverfasser: Cañete-Sifuentes, Leonardo, Monroy, Raúl, Medina-Pérez, Miguel Angel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Decision trees (DTs) are popular classifiers partly due to their reasonably good classification performance, their ease of interpretation, and their widespread use in ensembles. To improve the classification performance of individual DTs, researchers have used linear combinations of features in inner nodes (Multivariate decision trees), leaf nodes (Model trees), or both (Functional trees). In this paper, we present a new functional tree, Functional Tree for class imbalance problems (FT4cip). FT4cip is designed to work with class imbalance problems, where one of the classes in the database has few objects compared to another class. FT4cip achieves better classification performance, in terms of AUC, than the best model tree (LMT) and functional tree (Gama) that we identified. The statistical comparison was made in 110 databases using Bayesian statistical tests. We also make a meta-analysis of classification performance per type of database, which helps us recommend a classifier given a problem. We show how each design decision taken when building FT4cip contributes to classification performance or simple models, and rank them according to their importance to classification performance. To avoid a problem of fragmentation in DT literature, we contrast each design decision taken when building FT4cip against LMT and Gama. •We introduce the Functional Tree for class imbalance problems (FT4cip).•We make a statistical comparison of FT4cip against rival methods in 110 databases.•The comparison shows FT4cip has better classification performance than rivals.•A meta-analysis lets us recommend what classifier to use given a specific problem.•The meta-analysis shows FT4cip has great performance in class imbalance problems.
ISSN:0950-7051
1872-7409
DOI:10.1016/j.knosys.2022.109294