A thorough experimental comparison of multilabel methods for classification performance

Multilabel classification as a data mining task has recently attracted increasing interest from researchers. Many current data mining applications address problems with instances that belong to more than one class. These problems require the development of new, efficient methods. Advantageously usin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition 2024-07, Vol.151, p.110342, Article 110342
Hauptverfasser: García-Pedrajas, Nicolás E., Cuevas-Muñoz, José M., Cerruela-García, Gonzalo, de Haro-García, Aida
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Multilabel classification as a data mining task has recently attracted increasing interest from researchers. Many current data mining applications address problems with instances that belong to more than one class. These problems require the development of new, efficient methods. Advantageously using the correlation among different labels can provide better performance than methods that manage each label separately. In recent decades, many methods have been developed to deal with multilabel datasets, which makes it difficult to decide which method is the most appropriate for a given task. In this paper, we present the most comprehensive comparison carried out so far. We compare a total of 62 different methods and several configurations of each one for a total of 197 trained models. We also use a large set of problems comprising 65 datasets. In addition, we studied the efficiency of the methods considering six different classification performance metrics. Our results show that, although there are methods that repeatedly appear among the top-performing models, the best methods are closely related to the metric used for evaluating the performance. We also analyzed different aspects of the behavior of the methods. •We present a thorough comparison of multi-label methods.•The behavior depending on base classifiers is studied for many methods.•Six different metrics are used in the comparison.•Clustering of similar methods is carried out.•The behavior of the methods depending characteristics is carried out.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2024.110342