A classification method to classify bone marrow cells with class imbalance problem
•This research presents a class balance classification method (CBCM) for classifying bone marrow cell data sets that have a problem of imbalance.•Automatic classifier for identifying 15 different types of bone marrow cells.•CBCM uses Class-Balanced focal loss.•CBCM outperforms other balance approach...
Gespeichert in:
Veröffentlicht in: | Biomedical signal processing and control 2022-02, Vol.72, p.103296, Article 103296 |
---|---|
Hauptverfasser: | , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •This research presents a class balance classification method (CBCM) for classifying bone marrow cell data sets that have a problem of imbalance.•Automatic classifier for identifying 15 different types of bone marrow cells.•CBCM uses Class-Balanced focal loss.•CBCM outperforms other balance approaches, such as random over-sampling, synthetic minority over-sampling technique (SMOTE), random under-sampling, weighted random forest and weighted cross-entropy function.
Bone marrow cell morphology has long been used to diagnose blood diseases. However, it requires long-term experience from a suitable person. Furthermore, the outcomes of their recognition are subjective and no quantitative standard has been established yet. Consequently, developing a deep learning automatic system for classifying bone marrow cells is extremely important. However, real-life data sets, such as bone marrow cell data, constantly suffer from a long-tail distribution problem, owing to which the final trained classifier is biased toward a large number of categories. Thus, addressing this issue is crucial. The current research presents a class balance classification method (CBCM) for classifying 15 types of bone marrow cell data sets with a class imbalance problem. CBCM outperforms other balance approaches such as random over-sampling, synthetic minority over-sampling technique (SMOTE), random under-sampling, weighted random forest and weighted cross-entropy function, achieving precision, sensitivity, and specificity values of 84.53%, 84.44% and 99.29% respectively. A more extensive comparison between the baseline and CBCM, as well as the Grad-CAM and Guided Grad-CAM of CBCM, reveals that CBCM is a reliable and effective solution to address the long-tail distribution problem of the bone marrow cell data sets. |
---|---|
ISSN: | 1746-8094 1746-8108 |
DOI: | 10.1016/j.bspc.2021.103296 |