A Hybrid Deep Learning Approach for Skin Cancer Classification Using Swin Transformer and Dense Group Shuffle Non-Local Attention Network

Skin lesions are localized regions of the skin exhibiting atypical growth or appearance and can be either benign or malignant. Detecting and classifying these lesions accurately is crucial for early diagnosis and effective treatment of skin cancer. However, existing approaches for classifying cancer...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2024, Vol.12, p.158040-158051
Hauptverfasser: Karthik, R., Menaka, R., Atre, Shivansh, Cho, Jaehyuk, Veerappampalayam Easwaramoorthy, Sathishkumar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Skin lesions are localized regions of the skin exhibiting atypical growth or appearance and can be either benign or malignant. Detecting and classifying these lesions accurately is crucial for early diagnosis and effective treatment of skin cancer. However, existing approaches for classifying cancerous and non-cancerous skin lesions from dermoscopic images are limited in capturing the appropriate features and also susceptible to overfitting in imbalanced datasets. This work aims to improve skin cancer classification by enhancing feature representation in dermoscopic images. A hybrid classification system is proposed, combining two tracks: the Swin Transformer and the Dense Group Shuffle Non-Local Attention (DGSNLA) Network, which integrates DenseNet169, Group Shuffle Depth-wise blocks (GSDW), and an enhanced non-local attention block (ENLA). This deep feature fusion approach effectively aggregates global and local features, resulting in more accurate feature representation and improved classification performance. To the best of our knowledge, this is the first study to combine a transformer with a custom CNN for skin cancer classification. The proposed network achieves an accuracy of 94.21%, an F1-score of 96.64%. a precision of 96.62%, and a recall of 96.25% on the HAM10000 dataset.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2024.3485507