Attributes based skin lesion detection and recognition: A mask RCNN and transfer learning-based deep learning framework

•Decorrelation formulation based contrast improvement.•Lesion segmentation using modified MASK RCNN.•Transfer Learning based CNN features are extracted.•A Entropy-controlled LS-SVM based best CNN features are selected. Malignant melanoma is considered to be one of the deadliest types of skin cancers...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition letters 2021-03, Vol.143, p.58-66
Hauptverfasser: Khan, Muhammad Attique, Akram, Tallha, Zhang, Yu-Dong, Sharif, Muhammad
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Decorrelation formulation based contrast improvement.•Lesion segmentation using modified MASK RCNN.•Transfer Learning based CNN features are extracted.•A Entropy-controlled LS-SVM based best CNN features are selected. Malignant melanoma is considered to be one of the deadliest types of skin cancers which is responsible for the massive number of deaths worldwide. According to the American Cancer Society (ACS), more than a million Americans are living with this melanoma. Since 2019, 192,310 new cases of melanoma are registered, where 95,380 are noninvasive, and 96,480 are invasive. The numbers of deaths due to melanoma in 2019 alone are 7,230, comprising 4,740 men and 2,490 women. Melanoma may be curable if diagnosed at the earlier stages; however, the manual diagnosis is time-consuming and also dependent on the expert dermatologist. In this work, a fully automated computerized aided diagnosis (CAD) system is proposed based on the deep learning framework. In the proposed scheme, the original dermoscopic images are initially pre-processed using the decorrelation formulation technique, which later passes the resultant images to the MASK-RCNN for the lesion segmentation. In this step, the MASK RCNN model is trained using the segmented RGB images generated from the ground truth images of ISBI2016 and ISIC2017 datasets. The resultant segmented images are later passed to the DenseNet deep model for feature extraction. Two different layers, average pool and fully connected, are used for feature extraction, which are later combined, and the resultant vector is forwarded to the feature selection block for down - sampling using proposed entropy-controlled least square SVM (LS-SVM). Three datasets are utilized for validation - ISBI2016, ISBI2017, and HAM10000 to achieve an accuracy of 96.3%, 94.8%, and 88.5% respectively. Further, the performance of MASK-RCNN is also validated on ISBI2016 and ISBI2017 to attain an accuracy of 93.6% and 92.7%. To further increase our confidence in the proposed framework, a fair comparison with other state-of-the-art is also provided.
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2020.12.015