Improving skin-disease classification based on customized loss function combined with balanced mini-batch logic and real-time image augmentation

Skin cancer is one of the most common cancers in the world. However, the disease is curable if detected in the beginning stage. Early detection of malignant lesions through accurate techniques and innovative technologies has a significant impact on reducing skin cancer mortality rates. Recently, art...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2020-01, Vol.8, p.1-1
Hauptverfasser: Pham, Tri-Cong, Doucet, Antoine, Luong, Chi-Mai, Tran, Cong-Thanh, Hoang, Van-Dung
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1
container_issue
container_start_page 1
container_title IEEE access
container_volume 8
creator Pham, Tri-Cong
Doucet, Antoine
Luong, Chi-Mai
Tran, Cong-Thanh
Hoang, Van-Dung
description Skin cancer is one of the most common cancers in the world. However, the disease is curable if detected in the beginning stage. Early detection of malignant lesions through accurate techniques and innovative technologies has a significant impact on reducing skin cancer mortality rates. Recently, artificial intelligence has come to the forefront to facilitate skin cancer diagnosis based on medical images. Many deep learning models have been studied and developed, but the imbalance of performance among classes in the multi-class classification is still a challenging problem. This study proposes a hybrid method for handling class imbalance of skin-disease classification. This method combines the data level method of balanced mini-batch logic followed by real-time image augmentation with the algorithm level method of designing new loss function. The training dataset includes 24,530 dermoscopic images of seven skin disease categories, which is by far the largest dataset of skin cancer. The performance metrics of six proposed methods are evaluated on a test dataset of 2,453 images. Our proposed EfficientNetB4-CLF model achieves the highest accuracy of 89.97% and also the highest mean recall of 86.13% with the smallest recalls' standard deviations of 7.60%. Compared to the original methods, our proposed solution not only surpasses 4.65% (86.13% vs 81.48%) of mean recalls but also reduces 4.24% of the recalls' standard deviations (from ±11.84% to ±7.60%). This result indicates that our hybrid method is highly effective in training the Deep CNN network on the skin-disease imbalanced dataset. It addresses the problem of slow learning of the minority classes in the networks by combining the data level method of balanced mini-batch logic followed by the real-time image augmentation with the algorithm level method of the newly designed loss function.
doi_str_mv 10.1109/ACCESS.2020.3016653
format Article
fullrecord <record><control><sourceid>proquest_hal_p</sourceid><recordid>TN_cdi_proquest_journals_2454643189</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9167198</ieee_id><doaj_id>oai_doaj_org_article_2e317fcda2314ed897f00e866d9ee062</doaj_id><sourcerecordid>2454643189</sourcerecordid><originalsourceid>FETCH-LOGICAL-c442t-fcf4459db0159486aaf67185d280b730924d092ff6dde306993aebaa26ef5e8a3</originalsourceid><addsrcrecordid>eNpVUU1v1DAUjBBIVKW_oJdInDhk8Vec-LhaFbrSShwKZ-vFfs56SewSJ0XwK_jJeDdVBT7Y741mxvaborilZEMpUR-3u93dw8OGEUY2nFApa_6quGJUqorXXL7-p35b3KR0Inm1Gaqbq-LPfnyc4pMPfZm--1BZnxASlmaAlLzzBmYfQ9llzJa5MEua4-h_526IKZVuCebCMHHsfMjwTz8fM3-AYHI3-uCrDmZzzPzemxKCLSeEoZr9iKUfoccSln7EMF-uele8cTAkvHk-r4tvn-6-7u6rw5fP-932UBkh2Fw544Sole0IrZVoJYCTDW1ry1rSNZwoJmzenJPWIidSKQ7YATCJrsYW-HWxX31thJN-nPJLpl86gtcXIE69hmn2ZkDNkNPGGQuMU4G2VY0jBFsprUIkkmWvD6vXEYb_rO63B33GCCdMKs6eaOa-X7l57D8WTLM-xWUK-auaiVpIwWmrMouvLDPlKU_oXmwp0efU9Zq6Pqeun1PPqttV5RHxRaFonoxq-V9s7Kp0</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2454643189</pqid></control><display><type>article</type><title>Improving skin-disease classification based on customized loss function combined with balanced mini-batch logic and real-time image augmentation</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Pham, Tri-Cong ; Doucet, Antoine ; Luong, Chi-Mai ; Tran, Cong-Thanh ; Hoang, Van-Dung</creator><creatorcontrib>Pham, Tri-Cong ; Doucet, Antoine ; Luong, Chi-Mai ; Tran, Cong-Thanh ; Hoang, Van-Dung</creatorcontrib><description>Skin cancer is one of the most common cancers in the world. However, the disease is curable if detected in the beginning stage. Early detection of malignant lesions through accurate techniques and innovative technologies has a significant impact on reducing skin cancer mortality rates. Recently, artificial intelligence has come to the forefront to facilitate skin cancer diagnosis based on medical images. Many deep learning models have been studied and developed, but the imbalance of performance among classes in the multi-class classification is still a challenging problem. This study proposes a hybrid method for handling class imbalance of skin-disease classification. This method combines the data level method of balanced mini-batch logic followed by real-time image augmentation with the algorithm level method of designing new loss function. The training dataset includes 24,530 dermoscopic images of seven skin disease categories, which is by far the largest dataset of skin cancer. The performance metrics of six proposed methods are evaluated on a test dataset of 2,453 images. Our proposed EfficientNetB4-CLF model achieves the highest accuracy of 89.97% and also the highest mean recall of 86.13% with the smallest recalls' standard deviations of 7.60%. Compared to the original methods, our proposed solution not only surpasses 4.65% (86.13% vs 81.48%) of mean recalls but also reduces 4.24% of the recalls' standard deviations (from ±11.84% to ±7.60%). This result indicates that our hybrid method is highly effective in training the Deep CNN network on the skin-disease imbalanced dataset. It addresses the problem of slow learning of the minority classes in the networks by combining the data level method of balanced mini-batch logic followed by the real-time image augmentation with the algorithm level method of the newly designed loss function.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2020.3016653</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Algorithms ; Artificial Intelligence ; Augmentation ; balanced mini-batch logic ; Cancer ; Classification ; Classification algorithms ; Computation and Language ; Computer Science ; Datasets ; deep neural networks ; Digital Libraries ; Document and Text Processing ; Human-Computer Interaction ; hybrid method ; imbalanced dataset ; Information Retrieval ; Lesions ; Logic ; loss function ; Machine Learning ; Medical imaging ; Melanoma ; Model accuracy ; Neural networks ; Performance measurement ; Real time ; Real-time systems ; Skin ; Skin cancer ; Skin disease ; Standard deviation ; Training</subject><ispartof>IEEE access, 2020-01, Vol.8, p.1-1</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c442t-fcf4459db0159486aaf67185d280b730924d092ff6dde306993aebaa26ef5e8a3</citedby><cites>FETCH-LOGICAL-c442t-fcf4459db0159486aaf67185d280b730924d092ff6dde306993aebaa26ef5e8a3</cites><orcidid>0000-0002-5507-6454 ; 0000-0003-1572-7248 ; 0000-0001-6160-3356</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9167198$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>230,314,776,780,860,881,2096,27610,27901,27902,54908</link.rule.ids><backlink>$$Uhttps://hal.science/hal-03026932$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Pham, Tri-Cong</creatorcontrib><creatorcontrib>Doucet, Antoine</creatorcontrib><creatorcontrib>Luong, Chi-Mai</creatorcontrib><creatorcontrib>Tran, Cong-Thanh</creatorcontrib><creatorcontrib>Hoang, Van-Dung</creatorcontrib><title>Improving skin-disease classification based on customized loss function combined with balanced mini-batch logic and real-time image augmentation</title><title>IEEE access</title><addtitle>Access</addtitle><description>Skin cancer is one of the most common cancers in the world. However, the disease is curable if detected in the beginning stage. Early detection of malignant lesions through accurate techniques and innovative technologies has a significant impact on reducing skin cancer mortality rates. Recently, artificial intelligence has come to the forefront to facilitate skin cancer diagnosis based on medical images. Many deep learning models have been studied and developed, but the imbalance of performance among classes in the multi-class classification is still a challenging problem. This study proposes a hybrid method for handling class imbalance of skin-disease classification. This method combines the data level method of balanced mini-batch logic followed by real-time image augmentation with the algorithm level method of designing new loss function. The training dataset includes 24,530 dermoscopic images of seven skin disease categories, which is by far the largest dataset of skin cancer. The performance metrics of six proposed methods are evaluated on a test dataset of 2,453 images. Our proposed EfficientNetB4-CLF model achieves the highest accuracy of 89.97% and also the highest mean recall of 86.13% with the smallest recalls' standard deviations of 7.60%. Compared to the original methods, our proposed solution not only surpasses 4.65% (86.13% vs 81.48%) of mean recalls but also reduces 4.24% of the recalls' standard deviations (from ±11.84% to ±7.60%). This result indicates that our hybrid method is highly effective in training the Deep CNN network on the skin-disease imbalanced dataset. It addresses the problem of slow learning of the minority classes in the networks by combining the data level method of balanced mini-batch logic followed by the real-time image augmentation with the algorithm level method of the newly designed loss function.</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Augmentation</subject><subject>balanced mini-batch logic</subject><subject>Cancer</subject><subject>Classification</subject><subject>Classification algorithms</subject><subject>Computation and Language</subject><subject>Computer Science</subject><subject>Datasets</subject><subject>deep neural networks</subject><subject>Digital Libraries</subject><subject>Document and Text Processing</subject><subject>Human-Computer Interaction</subject><subject>hybrid method</subject><subject>imbalanced dataset</subject><subject>Information Retrieval</subject><subject>Lesions</subject><subject>Logic</subject><subject>loss function</subject><subject>Machine Learning</subject><subject>Medical imaging</subject><subject>Melanoma</subject><subject>Model accuracy</subject><subject>Neural networks</subject><subject>Performance measurement</subject><subject>Real time</subject><subject>Real-time systems</subject><subject>Skin</subject><subject>Skin cancer</subject><subject>Skin disease</subject><subject>Standard deviation</subject><subject>Training</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpVUU1v1DAUjBBIVKW_oJdInDhk8Vec-LhaFbrSShwKZ-vFfs56SewSJ0XwK_jJeDdVBT7Y741mxvaborilZEMpUR-3u93dw8OGEUY2nFApa_6quGJUqorXXL7-p35b3KR0Inm1Gaqbq-LPfnyc4pMPfZm--1BZnxASlmaAlLzzBmYfQ9llzJa5MEua4-h_526IKZVuCebCMHHsfMjwTz8fM3-AYHI3-uCrDmZzzPzemxKCLSeEoZr9iKUfoccSln7EMF-uele8cTAkvHk-r4tvn-6-7u6rw5fP-932UBkh2Fw544Sole0IrZVoJYCTDW1ry1rSNZwoJmzenJPWIidSKQ7YATCJrsYW-HWxX31thJN-nPJLpl86gtcXIE69hmn2ZkDNkNPGGQuMU4G2VY0jBFsprUIkkmWvD6vXEYb_rO63B33GCCdMKs6eaOa-X7l57D8WTLM-xWUK-auaiVpIwWmrMouvLDPlKU_oXmwp0efU9Zq6Pqeun1PPqttV5RHxRaFonoxq-V9s7Kp0</recordid><startdate>20200101</startdate><enddate>20200101</enddate><creator>Pham, Tri-Cong</creator><creator>Doucet, Antoine</creator><creator>Luong, Chi-Mai</creator><creator>Tran, Cong-Thanh</creator><creator>Hoang, Van-Dung</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>1XC</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-5507-6454</orcidid><orcidid>https://orcid.org/0000-0003-1572-7248</orcidid><orcidid>https://orcid.org/0000-0001-6160-3356</orcidid></search><sort><creationdate>20200101</creationdate><title>Improving skin-disease classification based on customized loss function combined with balanced mini-batch logic and real-time image augmentation</title><author>Pham, Tri-Cong ; Doucet, Antoine ; Luong, Chi-Mai ; Tran, Cong-Thanh ; Hoang, Van-Dung</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c442t-fcf4459db0159486aaf67185d280b730924d092ff6dde306993aebaa26ef5e8a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Augmentation</topic><topic>balanced mini-batch logic</topic><topic>Cancer</topic><topic>Classification</topic><topic>Classification algorithms</topic><topic>Computation and Language</topic><topic>Computer Science</topic><topic>Datasets</topic><topic>deep neural networks</topic><topic>Digital Libraries</topic><topic>Document and Text Processing</topic><topic>Human-Computer Interaction</topic><topic>hybrid method</topic><topic>imbalanced dataset</topic><topic>Information Retrieval</topic><topic>Lesions</topic><topic>Logic</topic><topic>loss function</topic><topic>Machine Learning</topic><topic>Medical imaging</topic><topic>Melanoma</topic><topic>Model accuracy</topic><topic>Neural networks</topic><topic>Performance measurement</topic><topic>Real time</topic><topic>Real-time systems</topic><topic>Skin</topic><topic>Skin cancer</topic><topic>Skin disease</topic><topic>Standard deviation</topic><topic>Training</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pham, Tri-Cong</creatorcontrib><creatorcontrib>Doucet, Antoine</creatorcontrib><creatorcontrib>Luong, Chi-Mai</creatorcontrib><creatorcontrib>Tran, Cong-Thanh</creatorcontrib><creatorcontrib>Hoang, Van-Dung</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pham, Tri-Cong</au><au>Doucet, Antoine</au><au>Luong, Chi-Mai</au><au>Tran, Cong-Thanh</au><au>Hoang, Van-Dung</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improving skin-disease classification based on customized loss function combined with balanced mini-batch logic and real-time image augmentation</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2020-01-01</date><risdate>2020</risdate><volume>8</volume><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Skin cancer is one of the most common cancers in the world. However, the disease is curable if detected in the beginning stage. Early detection of malignant lesions through accurate techniques and innovative technologies has a significant impact on reducing skin cancer mortality rates. Recently, artificial intelligence has come to the forefront to facilitate skin cancer diagnosis based on medical images. Many deep learning models have been studied and developed, but the imbalance of performance among classes in the multi-class classification is still a challenging problem. This study proposes a hybrid method for handling class imbalance of skin-disease classification. This method combines the data level method of balanced mini-batch logic followed by real-time image augmentation with the algorithm level method of designing new loss function. The training dataset includes 24,530 dermoscopic images of seven skin disease categories, which is by far the largest dataset of skin cancer. The performance metrics of six proposed methods are evaluated on a test dataset of 2,453 images. Our proposed EfficientNetB4-CLF model achieves the highest accuracy of 89.97% and also the highest mean recall of 86.13% with the smallest recalls' standard deviations of 7.60%. Compared to the original methods, our proposed solution not only surpasses 4.65% (86.13% vs 81.48%) of mean recalls but also reduces 4.24% of the recalls' standard deviations (from ±11.84% to ±7.60%). This result indicates that our hybrid method is highly effective in training the Deep CNN network on the skin-disease imbalanced dataset. It addresses the problem of slow learning of the minority classes in the networks by combining the data level method of balanced mini-batch logic followed by the real-time image augmentation with the algorithm level method of the newly designed loss function.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2020.3016653</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-5507-6454</orcidid><orcidid>https://orcid.org/0000-0003-1572-7248</orcidid><orcidid>https://orcid.org/0000-0001-6160-3356</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2020-01, Vol.8, p.1-1
issn 2169-3536
2169-3536
language eng
recordid cdi_proquest_journals_2454643189
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Algorithms
Artificial Intelligence
Augmentation
balanced mini-batch logic
Cancer
Classification
Classification algorithms
Computation and Language
Computer Science
Datasets
deep neural networks
Digital Libraries
Document and Text Processing
Human-Computer Interaction
hybrid method
imbalanced dataset
Information Retrieval
Lesions
Logic
loss function
Machine Learning
Medical imaging
Melanoma
Model accuracy
Neural networks
Performance measurement
Real time
Real-time systems
Skin
Skin cancer
Skin disease
Standard deviation
Training
title Improving skin-disease classification based on customized loss function combined with balanced mini-batch logic and real-time image augmentation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T01%3A01%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_hal_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improving%20skin-disease%20classification%20based%20on%20customized%20loss%20function%20combined%20with%20balanced%20mini-batch%20logic%20and%20real-time%20image%20augmentation&rft.jtitle=IEEE%20access&rft.au=Pham,%20Tri-Cong&rft.date=2020-01-01&rft.volume=8&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2020.3016653&rft_dat=%3Cproquest_hal_p%3E2454643189%3C/proquest_hal_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2454643189&rft_id=info:pmid/&rft_ieee_id=9167198&rft_doaj_id=oai_doaj_org_article_2e317fcda2314ed897f00e866d9ee062&rfr_iscdi=true