Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study

Skin disease is now a major problem for people worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithms to categorize different classes of skin disease using...

Bibliographic details
Published in: Applied biochemistry and biotechnology 2020-02, Vol.190 (2), p.341-359
Main authors: Verma, Anurag Kumar; Pal, Saurabh; Kumar, Surjeet
Format: Article
Language: eng
Online access: Full text
container_end_page 359
container_issue 2
container_start_page 341
container_title Applied biochemistry and biotechnology
container_volume 190
creator Verma, Anurag Kumar
Pal, Saurabh
Kumar, Surjeet
description Skin disease is now a major problem for people worldwide, and different machine learning techniques are applied to predict its various classes. In this research paper, we apply six machine learning algorithms to categorize the classes of skin disease using three ensemble techniques, followed by a feature selection method, and compare the results obtained from the different techniques. The proposed method applies six data mining classification techniques and then builds an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers to predict the different classes of skin disease. Further, a feature importance method is used to select the 15 most important features, which play a major role in prediction. The resulting 15-feature subset of the original dataset is used to compare the six machine learning techniques and the ensemble approach against their results on the whole dataset. The outcome shows that dermatological prediction accuracy on the test dataset is higher than with an individual classifier, and that better accuracy is obtained than with the subset produced by the feature selection method. The ensemble method and feature selection applied to dermatology datasets perform better than individual classifier algorithms, and the ensemble method gives more accurate and effective skin disease prediction.
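The workflow described in the abstract (three ensemble classifiers, then a 15-feature subset chosen by feature importance, then a comparison of both runs) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses scikit-learn, a synthetic dataset from `make_classification` in place of the dermatology data, random-forest importances as the ranking method, and arbitrary hyperparameters. The record does not name the six base classifiers or the exact importance method, so none of those are reproduced here.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    AdaBoostClassifier,
    BaggingClassifier,
    GradientBoostingClassifier,
    RandomForestClassifier,
)
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# ASSUMPTION: synthetic stand-in data; sizes and class count are illustrative only.
X, y = make_classification(
    n_samples=400, n_features=34, n_informative=12,
    n_classes=6, n_clusters_per_class=1, random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)


def make_ensembles():
    """Return fresh instances of the three ensemble techniques named in the abstract."""
    return {
        "bagging": BaggingClassifier(n_estimators=50, random_state=0),
        "adaboost": AdaBoostClassifier(n_estimators=50, random_state=0),
        "gradient_boosting": GradientBoostingClassifier(n_estimators=50, random_state=0),
    }


def evaluate(X_tr, X_te):
    """Fit each ensemble on the training split and return its test accuracy."""
    scores = {}
    for name, model in make_ensembles().items():
        model.fit(X_tr, y_train)
        scores[name] = accuracy_score(y_test, model.predict(X_te))
    return scores


# ASSUMPTION: rank features by random-forest impurity importance and keep the top 15.
ranker = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)
top15 = np.argsort(ranker.feature_importances_)[::-1][:15]

full_scores = evaluate(X_train, X_test)                          # all features
subset_scores = evaluate(X_train[:, top15], X_test[:, top15])    # 15 selected features

for name in full_scores:
    print(f"{name}: full={full_scores[name]:.3f} subset={subset_scores[name]:.3f}")
```

The feature ranking is fitted on the training split only, so the test split stays unseen during selection. Accuracies printed by this sketch come from synthetic data and say nothing about the results reported in the article.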
doi_str_mv 10.1007/s12010-019-03093-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2265788254</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2264475699</sourcerecordid><originalsourceid>FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</originalsourceid><addsrcrecordid>eNp9kcFu1DAQhi0EokvhBTggS1y4pIwnsRMf0bYFpFZUanu2vPGkdUmcxU6Q2hMPwRPyJHibAhKHnqzxfPPPSB9jrwUcCID6fRIIAgoQuoASdFncPWErIWUuUYunbAVYlwVio_fYi5RuAAQ2sn7O9kpRSlBKrVg6i-R8O_kx8LHj51994Ic-kU3EL5MPV_woJBo2PfFDO1l-6sPu84La6-C_zZS4DY4fk53mSPycelqyTmm6Ht2vHz8tX4_D1kY7-e8ZmGZ3-5I962yf6NXDu88uj48u1p-Kky8fP68_nBRtJXAqqHWbbuMkKVBOC1SyQuEk2FqUAlsrJequUVapWmHT1dhUAloCXSlACU25z94tuds47k6dzOBTS31vA41zMpgj66ZBWWX07X_ozTjHkK_bUVVVS6V1pnCh2jimFKkz2-gHG2-NALNTYhYlJisx90rMXR568xA9bwZyf0f-OMhAuQApt8IVxX-7H4n9DRhTlvg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2264475699</pqid></control><display><type>article</type><title>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</title><source>MEDLINE</source><source>SpringerLink Journals - AutoHoldings</source><creator>Verma, Anurag Kumar ; Pal, Saurabh ; Kumar, Surjeet</creator><creatorcontrib>Verma, Anurag Kumar ; Pal, Saurabh ; Kumar, Surjeet</creatorcontrib><description>Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. 
In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. 
Ensemble method gives more accurate and effective skin disease prediction.</description><identifier>ISSN: 0273-2289</identifier><identifier>EISSN: 1559-0291</identifier><identifier>DOI: 10.1007/s12010-019-03093-z</identifier><identifier>PMID: 31350666</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial intelligence ; Bayes Theorem ; Biochemistry ; Biotechnology ; Chemistry ; Chemistry and Materials Science ; Classifiers ; Comparative studies ; Data Mining ; Datasets ; Dermatology ; Humans ; Learning algorithms ; Machine Learning ; Predictive Value of Tests ; Scientific papers ; Skin diseases ; Skin Diseases - diagnosis</subject><ispartof>Applied biochemistry and biotechnology, 2020-02, Vol.190 (2), p.341-359</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2019</rights><rights>Applied Biochemistry and Biotechnology is a copyright of Springer, (2019). All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</citedby><cites>FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</cites><orcidid>0000-0001-9545-7481</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s12010-019-03093-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s12010-019-03093-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31350666$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Verma, Anurag Kumar</creatorcontrib><creatorcontrib>Pal, 
Saurabh</creatorcontrib><creatorcontrib>Kumar, Surjeet</creatorcontrib><title>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</title><title>Applied biochemistry and biotechnology</title><addtitle>Appl Biochem Biotechnol</addtitle><addtitle>Appl Biochem Biotechnol</addtitle><description>Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. 
Ensemble method gives more accurate and effective skin disease prediction.</description><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>Bayes Theorem</subject><subject>Biochemistry</subject><subject>Biotechnology</subject><subject>Chemistry</subject><subject>Chemistry and Materials Science</subject><subject>Classifiers</subject><subject>Comparative studies</subject><subject>Data Mining</subject><subject>Datasets</subject><subject>Dermatology</subject><subject>Humans</subject><subject>Learning algorithms</subject><subject>Machine Learning</subject><subject>Predictive Value of Tests</subject><subject>Scientific papers</subject><subject>Skin diseases</subject><subject>Skin Diseases - diagnosis</subject><issn>0273-2289</issn><issn>1559-0291</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNp9kcFu1DAQhi0EokvhBTggS1y4pIwnsRMf0bYFpFZUanu2vPGkdUmcxU6Q2hMPwRPyJHibAhKHnqzxfPPPSB9jrwUcCID6fRIIAgoQuoASdFncPWErIWUuUYunbAVYlwVio_fYi5RuAAQ2sn7O9kpRSlBKrVg6i-R8O_kx8LHj51994Ic-kU3EL5MPV_woJBo2PfFDO1l-6sPu84La6-C_zZS4DY4fk53mSPycelqyTmm6Ht2vHz8tX4_D1kY7-e8ZmGZ3-5I962yf6NXDu88uj48u1p-Kky8fP68_nBRtJXAqqHWbbuMkKVBOC1SyQuEk2FqUAlsrJequUVapWmHT1dhUAloCXSlACU25z94tuds47k6dzOBTS31vA41zMpgj66ZBWWX07X_ozTjHkK_bUVVVS6V1pnCh2jimFKkz2-gHG2-NALNTYhYlJisx90rMXR568xA9bwZyf0f-OMhAuQApt8IVxX-7H4n9DRhTlvg</recordid><startdate>20200201</startdate><enddate>20200201</enddate><creator>Verma, Anurag Kumar</creator><creator>Pal, Saurabh</creator><creator>Kumar, Surjeet</creator><general>Springer US</general><general>Springer Nature 
B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7ST</scope><scope>7T7</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>RC3</scope><scope>SOI</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9545-7481</orcidid></search><sort><creationdate>20200201</creationdate><title>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</title><author>Verma, Anurag Kumar ; Pal, Saurabh ; Kumar, Surjeet</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>Bayes Theorem</topic><topic>Biochemistry</topic><topic>Biotechnology</topic><topic>Chemistry</topic><topic>Chemistry and Materials Science</topic><topic>Classifiers</topic><topic>Comparative studies</topic><topic>Data Mining</topic><topic>Datasets</topic><topic>Dermatology</topic><topic>Humans</topic><topic>Learning 
algorithms</topic><topic>Machine Learning</topic><topic>Predictive Value of Tests</topic><topic>Scientific papers</topic><topic>Skin diseases</topic><topic>Skin Diseases - diagnosis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Verma, Anurag Kumar</creatorcontrib><creatorcontrib>Pal, Saurabh</creatorcontrib><creatorcontrib>Kumar, Surjeet</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Environment Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Nucleic Acids Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution 
Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>Environment Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Applied biochemistry and biotechnology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Verma, Anurag Kumar</au><au>Pal, Saurabh</au><au>Kumar, Surjeet</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</atitle><jtitle>Applied biochemistry and biotechnology</jtitle><stitle>Appl Biochem Biotechnol</stitle><addtitle>Appl Biochem Biotechnol</addtitle><date>2020-02-01</date><risdate>2020</risdate><volume>190</volume><issue>2</issue><spage>341</spage><epage>359</epage><pages>341-359</pages><issn>0273-2289</issn><eissn>1559-0291</eissn><abstract>Nowadays, skin disease is a major problem among 
peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction.</abstract><cop>New York</cop><pub>Springer US</pub><pmid>31350666</pmid><doi>10.1007/s12010-019-03093-z</doi><tpages>19</tpages><orcidid>https://orcid.org/0000-0001-9545-7481</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0273-2289
ispartof Applied biochemistry and biotechnology, 2020-02, Vol.190 (2), p.341-359
issn 0273-2289
1559-0291
language eng
recordid cdi_proquest_miscellaneous_2265788254
source MEDLINE; SpringerLink Journals - AutoHoldings
subjects Algorithms
Artificial intelligence
Bayes Theorem
Biochemistry
Biotechnology
Chemistry
Chemistry and Materials Science
Classifiers
Comparative studies
Data Mining
Datasets
Dermatology
Humans
Learning algorithms
Machine Learning
Predictive Value of Tests
Scientific papers
Skin diseases
Skin Diseases - diagnosis
title Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T19%3A24%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20Skin%20Disease%20Using%20Ensemble%20Data%20Mining%20Techniques%20and%20Feature%20Selection%20Method%E2%80%94a%20Comparative%20Study&rft.jtitle=Applied%20biochemistry%20and%20biotechnology&rft.au=Verma,%20Anurag%20Kumar&rft.date=2020-02-01&rft.volume=190&rft.issue=2&rft.spage=341&rft.epage=359&rft.pages=341-359&rft.issn=0273-2289&rft.eissn=1559-0291&rft_id=info:doi/10.1007/s12010-019-03093-z&rft_dat=%3Cproquest_cross%3E2264475699%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2264475699&rft_id=info:pmid/31350666&rfr_iscdi=true