Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study
Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using...
Gespeichert in:
Veröffentlicht in: | Applied biochemistry and biotechnology 2020-02, Vol.190 (2), p.341-359 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 359 |
---|---|
container_issue | 2 |
container_start_page | 341 |
container_title | Applied biochemistry and biotechnology |
container_volume | 190 |
creator | Verma, Anurag Kumar Pal, Saurabh Kumar, Surjeet |
description | Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction. |
doi_str_mv | 10.1007/s12010-019-03093-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2265788254</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2264475699</sourcerecordid><originalsourceid>FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</originalsourceid><addsrcrecordid>eNp9kcFu1DAQhi0EokvhBTggS1y4pIwnsRMf0bYFpFZUanu2vPGkdUmcxU6Q2hMPwRPyJHibAhKHnqzxfPPPSB9jrwUcCID6fRIIAgoQuoASdFncPWErIWUuUYunbAVYlwVio_fYi5RuAAQ2sn7O9kpRSlBKrVg6i-R8O_kx8LHj51994Ic-kU3EL5MPV_woJBo2PfFDO1l-6sPu84La6-C_zZS4DY4fk53mSPycelqyTmm6Ht2vHz8tX4_D1kY7-e8ZmGZ3-5I962yf6NXDu88uj48u1p-Kky8fP68_nBRtJXAqqHWbbuMkKVBOC1SyQuEk2FqUAlsrJequUVapWmHT1dhUAloCXSlACU25z94tuds47k6dzOBTS31vA41zMpgj66ZBWWX07X_ozTjHkK_bUVVVS6V1pnCh2jimFKkz2-gHG2-NALNTYhYlJisx90rMXR568xA9bwZyf0f-OMhAuQApt8IVxX-7H4n9DRhTlvg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2264475699</pqid></control><display><type>article</type><title>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</title><source>MEDLINE</source><source>SpringerLink Journals - AutoHoldings</source><creator>Verma, Anurag Kumar ; Pal, Saurabh ; Kumar, Surjeet</creator><creatorcontrib>Verma, Anurag Kumar ; Pal, Saurabh ; Kumar, Surjeet</creatorcontrib><description>Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction.</description><identifier>ISSN: 0273-2289</identifier><identifier>EISSN: 1559-0291</identifier><identifier>DOI: 10.1007/s12010-019-03093-z</identifier><identifier>PMID: 31350666</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial intelligence ; Bayes Theorem ; Biochemistry ; Biotechnology ; Chemistry ; Chemistry and Materials Science ; Classifiers ; Comparative studies ; Data Mining ; Datasets ; Dermatology ; Humans ; Learning algorithms ; Machine Learning ; Predictive Value of Tests ; Scientific papers ; Skin diseases ; Skin Diseases - diagnosis</subject><ispartof>Applied biochemistry and biotechnology, 2020-02, Vol.190 (2), p.341-359</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2019</rights><rights>Applied Biochemistry and Biotechnology is a copyright of Springer, (2019). All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</citedby><cites>FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</cites><orcidid>0000-0001-9545-7481</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s12010-019-03093-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s12010-019-03093-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31350666$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Verma, Anurag Kumar</creatorcontrib><creatorcontrib>Pal, Saurabh</creatorcontrib><creatorcontrib>Kumar, Surjeet</creatorcontrib><title>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</title><title>Applied biochemistry and biotechnology</title><addtitle>Appl Biochem Biotechnol</addtitle><addtitle>Appl Biochem Biotechnol</addtitle><description>Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction.</description><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>Bayes Theorem</subject><subject>Biochemistry</subject><subject>Biotechnology</subject><subject>Chemistry</subject><subject>Chemistry and Materials Science</subject><subject>Classifiers</subject><subject>Comparative studies</subject><subject>Data Mining</subject><subject>Datasets</subject><subject>Dermatology</subject><subject>Humans</subject><subject>Learning algorithms</subject><subject>Machine Learning</subject><subject>Predictive Value of Tests</subject><subject>Scientific papers</subject><subject>Skin diseases</subject><subject>Skin Diseases - diagnosis</subject><issn>0273-2289</issn><issn>1559-0291</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNp9kcFu1DAQhi0EokvhBTggS1y4pIwnsRMf0bYFpFZUanu2vPGkdUmcxU6Q2hMPwRPyJHibAhKHnqzxfPPPSB9jrwUcCID6fRIIAgoQuoASdFncPWErIWUuUYunbAVYlwVio_fYi5RuAAQ2sn7O9kpRSlBKrVg6i-R8O_kx8LHj51994Ic-kU3EL5MPV_woJBo2PfFDO1l-6sPu84La6-C_zZS4DY4fk53mSPycelqyTmm6Ht2vHz8tX4_D1kY7-e8ZmGZ3-5I962yf6NXDu88uj48u1p-Kky8fP68_nBRtJXAqqHWbbuMkKVBOC1SyQuEk2FqUAlsrJequUVapWmHT1dhUAloCXSlACU25z94tuds47k6dzOBTS31vA41zMpgj66ZBWWX07X_ozTjHkK_bUVVVS6V1pnCh2jimFKkz2-gHG2-NALNTYhYlJisx90rMXR568xA9bwZyf0f-OMhAuQApt8IVxX-7H4n9DRhTlvg</recordid><startdate>20200201</startdate><enddate>20200201</enddate><creator>Verma, Anurag Kumar</creator><creator>Pal, Saurabh</creator><creator>Kumar, Surjeet</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7ST</scope><scope>7T7</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>RC3</scope><scope>SOI</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9545-7481</orcidid></search><sort><creationdate>20200201</creationdate><title>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</title><author>Verma, Anurag Kumar ; Pal, Saurabh ; Kumar, Surjeet</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c412t-ecdbfbd5e606d91265421d50a71312ca5529f86a667628f728410ce0946025083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>Bayes Theorem</topic><topic>Biochemistry</topic><topic>Biotechnology</topic><topic>Chemistry</topic><topic>Chemistry and Materials Science</topic><topic>Classifiers</topic><topic>Comparative studies</topic><topic>Data Mining</topic><topic>Datasets</topic><topic>Dermatology</topic><topic>Humans</topic><topic>Learning algorithms</topic><topic>Machine Learning</topic><topic>Predictive Value of Tests</topic><topic>Scientific papers</topic><topic>Skin diseases</topic><topic>Skin Diseases - diagnosis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Verma, Anurag Kumar</creatorcontrib><creatorcontrib>Pal, Saurabh</creatorcontrib><creatorcontrib>Kumar, Surjeet</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Environment Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Nucleic Acids Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>Environment Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Applied biochemistry and biotechnology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Verma, Anurag Kumar</au><au>Pal, Saurabh</au><au>Kumar, Surjeet</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study</atitle><jtitle>Applied biochemistry and biotechnology</jtitle><stitle>Appl Biochem Biotechnol</stitle><addtitle>Appl Biochem Biotechnol</addtitle><date>2020-02-01</date><risdate>2020</risdate><volume>190</volume><issue>2</issue><spage>341</spage><epage>359</epage><pages>341-359</pages><issn>0273-2289</issn><eissn>1559-0291</eissn><abstract>Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction.</abstract><cop>New York</cop><pub>Springer US</pub><pmid>31350666</pmid><doi>10.1007/s12010-019-03093-z</doi><tpages>19</tpages><orcidid>https://orcid.org/0000-0001-9545-7481</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0273-2289 |
ispartof | Applied biochemistry and biotechnology, 2020-02, Vol.190 (2), p.341-359 |
issn | 0273-2289 1559-0291 |
language | eng |
recordid | cdi_proquest_miscellaneous_2265788254 |
source | MEDLINE; SpringerLink Journals - AutoHoldings |
subjects | Algorithms Artificial intelligence Bayes Theorem Biochemistry Biotechnology Chemistry Chemistry and Materials Science Classifiers Comparative studies Data Mining Datasets Dermatology Humans Learning algorithms Machine Learning Predictive Value of Tests Scientific papers Skin diseases Skin Diseases - diagnosis |
title | Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T19%3A24%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20Skin%20Disease%20Using%20Ensemble%20Data%20Mining%20Techniques%20and%20Feature%20Selection%20Method%E2%80%94a%20Comparative%20Study&rft.jtitle=Applied%20biochemistry%20and%20biotechnology&rft.au=Verma,%20Anurag%20Kumar&rft.date=2020-02-01&rft.volume=190&rft.issue=2&rft.spage=341&rft.epage=359&rft.pages=341-359&rft.issn=0273-2289&rft.eissn=1559-0291&rft_id=info:doi/10.1007/s12010-019-03093-z&rft_dat=%3Cproquest_cross%3E2264475699%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2264475699&rft_id=info:pmid/31350666&rfr_iscdi=true |