An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data
Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of t...
Gespeichert in:
Veröffentlicht in: | Applied geomatics 2020-03, Vol.12 (1), p.95-105 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 105 |
---|---|
container_issue | 1 |
container_start_page | 95 |
container_title | Applied geomatics |
container_volume | 12 |
creator | Chutia, Dibyajyoti Borah, Naiwrita Baruah, Diganta Bhattacharyya, Dhruba Kumar Raju, P. L. N. Sarma, K. K. |
description | Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers. |
doi_str_mv | 10.1007/s12518-019-00281-8 |
format | Article |
fullrecord | <record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_journals_2356804974</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A717942000</galeid><sourcerecordid>A717942000</sourcerecordid><originalsourceid>FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</originalsourceid><addsrcrecordid>eNp9kU1PAyEQhjdGE436BzyRePKwCuwXe2yMWhMTEz8Sb2TKDi1NCxVYY_-9rOtHvAgHhuF5YYY3y04YPWeUNheB8YqJnLI2p5QLloud7ICJus5bLl52f-JW7GfHISzpMBpalfwg6yeWoNaoonlDApuNd6AWRDtPzDpt3oydk7hIR0r1HtSWOE2AeLCdWw8YhkjUCkIw2mAS2U_6O6MgGmcHzXS7QT_EHUQ4yvY0rAIef62H2fP11dPlNL-7v7m9nNzlqmh5zEFUmqIoNKsrPcOSCcYAmkZ1s4Kpru503VZCVBw1K8tacyoo77BpmdIUqllxmJ2O96ZOXvtUqly63tv0pORFVQtatk2ZqPORmsMKpbHaxdRpmh2ujXIWtUn5ScOatuTp65Lg7I8gMRHf4xz6EOTt48Nflo-s8i4Ej1puvFmD30pG5WCfHO2TyT75aZ8USVSMopBgO0f_W_c_qg_aE50p</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2356804974</pqid></control><display><type>article</type><title>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</title><source>SpringerNature Journals</source><creator>Chutia, Dibyajyoti ; Borah, Naiwrita ; Baruah, Diganta ; Bhattacharyya, Dhruba Kumar ; Raju, P. L. N. ; Sarma, K. K.</creator><creatorcontrib>Chutia, Dibyajyoti ; Borah, Naiwrita ; Baruah, Diganta ; Bhattacharyya, Dhruba Kumar ; Raju, P. L. N. ; Sarma, K. K.</creatorcontrib><description>Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers.</description><identifier>ISSN: 1866-9298</identifier><identifier>EISSN: 1866-928X</identifier><identifier>DOI: 10.1007/s12518-019-00281-8</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Accuracy ; Analysis ; Classification ; Earth ; Earth and Environmental Science ; Geographical Information Systems/Cartography ; Geography ; Geophysics/Geodesy ; Machine learning ; Measurement Science and Instrumentation ; Original Paper ; Parameters ; Principal components analysis ; Remote Sensing/Photogrammetry ; Surveying</subject><ispartof>Applied geomatics, 2020-03, Vol.12 (1), p.95-105</ispartof><rights>Società Italiana di Fotogrammetria e Topografia (SIFET) 2019</rights><rights>COPYRIGHT 2020 Springer</rights><rights>2019© Società Italiana di Fotogrammetria e Topografia (SIFET) 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</citedby><cites>FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</cites><orcidid>0000-0002-6812-1007</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s12518-019-00281-8$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s12518-019-00281-8$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>315,782,786,27933,27934,41497,42566,51328</link.rule.ids></links><search><creatorcontrib>Chutia, Dibyajyoti</creatorcontrib><creatorcontrib>Borah, Naiwrita</creatorcontrib><creatorcontrib>Baruah, Diganta</creatorcontrib><creatorcontrib>Bhattacharyya, Dhruba Kumar</creatorcontrib><creatorcontrib>Raju, P. L. N.</creatorcontrib><creatorcontrib>Sarma, K. K.</creatorcontrib><title>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</title><title>Applied geomatics</title><addtitle>Appl Geomat</addtitle><description>Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers.</description><subject>Accuracy</subject><subject>Analysis</subject><subject>Classification</subject><subject>Earth</subject><subject>Earth and Environmental Science</subject><subject>Geographical Information Systems/Cartography</subject><subject>Geography</subject><subject>Geophysics/Geodesy</subject><subject>Machine learning</subject><subject>Measurement Science and Instrumentation</subject><subject>Original Paper</subject><subject>Parameters</subject><subject>Principal components analysis</subject><subject>Remote Sensing/Photogrammetry</subject><subject>Surveying</subject><issn>1866-9298</issn><issn>1866-928X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kU1PAyEQhjdGE436BzyRePKwCuwXe2yMWhMTEz8Sb2TKDi1NCxVYY_-9rOtHvAgHhuF5YYY3y04YPWeUNheB8YqJnLI2p5QLloud7ICJus5bLl52f-JW7GfHISzpMBpalfwg6yeWoNaoonlDApuNd6AWRDtPzDpt3oydk7hIR0r1HtSWOE2AeLCdWw8YhkjUCkIw2mAS2U_6O6MgGmcHzXS7QT_EHUQ4yvY0rAIef62H2fP11dPlNL-7v7m9nNzlqmh5zEFUmqIoNKsrPcOSCcYAmkZ1s4Kpru503VZCVBw1K8tacyoo77BpmdIUqllxmJ2O96ZOXvtUqly63tv0pORFVQtatk2ZqPORmsMKpbHaxdRpmh2ujXIWtUn5ScOatuTp65Lg7I8gMRHf4xz6EOTt48Nflo-s8i4Ej1puvFmD30pG5WCfHO2TyT75aZ8USVSMopBgO0f_W_c_qg_aE50p</recordid><startdate>20200301</startdate><enddate>20200301</enddate><creator>Chutia, Dibyajyoti</creator><creator>Borah, Naiwrita</creator><creator>Baruah, Diganta</creator><creator>Bhattacharyya, Dhruba Kumar</creator><creator>Raju, P. L. N.</creator><creator>Sarma, K. K.</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>7TN</scope><scope>F1W</scope><scope>H96</scope><scope>L.G</scope><orcidid>https://orcid.org/0000-0002-6812-1007</orcidid></search><sort><creationdate>20200301</creationdate><title>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</title><author>Chutia, Dibyajyoti ; Borah, Naiwrita ; Baruah, Diganta ; Bhattacharyya, Dhruba Kumar ; Raju, P. L. N. ; Sarma, K. K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Analysis</topic><topic>Classification</topic><topic>Earth</topic><topic>Earth and Environmental Science</topic><topic>Geographical Information Systems/Cartography</topic><topic>Geography</topic><topic>Geophysics/Geodesy</topic><topic>Machine learning</topic><topic>Measurement Science and Instrumentation</topic><topic>Original Paper</topic><topic>Parameters</topic><topic>Principal components analysis</topic><topic>Remote Sensing/Photogrammetry</topic><topic>Surveying</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chutia, Dibyajyoti</creatorcontrib><creatorcontrib>Borah, Naiwrita</creatorcontrib><creatorcontrib>Baruah, Diganta</creatorcontrib><creatorcontrib>Bhattacharyya, Dhruba Kumar</creatorcontrib><creatorcontrib>Raju, P. L. N.</creatorcontrib><creatorcontrib>Sarma, K. K.</creatorcontrib><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>Oceanic Abstracts</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) Professional</collection><jtitle>Applied geomatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chutia, Dibyajyoti</au><au>Borah, Naiwrita</au><au>Baruah, Diganta</au><au>Bhattacharyya, Dhruba Kumar</au><au>Raju, P. L. N.</au><au>Sarma, K. K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</atitle><jtitle>Applied geomatics</jtitle><stitle>Appl Geomat</stitle><date>2020-03-01</date><risdate>2020</risdate><volume>12</volume><issue>1</issue><spage>95</spage><epage>105</epage><pages>95-105</pages><issn>1866-9298</issn><eissn>1866-928X</eissn><abstract>Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s12518-019-00281-8</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-6812-1007</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1866-9298 |
ispartof | Applied geomatics, 2020-03, Vol.12 (1), p.95-105 |
issn | 1866-9298 1866-928X |
language | eng |
recordid | cdi_proquest_journals_2356804974 |
source | SpringerNature Journals |
subjects | Accuracy Analysis Classification Earth Earth and Environmental Science Geographical Information Systems/Cartography Geography Geophysics/Geodesy Machine learning Measurement Science and Instrumentation Original Paper Parameters Principal components analysis Remote Sensing/Photogrammetry Surveying |
title | An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-01T06%3A07%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20effective%20approach%20for%20improving%20the%20accuracy%20of%20a%20random%20forest%20classifier%20in%20the%20classification%20of%20Hyperion%20data&rft.jtitle=Applied%20geomatics&rft.au=Chutia,%20Dibyajyoti&rft.date=2020-03-01&rft.volume=12&rft.issue=1&rft.spage=95&rft.epage=105&rft.pages=95-105&rft.issn=1866-9298&rft.eissn=1866-928X&rft_id=info:doi/10.1007/s12518-019-00281-8&rft_dat=%3Cgale_proqu%3EA717942000%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2356804974&rft_id=info:pmid/&rft_galeid=A717942000&rfr_iscdi=true |