An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data

Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied geomatics 2020-03, Vol.12 (1), p.95-105
Hauptverfasser: Chutia, Dibyajyoti, Borah, Naiwrita, Baruah, Diganta, Bhattacharyya, Dhruba Kumar, Raju, P. L. N., Sarma, K. K.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 105
container_issue 1
container_start_page 95
container_title Applied geomatics
container_volume 12
creator Chutia, Dibyajyoti
Borah, Naiwrita
Baruah, Diganta
Bhattacharyya, Dhruba Kumar
Raju, P. L. N.
Sarma, K. K.
description Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers.
doi_str_mv 10.1007/s12518-019-00281-8
format Article
fullrecord <record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_journals_2356804974</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A717942000</galeid><sourcerecordid>A717942000</sourcerecordid><originalsourceid>FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</originalsourceid><addsrcrecordid>eNp9kU1PAyEQhjdGE436BzyRePKwCuwXe2yMWhMTEz8Sb2TKDi1NCxVYY_-9rOtHvAgHhuF5YYY3y04YPWeUNheB8YqJnLI2p5QLloud7ICJus5bLl52f-JW7GfHISzpMBpalfwg6yeWoNaoonlDApuNd6AWRDtPzDpt3oydk7hIR0r1HtSWOE2AeLCdWw8YhkjUCkIw2mAS2U_6O6MgGmcHzXS7QT_EHUQ4yvY0rAIef62H2fP11dPlNL-7v7m9nNzlqmh5zEFUmqIoNKsrPcOSCcYAmkZ1s4Kpru503VZCVBw1K8tacyoo77BpmdIUqllxmJ2O96ZOXvtUqly63tv0pORFVQtatk2ZqPORmsMKpbHaxdRpmh2ujXIWtUn5ScOatuTp65Lg7I8gMRHf4xz6EOTt48Nflo-s8i4Ej1puvFmD30pG5WCfHO2TyT75aZ8USVSMopBgO0f_W_c_qg_aE50p</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2356804974</pqid></control><display><type>article</type><title>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</title><source>SpringerNature Journals</source><creator>Chutia, Dibyajyoti ; Borah, Naiwrita ; Baruah, Diganta ; Bhattacharyya, Dhruba Kumar ; Raju, P. L. N. ; Sarma, K. K.</creator><creatorcontrib>Chutia, Dibyajyoti ; Borah, Naiwrita ; Baruah, Diganta ; Bhattacharyya, Dhruba Kumar ; Raju, P. L. N. ; Sarma, K. K.</creatorcontrib><description>Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers.</description><identifier>ISSN: 1866-9298</identifier><identifier>EISSN: 1866-928X</identifier><identifier>DOI: 10.1007/s12518-019-00281-8</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Accuracy ; Analysis ; Classification ; Earth ; Earth and Environmental Science ; Geographical Information Systems/Cartography ; Geography ; Geophysics/Geodesy ; Machine learning ; Measurement Science and Instrumentation ; Original Paper ; Parameters ; Principal components analysis ; Remote Sensing/Photogrammetry ; Surveying</subject><ispartof>Applied geomatics, 2020-03, Vol.12 (1), p.95-105</ispartof><rights>Società Italiana di Fotogrammetria e Topografia (SIFET) 2019</rights><rights>COPYRIGHT 2020 Springer</rights><rights>2019© Società Italiana di Fotogrammetria e Topografia (SIFET) 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</citedby><cites>FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</cites><orcidid>0000-0002-6812-1007</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s12518-019-00281-8$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s12518-019-00281-8$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>315,782,786,27933,27934,41497,42566,51328</link.rule.ids></links><search><creatorcontrib>Chutia, Dibyajyoti</creatorcontrib><creatorcontrib>Borah, Naiwrita</creatorcontrib><creatorcontrib>Baruah, Diganta</creatorcontrib><creatorcontrib>Bhattacharyya, Dhruba Kumar</creatorcontrib><creatorcontrib>Raju, P. L. N.</creatorcontrib><creatorcontrib>Sarma, K. K.</creatorcontrib><title>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</title><title>Applied geomatics</title><addtitle>Appl Geomat</addtitle><description>Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers.</description><subject>Accuracy</subject><subject>Analysis</subject><subject>Classification</subject><subject>Earth</subject><subject>Earth and Environmental Science</subject><subject>Geographical Information Systems/Cartography</subject><subject>Geography</subject><subject>Geophysics/Geodesy</subject><subject>Machine learning</subject><subject>Measurement Science and Instrumentation</subject><subject>Original Paper</subject><subject>Parameters</subject><subject>Principal components analysis</subject><subject>Remote Sensing/Photogrammetry</subject><subject>Surveying</subject><issn>1866-9298</issn><issn>1866-928X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kU1PAyEQhjdGE436BzyRePKwCuwXe2yMWhMTEz8Sb2TKDi1NCxVYY_-9rOtHvAgHhuF5YYY3y04YPWeUNheB8YqJnLI2p5QLloud7ICJus5bLl52f-JW7GfHISzpMBpalfwg6yeWoNaoonlDApuNd6AWRDtPzDpt3oydk7hIR0r1HtSWOE2AeLCdWw8YhkjUCkIw2mAS2U_6O6MgGmcHzXS7QT_EHUQ4yvY0rAIef62H2fP11dPlNL-7v7m9nNzlqmh5zEFUmqIoNKsrPcOSCcYAmkZ1s4Kpru503VZCVBw1K8tacyoo77BpmdIUqllxmJ2O96ZOXvtUqly63tv0pORFVQtatk2ZqPORmsMKpbHaxdRpmh2ujXIWtUn5ScOatuTp65Lg7I8gMRHf4xz6EOTt48Nflo-s8i4Ej1puvFmD30pG5WCfHO2TyT75aZ8USVSMopBgO0f_W_c_qg_aE50p</recordid><startdate>20200301</startdate><enddate>20200301</enddate><creator>Chutia, Dibyajyoti</creator><creator>Borah, Naiwrita</creator><creator>Baruah, Diganta</creator><creator>Bhattacharyya, Dhruba Kumar</creator><creator>Raju, P. L. N.</creator><creator>Sarma, K. K.</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>7TN</scope><scope>F1W</scope><scope>H96</scope><scope>L.G</scope><orcidid>https://orcid.org/0000-0002-6812-1007</orcidid></search><sort><creationdate>20200301</creationdate><title>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</title><author>Chutia, Dibyajyoti ; Borah, Naiwrita ; Baruah, Diganta ; Bhattacharyya, Dhruba Kumar ; Raju, P. L. N. ; Sarma, K. K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c392t-a85f0e83f165fbe41811aa77cdb31cd6df6958852ef1446f20802de791cf0a5b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Analysis</topic><topic>Classification</topic><topic>Earth</topic><topic>Earth and Environmental Science</topic><topic>Geographical Information Systems/Cartography</topic><topic>Geography</topic><topic>Geophysics/Geodesy</topic><topic>Machine learning</topic><topic>Measurement Science and Instrumentation</topic><topic>Original Paper</topic><topic>Parameters</topic><topic>Principal components analysis</topic><topic>Remote Sensing/Photogrammetry</topic><topic>Surveying</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chutia, Dibyajyoti</creatorcontrib><creatorcontrib>Borah, Naiwrita</creatorcontrib><creatorcontrib>Baruah, Diganta</creatorcontrib><creatorcontrib>Bhattacharyya, Dhruba Kumar</creatorcontrib><creatorcontrib>Raju, P. L. N.</creatorcontrib><creatorcontrib>Sarma, K. K.</creatorcontrib><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>Oceanic Abstracts</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><jtitle>Applied geomatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chutia, Dibyajyoti</au><au>Borah, Naiwrita</au><au>Baruah, Diganta</au><au>Bhattacharyya, Dhruba Kumar</au><au>Raju, P. L. N.</au><au>Sarma, K. K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data</atitle><jtitle>Applied geomatics</jtitle><stitle>Appl Geomat</stitle><date>2020-03-01</date><risdate>2020</risdate><volume>12</volume><issue>1</issue><spage>95</spage><epage>105</epage><pages>95-105</pages><issn>1866-9298</issn><eissn>1866-928X</eissn><abstract>Random forest (RF) is one of the most powerful ensemble classifiers often used in machine learning applications. It has been found successful on many benchmarked data. However, the performance of an RF model is highly affected by the calibration of the model parameters. It requires optimization of two parameters—(i) size of RF and (ii) number of features. RF is based on the principle of bagging and random selection of relevant features. This paper conveys an effective method in improving classification accuracy of RF. The principal component analysis (PCA) technique was used for dimension reduction of spectral bands whereas correlation-based feature selection (CFS) was used to identify the optimal set of features. RF was initialized by 10 random trees with an increment of 10, with a variable number of features till the model achieved its highest accuracy. The model was tested with variable sample sizes in order to observe the effectiveness. An investigation was carried out on Hyperion sensor data of the Earth Observing-1 (EO-1) satellite. The performance of RF was observed to be significantly enhanced in terms of predictive ability and computational expenses with the optimized set of features and number of random trees as base classifiers. While comparing with the other advanced classifiers like a support vector machine (SVM), multilayer perceptron (MLP) and maximum likelihood classifier (MLC), the optimized RF outperformed all the other classifiers.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s12518-019-00281-8</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-6812-1007</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1866-9298
ispartof Applied geomatics, 2020-03, Vol.12 (1), p.95-105
issn 1866-9298
1866-928X
language eng
recordid cdi_proquest_journals_2356804974
source SpringerNature Journals
subjects Accuracy
Analysis
Classification
Earth
Earth and Environmental Science
Geographical Information Systems/Cartography
Geography
Geophysics/Geodesy
Machine learning
Measurement Science and Instrumentation
Original Paper
Parameters
Principal components analysis
Remote Sensing/Photogrammetry
Surveying
title An effective approach for improving the accuracy of a random forest classifier in the classification of Hyperion data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-01T06%3A07%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20effective%20approach%20for%20improving%20the%20accuracy%20of%20a%20random%20forest%20classifier%20in%20the%20classification%20of%20Hyperion%20data&rft.jtitle=Applied%20geomatics&rft.au=Chutia,%20Dibyajyoti&rft.date=2020-03-01&rft.volume=12&rft.issue=1&rft.spage=95&rft.epage=105&rft.pages=95-105&rft.issn=1866-9298&rft.eissn=1866-928X&rft_id=info:doi/10.1007/s12518-019-00281-8&rft_dat=%3Cgale_proqu%3EA717942000%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2356804974&rft_id=info:pmid/&rft_galeid=A717942000&rfr_iscdi=true