Applying a weighted random forests method to extract karst sinkholes from LiDAR data

•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions impr...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of hydrology (Amsterdam) 2016-02, Vol.533, p.343-352
Hauptverfasser: Zhu, Junfeng, Pierskalla, William P.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 352
container_issue
container_start_page 343
container_title Journal of hydrology (Amsterdam)
container_volume 533
creator Zhu, Junfeng
Pierskalla, William P.
description •We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier. Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification.
doi_str_mv 10.1016/j.jhydrol.2015.12.012
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1786174347</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0022169415009518</els_id><sourcerecordid>1786174347</sourcerecordid><originalsourceid>FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</originalsourceid><addsrcrecordid>eNqNkEtPwzAQhC0EEqXwE5B85JLgRxLHJ1SVp1QJCZWz5dhO4zSJg20e_fekau-wh93LzGjnA-AaoxQjXNy2advstHddShDOU0xShMkJmOGS8YQwxE7BDCFCElzw7BxchNCiaSjNZmC9GMduZ4cNlPDb2E0TjYZeDtr1sHbehBhgb2LjNIwOmp_opYpwK32IMNhh27jOBFj7Sb6y94s3qGWUl-Csll0wV8c7B--PD-vlc7J6fXpZLlaJpLyMiSxpiTitc55LZaTONedVVmFOKUaUVDUvq4xVEqOClajiZa7qqUZeS6V0JQs6BzeH3NG7j8_pV9HboEzXycG4zyAwKwvMMpqxf0gLQouMTnsO8oNUeReCN7UYve2l3wmMxB64aMURuNgDF5iICfjkuzv4zFT5yxovgrJmUEZbb1QU2tk_En4B_g2MOQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1762364362</pqid></control><display><type>article</type><title>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</title><source>Elsevier ScienceDirect Journals</source><creator>Zhu, Junfeng ; Pierskalla, William P.</creator><creatorcontrib>Zhu, Junfeng ; Pierskalla, William P.</creatorcontrib><description>•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier. Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification.</description><identifier>ISSN: 0022-1694</identifier><identifier>EISSN: 1879-2707</identifier><identifier>DOI: 10.1016/j.jhydrol.2015.12.012</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Classifiers ; Depression ; Karst ; LiDAR ; Machine learning ; Sinkhole ; Sinkholes ; Surface water ; Training ; Visual inspection ; Weighted random forests</subject><ispartof>Journal of hydrology (Amsterdam), 2016-02, Vol.533, p.343-352</ispartof><rights>2015 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</citedby><cites>FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</cites><orcidid>0000-0002-6296-8637</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0022169415009518$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids></links><search><creatorcontrib>Zhu, Junfeng</creatorcontrib><creatorcontrib>Pierskalla, William P.</creatorcontrib><title>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</title><title>Journal of hydrology (Amsterdam)</title><description>•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier. Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification.</description><subject>Classifiers</subject><subject>Depression</subject><subject>Karst</subject><subject>LiDAR</subject><subject>Machine learning</subject><subject>Sinkhole</subject><subject>Sinkholes</subject><subject>Surface water</subject><subject>Training</subject><subject>Visual inspection</subject><subject>Weighted random forests</subject><issn>0022-1694</issn><issn>1879-2707</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNqNkEtPwzAQhC0EEqXwE5B85JLgRxLHJ1SVp1QJCZWz5dhO4zSJg20e_fekau-wh93LzGjnA-AaoxQjXNy2advstHddShDOU0xShMkJmOGS8YQwxE7BDCFCElzw7BxchNCiaSjNZmC9GMduZ4cNlPDb2E0TjYZeDtr1sHbehBhgb2LjNIwOmp_opYpwK32IMNhh27jOBFj7Sb6y94s3qGWUl-Csll0wV8c7B--PD-vlc7J6fXpZLlaJpLyMiSxpiTitc55LZaTONedVVmFOKUaUVDUvq4xVEqOClajiZa7qqUZeS6V0JQs6BzeH3NG7j8_pV9HboEzXycG4zyAwKwvMMpqxf0gLQouMTnsO8oNUeReCN7UYve2l3wmMxB64aMURuNgDF5iICfjkuzv4zFT5yxovgrJmUEZbb1QU2tk_En4B_g2MOQ</recordid><startdate>201602</startdate><enddate>201602</enddate><creator>Zhu, Junfeng</creator><creator>Pierskalla, William P.</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QH</scope><scope>7UA</scope><scope>C1K</scope><scope>F1W</scope><scope>H96</scope><scope>L.G</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope><orcidid>https://orcid.org/0000-0002-6296-8637</orcidid></search><sort><creationdate>201602</creationdate><title>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</title><author>Zhu, Junfeng ; Pierskalla, William P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Classifiers</topic><topic>Depression</topic><topic>Karst</topic><topic>LiDAR</topic><topic>Machine learning</topic><topic>Sinkhole</topic><topic>Sinkholes</topic><topic>Surface water</topic><topic>Training</topic><topic>Visual inspection</topic><topic>Weighted random forests</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Junfeng</creatorcontrib><creatorcontrib>Pierskalla, William P.</creatorcontrib><collection>CrossRef</collection><collection>Aqualine</collection><collection>Water Resources Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Journal of hydrology (Amsterdam)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhu, Junfeng</au><au>Pierskalla, William P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</atitle><jtitle>Journal of hydrology (Amsterdam)</jtitle><date>2016-02</date><risdate>2016</risdate><volume>533</volume><spage>343</spage><epage>352</epage><pages>343-352</pages><issn>0022-1694</issn><eissn>1879-2707</eissn><abstract>•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier. Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.jhydrol.2015.12.012</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-6296-8637</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0022-1694
ispartof Journal of hydrology (Amsterdam), 2016-02, Vol.533, p.343-352
issn 0022-1694
1879-2707
language eng
recordid cdi_proquest_miscellaneous_1786174347
source Elsevier ScienceDirect Journals
subjects Classifiers
Depression
Karst
LiDAR
Machine learning
Sinkhole
Sinkholes
Surface water
Training
Visual inspection
Weighted random forests
title Applying a weighted random forests method to extract karst sinkholes from LiDAR data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-16T09%3A24%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Applying%20a%20weighted%20random%20forests%20method%20to%20extract%20karst%20sinkholes%20from%20LiDAR%20data&rft.jtitle=Journal%20of%20hydrology%20(Amsterdam)&rft.au=Zhu,%20Junfeng&rft.date=2016-02&rft.volume=533&rft.spage=343&rft.epage=352&rft.pages=343-352&rft.issn=0022-1694&rft.eissn=1879-2707&rft_id=info:doi/10.1016/j.jhydrol.2015.12.012&rft_dat=%3Cproquest_cross%3E1786174347%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1762364362&rft_id=info:pmid/&rft_els_id=S0022169415009518&rfr_iscdi=true