Applying a weighted random forests method to extract karst sinkholes from LiDAR data
•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions impr...
Gespeichert in:
Veröffentlicht in: | Journal of hydrology (Amsterdam) 2016-02, Vol.533, p.343-352 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 352 |
---|---|
container_issue | |
container_start_page | 343 |
container_title | Journal of hydrology (Amsterdam) |
container_volume | 533 |
creator | Zhu, Junfeng Pierskalla, William P. |
description | •We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier.
Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification. |
doi_str_mv | 10.1016/j.jhydrol.2015.12.012 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1786174347</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0022169415009518</els_id><sourcerecordid>1786174347</sourcerecordid><originalsourceid>FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</originalsourceid><addsrcrecordid>eNqNkEtPwzAQhC0EEqXwE5B85JLgRxLHJ1SVp1QJCZWz5dhO4zSJg20e_fekau-wh93LzGjnA-AaoxQjXNy2advstHddShDOU0xShMkJmOGS8YQwxE7BDCFCElzw7BxchNCiaSjNZmC9GMduZ4cNlPDb2E0TjYZeDtr1sHbehBhgb2LjNIwOmp_opYpwK32IMNhh27jOBFj7Sb6y94s3qGWUl-Csll0wV8c7B--PD-vlc7J6fXpZLlaJpLyMiSxpiTitc55LZaTONedVVmFOKUaUVDUvq4xVEqOClajiZa7qqUZeS6V0JQs6BzeH3NG7j8_pV9HboEzXycG4zyAwKwvMMpqxf0gLQouMTnsO8oNUeReCN7UYve2l3wmMxB64aMURuNgDF5iICfjkuzv4zFT5yxovgrJmUEZbb1QU2tk_En4B_g2MOQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1762364362</pqid></control><display><type>article</type><title>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</title><source>Elsevier ScienceDirect Journals</source><creator>Zhu, Junfeng ; Pierskalla, William P.</creator><creatorcontrib>Zhu, Junfeng ; Pierskalla, William P.</creatorcontrib><description>•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier.
Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification.</description><identifier>ISSN: 0022-1694</identifier><identifier>EISSN: 1879-2707</identifier><identifier>DOI: 10.1016/j.jhydrol.2015.12.012</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Classifiers ; Depression ; Karst ; LiDAR ; Machine learning ; Sinkhole ; Sinkholes ; Surface water ; Training ; Visual inspection ; Weighted random forests</subject><ispartof>Journal of hydrology (Amsterdam), 2016-02, Vol.533, p.343-352</ispartof><rights>2015 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</citedby><cites>FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</cites><orcidid>0000-0002-6296-8637</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0022169415009518$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids></links><search><creatorcontrib>Zhu, Junfeng</creatorcontrib><creatorcontrib>Pierskalla, William P.</creatorcontrib><title>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</title><title>Journal of hydrology (Amsterdam)</title><description>•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier.
Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification.</description><subject>Classifiers</subject><subject>Depression</subject><subject>Karst</subject><subject>LiDAR</subject><subject>Machine learning</subject><subject>Sinkhole</subject><subject>Sinkholes</subject><subject>Surface water</subject><subject>Training</subject><subject>Visual inspection</subject><subject>Weighted random forests</subject><issn>0022-1694</issn><issn>1879-2707</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNqNkEtPwzAQhC0EEqXwE5B85JLgRxLHJ1SVp1QJCZWz5dhO4zSJg20e_fekau-wh93LzGjnA-AaoxQjXNy2advstHddShDOU0xShMkJmOGS8YQwxE7BDCFCElzw7BxchNCiaSjNZmC9GMduZ4cNlPDb2E0TjYZeDtr1sHbehBhgb2LjNIwOmp_opYpwK32IMNhh27jOBFj7Sb6y94s3qGWUl-Csll0wV8c7B--PD-vlc7J6fXpZLlaJpLyMiSxpiTitc55LZaTONedVVmFOKUaUVDUvq4xVEqOClajiZa7qqUZeS6V0JQs6BzeH3NG7j8_pV9HboEzXycG4zyAwKwvMMpqxf0gLQouMTnsO8oNUeReCN7UYve2l3wmMxB64aMURuNgDF5iICfjkuzv4zFT5yxovgrJmUEZbb1QU2tk_En4B_g2MOQ</recordid><startdate>201602</startdate><enddate>201602</enddate><creator>Zhu, Junfeng</creator><creator>Pierskalla, William P.</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QH</scope><scope>7UA</scope><scope>C1K</scope><scope>F1W</scope><scope>H96</scope><scope>L.G</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope><orcidid>https://orcid.org/0000-0002-6296-8637</orcidid></search><sort><creationdate>201602</creationdate><title>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</title><author>Zhu, Junfeng ; Pierskalla, William P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a398t-a838093f595acead5d99b4b19331032bf98b47ba106780b985cf0225faccdba63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Classifiers</topic><topic>Depression</topic><topic>Karst</topic><topic>LiDAR</topic><topic>Machine learning</topic><topic>Sinkhole</topic><topic>Sinkholes</topic><topic>Surface water</topic><topic>Training</topic><topic>Visual inspection</topic><topic>Weighted random forests</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Junfeng</creatorcontrib><creatorcontrib>Pierskalla, William P.</creatorcontrib><collection>CrossRef</collection><collection>Aqualine</collection><collection>Water Resources Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) Professional</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Journal of hydrology (Amsterdam)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhu, Junfeng</au><au>Pierskalla, William P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Applying a weighted random forests method to extract karst sinkholes from LiDAR data</atitle><jtitle>Journal of hydrology (Amsterdam)</jtitle><date>2016-02</date><risdate>2016</risdate><volume>533</volume><spage>343</spage><epage>352</epage><pages>343-352</pages><issn>0022-1694</issn><eissn>1879-2707</eissn><abstract>•We build a weighted random forests classifier to identify sinkholes.•We select depression geometry and natural/human factors as predictive variables.•Higher weight for the sinkhole class improved accuracy in identifying true sinkholes.•Including variables besides shape and depth of depressions improved the classifier.
Detailed mapping of sinkholes provides critical information for mitigating sinkhole hazards and understanding groundwater and surface water interactions in karst terrains. LiDAR (Light Detection and Ranging) measures the earth’s surface in high-resolution and high-density and has shown great potentials to drastically improve locating and delineating sinkholes. However, processing LiDAR data to extract sinkholes requires separating sinkholes from other depressions, which can be laborious because of the sheer number of the depressions commonly generated from LiDAR data. In this study, we applied the random forests, a machine learning method, to automatically separate sinkholes from other depressions in a karst region in central Kentucky. The sinkhole-extraction random forest was grown on a training dataset built from an area where LiDAR-derived depressions were manually classified through a visual inspection and field verification process. Based on the geometry of depressions, as well as natural and human factors related to sinkholes, 11 parameters were selected as predictive variables to form the dataset. Because the training dataset was imbalanced with the majority of depressions being non-sinkholes, a weighted random forests method was used to improve the accuracy of predicting sinkholes. The weighted random forest achieved an average accuracy of 89.95% for the training dataset, demonstrating that the random forest can be an effective sinkhole classifier. Testing of the random forest in another area, however, resulted in moderate success with an average accuracy rate of 73.96%. This study suggests that an automatic sinkhole extraction procedure like the random forest classifier can significantly reduce time and labor costs and makes its more tractable to map sinkholes using LiDAR data for large areas. However, the random forests method cannot totally replace manual procedures, such as visual inspection and field verification.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.jhydrol.2015.12.012</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-6296-8637</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0022-1694 |
ispartof | Journal of hydrology (Amsterdam), 2016-02, Vol.533, p.343-352 |
issn | 0022-1694 1879-2707 |
language | eng |
recordid | cdi_proquest_miscellaneous_1786174347 |
source | Elsevier ScienceDirect Journals |
subjects | Classifiers Depression Karst LiDAR Machine learning Sinkhole Sinkholes Surface water Training Visual inspection Weighted random forests |
title | Applying a weighted random forests method to extract karst sinkholes from LiDAR data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-16T09%3A24%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Applying%20a%20weighted%20random%20forests%20method%20to%20extract%20karst%20sinkholes%20from%20LiDAR%20data&rft.jtitle=Journal%20of%20hydrology%20(Amsterdam)&rft.au=Zhu,%20Junfeng&rft.date=2016-02&rft.volume=533&rft.spage=343&rft.epage=352&rft.pages=343-352&rft.issn=0022-1694&rft.eissn=1879-2707&rft_id=info:doi/10.1016/j.jhydrol.2015.12.012&rft_dat=%3Cproquest_cross%3E1786174347%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1762364362&rft_id=info:pmid/&rft_els_id=S0022169415009518&rfr_iscdi=true |