Impact of multi-scale predictor selection for modeling soil properties
Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data ava...
Gespeichert in:
Veröffentlicht in: | Geoderma 2015-02, Vol.239-240, p.97-106 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 106 |
---|---|
container_issue | |
container_start_page | 97 |
container_title | Geoderma |
container_volume | 239-240 |
creator | Miller, Bradley A. Koszinski, Sylvia Wehrhan, Marc Sommer, Michael |
description | Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales.
•Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive. |
doi_str_mv | 10.1016/j.geoderma.2014.09.018 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1669857778</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0016706114003504</els_id><sourcerecordid>1669857778</sourcerecordid><originalsourceid>FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</originalsourceid><addsrcrecordid>eNqNkE1LAzEQhoMoWKt_QfboZdfJbj42N6VYLRS86Dmk2dmSkt3UZCv4702pnhUGhoHnnWEeQm4pVBSouN9VWwwdxsFUNVBWgaqAtmdkRltZl6Lm6pzMIJOlBEEvyVVKuzxKqGFGlqthb-xUhL4YDn5yZbLGY7GP2Dk7hVgk9GgnF8aiz9OQD3k3bosUnM9U2GOcHKZrctEbn_Dmp8_J-_LpbfFSrl-fV4vHdWkY0Km0EkXTIVJljJEbBAVG8p7zHjg3jVC07UzNOduYnoGix-o2girJDBOyaebk7rQ3n_44YJr04JJF782I4ZA0FUK1XErZ_gcFBkwKmlFxQm0MKUXs9T66wcQvTUEfHeud_nWsj441KJ0d5-DDKYj550-HUSfrcLRZXszWdBfcXyu-AVg1iB8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1660404761</pqid></control><display><type>article</type><title>Impact of multi-scale predictor selection for modeling soil properties</title><source>Elsevier ScienceDirect Journals</source><creator>Miller, Bradley A. ; Koszinski, Sylvia ; Wehrhan, Marc ; Sommer, Michael</creator><creatorcontrib>Miller, Bradley A. ; Koszinski, Sylvia ; Wehrhan, Marc ; Sommer, Michael</creatorcontrib><description>Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales.
•Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.</description><identifier>ISSN: 0016-7061</identifier><identifier>EISSN: 1872-6259</identifier><identifier>DOI: 10.1016/j.geoderma.2014.09.018</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Analysis scale ; Derivatives ; Digital ; Digital soil mapping ; Digital terrain analysis ; Hydrology ; Mathematical models ; Multiscale ; Pools ; Predictor variables ; Remote sensing ; Scale effect ; Soil mapping</subject><ispartof>Geoderma, 2015-02, Vol.239-240, p.97-106</ispartof><rights>2014 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</citedby><cites>FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</cites><orcidid>0000-0001-8194-123X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0016706114003504$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3536,27903,27904,65309</link.rule.ids></links><search><creatorcontrib>Miller, Bradley A.</creatorcontrib><creatorcontrib>Koszinski, Sylvia</creatorcontrib><creatorcontrib>Wehrhan, Marc</creatorcontrib><creatorcontrib>Sommer, Michael</creatorcontrib><title>Impact of multi-scale predictor selection for modeling soil properties</title><title>Geoderma</title><description>Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales.
•Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.</description><subject>Analysis scale</subject><subject>Derivatives</subject><subject>Digital</subject><subject>Digital soil mapping</subject><subject>Digital terrain analysis</subject><subject>Hydrology</subject><subject>Mathematical models</subject><subject>Multiscale</subject><subject>Pools</subject><subject>Predictor variables</subject><subject>Remote sensing</subject><subject>Scale effect</subject><subject>Soil mapping</subject><issn>0016-7061</issn><issn>1872-6259</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNqNkE1LAzEQhoMoWKt_QfboZdfJbj42N6VYLRS86Dmk2dmSkt3UZCv4702pnhUGhoHnnWEeQm4pVBSouN9VWwwdxsFUNVBWgaqAtmdkRltZl6Lm6pzMIJOlBEEvyVVKuzxKqGFGlqthb-xUhL4YDn5yZbLGY7GP2Dk7hVgk9GgnF8aiz9OQD3k3bosUnM9U2GOcHKZrctEbn_Dmp8_J-_LpbfFSrl-fV4vHdWkY0Km0EkXTIVJljJEbBAVG8p7zHjg3jVC07UzNOduYnoGix-o2girJDBOyaebk7rQ3n_44YJr04JJF782I4ZA0FUK1XErZ_gcFBkwKmlFxQm0MKUXs9T66wcQvTUEfHeud_nWsj441KJ0d5-DDKYj550-HUSfrcLRZXszWdBfcXyu-AVg1iB8</recordid><startdate>201502</startdate><enddate>201502</enddate><creator>Miller, Bradley A.</creator><creator>Koszinski, Sylvia</creator><creator>Wehrhan, Marc</creator><creator>Sommer, Michael</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7UA</scope><scope>C1K</scope><scope>F1W</scope><scope>H96</scope><scope>L.G</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope><orcidid>https://orcid.org/0000-0001-8194-123X</orcidid></search><sort><creationdate>201502</creationdate><title>Impact of multi-scale predictor selection for modeling soil properties</title><author>Miller, Bradley A. ; Koszinski, Sylvia ; Wehrhan, Marc ; Sommer, Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Analysis scale</topic><topic>Derivatives</topic><topic>Digital</topic><topic>Digital soil mapping</topic><topic>Digital terrain analysis</topic><topic>Hydrology</topic><topic>Mathematical models</topic><topic>Multiscale</topic><topic>Pools</topic><topic>Predictor variables</topic><topic>Remote sensing</topic><topic>Scale effect</topic><topic>Soil mapping</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Miller, Bradley A.</creatorcontrib><creatorcontrib>Koszinski, Sylvia</creatorcontrib><creatorcontrib>Wehrhan, Marc</creatorcontrib><creatorcontrib>Sommer, Michael</creatorcontrib><collection>CrossRef</collection><collection>Water Resources Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) Professional</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Geoderma</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Miller, Bradley A.</au><au>Koszinski, Sylvia</au><au>Wehrhan, Marc</au><au>Sommer, Michael</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Impact of multi-scale predictor selection for modeling soil properties</atitle><jtitle>Geoderma</jtitle><date>2015-02</date><risdate>2015</risdate><volume>239-240</volume><spage>97</spage><epage>106</epage><pages>97-106</pages><issn>0016-7061</issn><eissn>1872-6259</eissn><abstract>Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales.
•Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.geoderma.2014.09.018</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-8194-123X</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0016-7061 |
ispartof | Geoderma, 2015-02, Vol.239-240, p.97-106 |
issn | 0016-7061 1872-6259 |
language | eng |
recordid | cdi_proquest_miscellaneous_1669857778 |
source | Elsevier ScienceDirect Journals |
subjects | Analysis scale Derivatives Digital Digital soil mapping Digital terrain analysis Hydrology Mathematical models Multiscale Pools Predictor variables Remote sensing Scale effect Soil mapping |
title | Impact of multi-scale predictor selection for modeling soil properties |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T22%3A39%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Impact%20of%20multi-scale%20predictor%20selection%20for%20modeling%20soil%20properties&rft.jtitle=Geoderma&rft.au=Miller,%20Bradley%20A.&rft.date=2015-02&rft.volume=239-240&rft.spage=97&rft.epage=106&rft.pages=97-106&rft.issn=0016-7061&rft.eissn=1872-6259&rft_id=info:doi/10.1016/j.geoderma.2014.09.018&rft_dat=%3Cproquest_cross%3E1669857778%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1660404761&rft_id=info:pmid/&rft_els_id=S0016706114003504&rfr_iscdi=true |