Impact of multi-scale predictor selection for modeling soil properties

Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data ava...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Geoderma 2015-02, Vol.239-240, p.97-106
Hauptverfasser: Miller, Bradley A., Koszinski, Sylvia, Wehrhan, Marc, Sommer, Michael
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 106
container_issue
container_start_page 97
container_title Geoderma
container_volume 239-240
creator Miller, Bradley A.
Koszinski, Sylvia
Wehrhan, Marc
Sommer, Michael
description Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales. •Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.
doi_str_mv 10.1016/j.geoderma.2014.09.018
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1669857778</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0016706114003504</els_id><sourcerecordid>1669857778</sourcerecordid><originalsourceid>FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</originalsourceid><addsrcrecordid>eNqNkE1LAzEQhoMoWKt_QfboZdfJbj42N6VYLRS86Dmk2dmSkt3UZCv4702pnhUGhoHnnWEeQm4pVBSouN9VWwwdxsFUNVBWgaqAtmdkRltZl6Lm6pzMIJOlBEEvyVVKuzxKqGFGlqthb-xUhL4YDn5yZbLGY7GP2Dk7hVgk9GgnF8aiz9OQD3k3bosUnM9U2GOcHKZrctEbn_Dmp8_J-_LpbfFSrl-fV4vHdWkY0Km0EkXTIVJljJEbBAVG8p7zHjg3jVC07UzNOduYnoGix-o2girJDBOyaebk7rQ3n_44YJr04JJF782I4ZA0FUK1XErZ_gcFBkwKmlFxQm0MKUXs9T66wcQvTUEfHeud_nWsj441KJ0d5-DDKYj550-HUSfrcLRZXszWdBfcXyu-AVg1iB8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1660404761</pqid></control><display><type>article</type><title>Impact of multi-scale predictor selection for modeling soil properties</title><source>Elsevier ScienceDirect Journals</source><creator>Miller, Bradley A. ; Koszinski, Sylvia ; Wehrhan, Marc ; Sommer, Michael</creator><creatorcontrib>Miller, Bradley A. ; Koszinski, Sylvia ; Wehrhan, Marc ; Sommer, Michael</creatorcontrib><description>Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales. •Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.</description><identifier>ISSN: 0016-7061</identifier><identifier>EISSN: 1872-6259</identifier><identifier>DOI: 10.1016/j.geoderma.2014.09.018</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Analysis scale ; Derivatives ; Digital ; Digital soil mapping ; Digital terrain analysis ; Hydrology ; Mathematical models ; Multiscale ; Pools ; Predictor variables ; Remote sensing ; Scale effect ; Soil mapping</subject><ispartof>Geoderma, 2015-02, Vol.239-240, p.97-106</ispartof><rights>2014 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</citedby><cites>FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</cites><orcidid>0000-0001-8194-123X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0016706114003504$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3536,27903,27904,65309</link.rule.ids></links><search><creatorcontrib>Miller, Bradley A.</creatorcontrib><creatorcontrib>Koszinski, Sylvia</creatorcontrib><creatorcontrib>Wehrhan, Marc</creatorcontrib><creatorcontrib>Sommer, Michael</creatorcontrib><title>Impact of multi-scale predictor selection for modeling soil properties</title><title>Geoderma</title><description>Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales. •Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.</description><subject>Analysis scale</subject><subject>Derivatives</subject><subject>Digital</subject><subject>Digital soil mapping</subject><subject>Digital terrain analysis</subject><subject>Hydrology</subject><subject>Mathematical models</subject><subject>Multiscale</subject><subject>Pools</subject><subject>Predictor variables</subject><subject>Remote sensing</subject><subject>Scale effect</subject><subject>Soil mapping</subject><issn>0016-7061</issn><issn>1872-6259</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNqNkE1LAzEQhoMoWKt_QfboZdfJbj42N6VYLRS86Dmk2dmSkt3UZCv4702pnhUGhoHnnWEeQm4pVBSouN9VWwwdxsFUNVBWgaqAtmdkRltZl6Lm6pzMIJOlBEEvyVVKuzxKqGFGlqthb-xUhL4YDn5yZbLGY7GP2Dk7hVgk9GgnF8aiz9OQD3k3bosUnM9U2GOcHKZrctEbn_Dmp8_J-_LpbfFSrl-fV4vHdWkY0Km0EkXTIVJljJEbBAVG8p7zHjg3jVC07UzNOduYnoGix-o2girJDBOyaebk7rQ3n_44YJr04JJF782I4ZA0FUK1XErZ_gcFBkwKmlFxQm0MKUXs9T66wcQvTUEfHeud_nWsj441KJ0d5-DDKYj550-HUSfrcLRZXszWdBfcXyu-AVg1iB8</recordid><startdate>201502</startdate><enddate>201502</enddate><creator>Miller, Bradley A.</creator><creator>Koszinski, Sylvia</creator><creator>Wehrhan, Marc</creator><creator>Sommer, Michael</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7UA</scope><scope>C1K</scope><scope>F1W</scope><scope>H96</scope><scope>L.G</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope><orcidid>https://orcid.org/0000-0001-8194-123X</orcidid></search><sort><creationdate>201502</creationdate><title>Impact of multi-scale predictor selection for modeling soil properties</title><author>Miller, Bradley A. ; Koszinski, Sylvia ; Wehrhan, Marc ; Sommer, Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a401t-c7e63dee19aaa7be090a75f55f055a36918da2554baf40910910db61974a46733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Analysis scale</topic><topic>Derivatives</topic><topic>Digital</topic><topic>Digital soil mapping</topic><topic>Digital terrain analysis</topic><topic>Hydrology</topic><topic>Mathematical models</topic><topic>Multiscale</topic><topic>Pools</topic><topic>Predictor variables</topic><topic>Remote sensing</topic><topic>Scale effect</topic><topic>Soil mapping</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Miller, Bradley A.</creatorcontrib><creatorcontrib>Koszinski, Sylvia</creatorcontrib><creatorcontrib>Wehrhan, Marc</creatorcontrib><creatorcontrib>Sommer, Michael</creatorcontrib><collection>CrossRef</collection><collection>Water Resources Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Geoderma</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Miller, Bradley A.</au><au>Koszinski, Sylvia</au><au>Wehrhan, Marc</au><au>Sommer, Michael</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Impact of multi-scale predictor selection for modeling soil properties</atitle><jtitle>Geoderma</jtitle><date>2015-02</date><risdate>2015</risdate><volume>239-240</volume><spage>97</spage><epage>106</epage><pages>97-106</pages><issn>0016-7061</issn><eissn>1872-6259</eissn><abstract>Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales. •Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.geoderma.2014.09.018</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-8194-123X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0016-7061
ispartof Geoderma, 2015-02, Vol.239-240, p.97-106
issn 0016-7061
1872-6259
language eng
recordid cdi_proquest_miscellaneous_1669857778
source Elsevier ScienceDirect Journals
subjects Analysis scale
Derivatives
Digital
Digital soil mapping
Digital terrain analysis
Hydrology
Mathematical models
Multiscale
Pools
Predictor variables
Remote sensing
Scale effect
Soil mapping
title Impact of multi-scale predictor selection for modeling soil properties
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T22%3A39%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Impact%20of%20multi-scale%20predictor%20selection%20for%20modeling%20soil%20properties&rft.jtitle=Geoderma&rft.au=Miller,%20Bradley%20A.&rft.date=2015-02&rft.volume=239-240&rft.spage=97&rft.epage=106&rft.pages=97-106&rft.issn=0016-7061&rft.eissn=1872-6259&rft_id=info:doi/10.1016/j.geoderma.2014.09.018&rft_dat=%3Cproquest_cross%3E1669857778%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1660404761&rft_id=info:pmid/&rft_els_id=S0016706114003504&rfr_iscdi=true