Statistical modelling of measurement error in wet chemistry soil data

There is a growing demand for high‐quality soil data. However, soil measurements are subject to many error sources. We aimed to quantify uncertainties in synthetic and real‐world wet chemistry soil data through a linear mixed‐effects model, including batch and laboratory effects. The use of syntheti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:European journal of soil science 2022-01, Vol.73 (1), p.n/a
Hauptverfasser: van Leeuwen, Cynthia C. E., Mulder, Vera L., Batjes, Niels H., Heuvelink, Gerard B. M.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 1
container_start_page
container_title European journal of soil science
container_volume 73
creator van Leeuwen, Cynthia C. E.
Mulder, Vera L.
Batjes, Niels H.
Heuvelink, Gerard B. M.
description There is a growing demand for high‐quality soil data. However, soil measurements are subject to many error sources. We aimed to quantify uncertainties in synthetic and real‐world wet chemistry soil data through a linear mixed‐effects model, including batch and laboratory effects. The use of synthetic data allowed us to investigate how accurately the model parameters were estimated for various experimental measurement designs, whereas the real‐world case served to explore if estimates of the random effect variances were still accurate for unbalanced datasets with few replicates. The variance estimates for synthetic pHH2O data were unbiased, but limited laboratory information led to imprecise estimates. The same was observed for unbalanced synthetic datasets, where 20, 50 and 80% of the data were removed randomly. Removal led to a sharp increase of the interquartile range (IQR) of the variance estimates for batch effect and the residual. The model was also fitted to real‐world pHH2O and total organic carbon (TOC) data, provided by the Wageningen Evaluating Programmes for Analytical Laboratories (WEPAL). For pHH2O, the model yielded unbiased estimates with relatively small IQRs. However, the limited number of batches with replicate measurements (5.8%) caused the batch effect to be larger than expected. A strong negative correlation between batch effect and residual variance suggested that the model could not distinguish well between these two random effects. For TOC, batch effect was removed from the model as no replicates were available within batches. Again, unbiased model estimates were obtained. However, the IQRs were relatively large, which could be attributed to the smaller dataset with only a single replicate measurement. Our findings demonstrated the importance of experimental measurement design and replicate measurements in the quantification of uncertainties in wet chemistry soil data. Highlights Accurate uncertainty quantification depends on the experimental measurement design. Linear mixed‐effects models can be used as a tool to quantify uncertainty in wet chemistry soil data. Lack of replicate measurements leads to poor estimates of error variance components. Measurement error in wet chemistry soil data should not be ignored.
doi_str_mv 10.1111/ejss.13137
format Article
fullrecord <record><control><sourceid>wiley_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1111_ejss_13137</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EJSS13137</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3097-5521de9a2603da001428cc610afc82908f610f42bac575516e2df3c35dde24ce3</originalsourceid><addsrcrecordid>eNp9kE1LxDAQhoMouK5e_AU5C13z0bTNUZb6xYKH6jnEZKJZ0laSyNJ_b-t6di7zHp53GB6ErinZ0HluYZ_ShnLK6xO0orwSBeONPF2yoAWpRXmOLlLaEzJDUq5Q22Wdfcre6ID70UIIfvjAo8M96PQdoYchY4hxjNgP-AAZm0_o50accBp9wFZnfYnOnA4Jrv72Gr3dt6_bx2L38vC0vdsVhhNZF0IwakFqVhFu9fxDyRpjKkq0Mw2TpHFzdiV710bUQtAKmHXccGEtsNIAX6Ob410Tx5QiOPUVfa_jpChRiwC1CFC_AmaYHuGDDzD9Q6r2ueuOnR8YHl6C</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Statistical modelling of measurement error in wet chemistry soil data</title><source>Access via Wiley Online Library</source><creator>van Leeuwen, Cynthia C. E. ; Mulder, Vera L. ; Batjes, Niels H. ; Heuvelink, Gerard B. M.</creator><creatorcontrib>van Leeuwen, Cynthia C. E. ; Mulder, Vera L. ; Batjes, Niels H. ; Heuvelink, Gerard B. M.</creatorcontrib><description>There is a growing demand for high‐quality soil data. However, soil measurements are subject to many error sources. We aimed to quantify uncertainties in synthetic and real‐world wet chemistry soil data through a linear mixed‐effects model, including batch and laboratory effects. The use of synthetic data allowed us to investigate how accurately the model parameters were estimated for various experimental measurement designs, whereas the real‐world case served to explore if estimates of the random effect variances were still accurate for unbalanced datasets with few replicates. The variance estimates for synthetic pHH2O data were unbiased, but limited laboratory information led to imprecise estimates. The same was observed for unbalanced synthetic datasets, where 20, 50 and 80% of the data were removed randomly. Removal led to a sharp increase of the interquartile range (IQR) of the variance estimates for batch effect and the residual. The model was also fitted to real‐world pHH2O and total organic carbon (TOC) data, provided by the Wageningen Evaluating Programmes for Analytical Laboratories (WEPAL). For pHH2O, the model yielded unbiased estimates with relatively small IQRs. However, the limited number of batches with replicate measurements (5.8%) caused the batch effect to be larger than expected. A strong negative correlation between batch effect and residual variance suggested that the model could not distinguish well between these two random effects. For TOC, batch effect was removed from the model as no replicates were available within batches. Again, unbiased model estimates were obtained. However, the IQRs were relatively large, which could be attributed to the smaller dataset with only a single replicate measurement. Our findings demonstrated the importance of experimental measurement design and replicate measurements in the quantification of uncertainties in wet chemistry soil data. Highlights Accurate uncertainty quantification depends on the experimental measurement design. Linear mixed‐effects models can be used as a tool to quantify uncertainty in wet chemistry soil data. Lack of replicate measurements leads to poor estimates of error variance components. Measurement error in wet chemistry soil data should not be ignored.</description><identifier>ISSN: 1351-0754</identifier><identifier>EISSN: 1365-2389</identifier><identifier>DOI: 10.1111/ejss.13137</identifier><language>eng</language><publisher>Oxford, UK: Blackwell Publishing Ltd</publisher><subject>accuracy ; experimental design ; linear mixed‐effects model ; replicate measurements ; soil chemical data ; uncertainty</subject><ispartof>European journal of soil science, 2022-01, Vol.73 (1), p.n/a</ispartof><rights>2021 The Authors. published by John Wiley &amp; Sons Ltd on behalf of British Society of Soil Science.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3097-5521de9a2603da001428cc610afc82908f610f42bac575516e2df3c35dde24ce3</citedby><cites>FETCH-LOGICAL-c3097-5521de9a2603da001428cc610afc82908f610f42bac575516e2df3c35dde24ce3</cites><orcidid>0000-0003-3108-2136 ; 0000-0003-2367-3067 ; 0000-0003-4936-0077 ; 0000-0003-0959-9358</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1111%2Fejss.13137$$EPDF$$P50$$Gwiley$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1111%2Fejss.13137$$EHTML$$P50$$Gwiley$$Hfree_for_read</linktohtml><link.rule.ids>315,781,785,1418,27929,27930,45579,45580</link.rule.ids></links><search><creatorcontrib>van Leeuwen, Cynthia C. E.</creatorcontrib><creatorcontrib>Mulder, Vera L.</creatorcontrib><creatorcontrib>Batjes, Niels H.</creatorcontrib><creatorcontrib>Heuvelink, Gerard B. M.</creatorcontrib><title>Statistical modelling of measurement error in wet chemistry soil data</title><title>European journal of soil science</title><description>There is a growing demand for high‐quality soil data. However, soil measurements are subject to many error sources. We aimed to quantify uncertainties in synthetic and real‐world wet chemistry soil data through a linear mixed‐effects model, including batch and laboratory effects. The use of synthetic data allowed us to investigate how accurately the model parameters were estimated for various experimental measurement designs, whereas the real‐world case served to explore if estimates of the random effect variances were still accurate for unbalanced datasets with few replicates. The variance estimates for synthetic pHH2O data were unbiased, but limited laboratory information led to imprecise estimates. The same was observed for unbalanced synthetic datasets, where 20, 50 and 80% of the data were removed randomly. Removal led to a sharp increase of the interquartile range (IQR) of the variance estimates for batch effect and the residual. The model was also fitted to real‐world pHH2O and total organic carbon (TOC) data, provided by the Wageningen Evaluating Programmes for Analytical Laboratories (WEPAL). For pHH2O, the model yielded unbiased estimates with relatively small IQRs. However, the limited number of batches with replicate measurements (5.8%) caused the batch effect to be larger than expected. A strong negative correlation between batch effect and residual variance suggested that the model could not distinguish well between these two random effects. For TOC, batch effect was removed from the model as no replicates were available within batches. Again, unbiased model estimates were obtained. However, the IQRs were relatively large, which could be attributed to the smaller dataset with only a single replicate measurement. Our findings demonstrated the importance of experimental measurement design and replicate measurements in the quantification of uncertainties in wet chemistry soil data. Highlights Accurate uncertainty quantification depends on the experimental measurement design. Linear mixed‐effects models can be used as a tool to quantify uncertainty in wet chemistry soil data. Lack of replicate measurements leads to poor estimates of error variance components. Measurement error in wet chemistry soil data should not be ignored.</description><subject>accuracy</subject><subject>experimental design</subject><subject>linear mixed‐effects model</subject><subject>replicate measurements</subject><subject>soil chemical data</subject><subject>uncertainty</subject><issn>1351-0754</issn><issn>1365-2389</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>WIN</sourceid><recordid>eNp9kE1LxDAQhoMouK5e_AU5C13z0bTNUZb6xYKH6jnEZKJZ0laSyNJ_b-t6di7zHp53GB6ErinZ0HluYZ_ShnLK6xO0orwSBeONPF2yoAWpRXmOLlLaEzJDUq5Q22Wdfcre6ID70UIIfvjAo8M96PQdoYchY4hxjNgP-AAZm0_o50accBp9wFZnfYnOnA4Jrv72Gr3dt6_bx2L38vC0vdsVhhNZF0IwakFqVhFu9fxDyRpjKkq0Mw2TpHFzdiV710bUQtAKmHXccGEtsNIAX6Ob410Tx5QiOPUVfa_jpChRiwC1CFC_AmaYHuGDDzD9Q6r2ueuOnR8YHl6C</recordid><startdate>202201</startdate><enddate>202201</enddate><creator>van Leeuwen, Cynthia C. E.</creator><creator>Mulder, Vera L.</creator><creator>Batjes, Niels H.</creator><creator>Heuvelink, Gerard B. M.</creator><general>Blackwell Publishing Ltd</general><scope>24P</scope><scope>WIN</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-3108-2136</orcidid><orcidid>https://orcid.org/0000-0003-2367-3067</orcidid><orcidid>https://orcid.org/0000-0003-4936-0077</orcidid><orcidid>https://orcid.org/0000-0003-0959-9358</orcidid></search><sort><creationdate>202201</creationdate><title>Statistical modelling of measurement error in wet chemistry soil data</title><author>van Leeuwen, Cynthia C. E. ; Mulder, Vera L. ; Batjes, Niels H. ; Heuvelink, Gerard B. M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3097-5521de9a2603da001428cc610afc82908f610f42bac575516e2df3c35dde24ce3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>accuracy</topic><topic>experimental design</topic><topic>linear mixed‐effects model</topic><topic>replicate measurements</topic><topic>soil chemical data</topic><topic>uncertainty</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>van Leeuwen, Cynthia C. E.</creatorcontrib><creatorcontrib>Mulder, Vera L.</creatorcontrib><creatorcontrib>Batjes, Niels H.</creatorcontrib><creatorcontrib>Heuvelink, Gerard B. M.</creatorcontrib><collection>Wiley Online Library (Open Access Collection)</collection><collection>Wiley Online Library (Open Access Collection)</collection><collection>CrossRef</collection><jtitle>European journal of soil science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>van Leeuwen, Cynthia C. E.</au><au>Mulder, Vera L.</au><au>Batjes, Niels H.</au><au>Heuvelink, Gerard B. M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Statistical modelling of measurement error in wet chemistry soil data</atitle><jtitle>European journal of soil science</jtitle><date>2022-01</date><risdate>2022</risdate><volume>73</volume><issue>1</issue><epage>n/a</epage><issn>1351-0754</issn><eissn>1365-2389</eissn><abstract>There is a growing demand for high‐quality soil data. However, soil measurements are subject to many error sources. We aimed to quantify uncertainties in synthetic and real‐world wet chemistry soil data through a linear mixed‐effects model, including batch and laboratory effects. The use of synthetic data allowed us to investigate how accurately the model parameters were estimated for various experimental measurement designs, whereas the real‐world case served to explore if estimates of the random effect variances were still accurate for unbalanced datasets with few replicates. The variance estimates for synthetic pHH2O data were unbiased, but limited laboratory information led to imprecise estimates. The same was observed for unbalanced synthetic datasets, where 20, 50 and 80% of the data were removed randomly. Removal led to a sharp increase of the interquartile range (IQR) of the variance estimates for batch effect and the residual. The model was also fitted to real‐world pHH2O and total organic carbon (TOC) data, provided by the Wageningen Evaluating Programmes for Analytical Laboratories (WEPAL). For pHH2O, the model yielded unbiased estimates with relatively small IQRs. However, the limited number of batches with replicate measurements (5.8%) caused the batch effect to be larger than expected. A strong negative correlation between batch effect and residual variance suggested that the model could not distinguish well between these two random effects. For TOC, batch effect was removed from the model as no replicates were available within batches. Again, unbiased model estimates were obtained. However, the IQRs were relatively large, which could be attributed to the smaller dataset with only a single replicate measurement. Our findings demonstrated the importance of experimental measurement design and replicate measurements in the quantification of uncertainties in wet chemistry soil data. Highlights Accurate uncertainty quantification depends on the experimental measurement design. Linear mixed‐effects models can be used as a tool to quantify uncertainty in wet chemistry soil data. Lack of replicate measurements leads to poor estimates of error variance components. Measurement error in wet chemistry soil data should not be ignored.</abstract><cop>Oxford, UK</cop><pub>Blackwell Publishing Ltd</pub><doi>10.1111/ejss.13137</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0003-3108-2136</orcidid><orcidid>https://orcid.org/0000-0003-2367-3067</orcidid><orcidid>https://orcid.org/0000-0003-4936-0077</orcidid><orcidid>https://orcid.org/0000-0003-0959-9358</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1351-0754
ispartof European journal of soil science, 2022-01, Vol.73 (1), p.n/a
issn 1351-0754
1365-2389
language eng
recordid cdi_crossref_primary_10_1111_ejss_13137
source Access via Wiley Online Library
subjects accuracy
experimental design
linear mixed‐effects model
replicate measurements
soil chemical data
uncertainty
title Statistical modelling of measurement error in wet chemistry soil data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T11%3A32%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wiley_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Statistical%20modelling%20of%20measurement%20error%20in%20wet%20chemistry%20soil%20data&rft.jtitle=European%20journal%20of%20soil%20science&rft.au=van%20Leeuwen,%20Cynthia%20C.%20E.&rft.date=2022-01&rft.volume=73&rft.issue=1&rft.epage=n/a&rft.issn=1351-0754&rft.eissn=1365-2389&rft_id=info:doi/10.1111/ejss.13137&rft_dat=%3Cwiley_cross%3EEJSS13137%3C/wiley_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true