Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy
•96Tunisian soil samples were used to calibrate and validate SIC prediction models.•MIR absorption peak-based LR and full spectra-based PLSR models were used.•Both types of model were tested on 2178French soil samples for SIC prediction.•Peak at 2510 cm−1 on Test soils samples was used to select sui...
Gespeichert in:
Veröffentlicht in: | Geoderma 2022-01, Vol.405, p.115403, Article 115403 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 115403 |
container_title | Geoderma |
container_volume | 405 |
creator | Gomez, Cécile Chevallier, Tiphaine Moulin, Patricia Arrouays, Dominique Barthès, Bernard G. |
description | •96Tunisian soil samples were used to calibrate and validate SIC prediction models.•MIR absorption peak-based LR and full spectra-based PLSR models were used.•Both types of model were tested on 2178French soil samples for SIC prediction.•Peak at 2510 cm−1 on Test soils samples was used to select suitable model.•SIC was accurately predicted by a LR and PLSR coupling.
Mid-Infrared reflectance spectroscopy (MIRS, 4000–400 cm−1) is being considered to provide accurate estimations of soil inorganic carbon (SIC) contents, based on prediction models when the test dataset is well represented by the calibration set, with similar SIC range and distribution and pedological context. This work addresses the case where the test dataset, here originating from France, is poorly represented by the calibration set, here originating from Tunisia, with different SIC distributions and pedological contexts. It aimed to demonstrate the usefulness of 1) classifying test samples according to SIC level based on the height of the carbonate absorbance peak at 2510 cm−1, and then 2) selecting a suitable prediction model according to SIC level. Two regression methods were tested: Linear Regression using the height of the carbonate peak at 2510 cm−1, called Peak-LR model; and Partial Least Squares Regression using the entire MIR spectrum, called Full-PLSR model. First, our results showed that Full-PLSR was 1) more accurate than Peak-LR on the Tunisian validation set (R2val = 0.99 vs. 0.86 and RMSEval = 3.0 vs. 9.7 g kg−1, respectively), but 2) less accurate than Peak-LR when applied on the French dataset (R2test = 0.70 vs. 0.91 and RMSEtest = 13.7 vs. 4.9 g kg−1, respectively). Secondly, on the French dataset, predictions on SIC-poor samples tended to be more accurate using Peak-LR, while predictions on SIC-rich samples tended to be more accurate using Full-PLSR. Thirdly, the height of the carbonate absorbance peak at 2510 cm−1 might be used to discriminate SIC-poor and SIC-rich test samples ( 5 g kg−1): when this height was > 0, Full-PLSR was applied; otherwise Peak-LR was applied. Coupling Peak-LR and Full-PLSR models depending on the carbonate peak yielded the best predictions on the French dataset (R2test = 0.95 and RMSEtest = 3.7 g kg−1). This study underlined the interest of using a carbonate peak to select suitable regression approach for predicting SIC content in a database with different distribution than the calibration database. |
doi_str_mv | 10.1016/j.geoderma.2021.115403 |
format | Article |
fullrecord | <record><control><sourceid>hal_cross</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_03332031v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0016706121004833</els_id><sourcerecordid>oai_HAL_hal_03332031v1</sourcerecordid><originalsourceid>FETCH-LOGICAL-c394t-195a65560fef02bcf8c25fd3fa8bec041e86cb1d7ced92dcde70ada077291833</originalsourceid><addsrcrecordid>eNqFUctuWyEURFUjxXXzCxHbLq7Lw_e1q2W1cSVL2bhrdICDjXt9sYBG8jflJ8utnW6zAg4zc2Y0hDxytuCMN1-Piz0Gi_EEC8EEX3BeL5n8QGa8a0XViLr_SGasIKuWNfyefErpWJ4tE2xGXn8lP-6pgajDCBkp6BSihtEgPSP8pjnQhAOaTPMB6SmkTNMfn0EPSCPuI6bkw1g-LA5UowuxECNab_IknIIfqB9D3MPozW0PNaHojzlCnrj6Qk_eVn50EQqzyLpp4T8P6VxuMSQTzpfP5M7BkPDhds7J7sf33XpTbZ-ffq5X28rIfpkr3tfQ1HXDHDomtHGdEbWz0kGn0bAlx64xmtvWoO2FNRZbBhZY24qed1LOyZer7AEGdY7-BPGiAni1WW3VNGNSSsEkf-EF21yxpnhMxfh_Amdqakcd1Vs7ampHXdspxG9XIpYgLx6jSsZjSWx9LImVDf49ib9pl6Gu</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Gomez, Cécile ; Chevallier, Tiphaine ; Moulin, Patricia ; Arrouays, Dominique ; Barthès, Bernard G.</creator><creatorcontrib>Gomez, Cécile ; Chevallier, Tiphaine ; Moulin, Patricia ; Arrouays, Dominique ; Barthès, Bernard G.</creatorcontrib><description>•96Tunisian soil samples were used to calibrate and validate SIC prediction models.•MIR absorption peak-based LR and full spectra-based PLSR models were used.•Both types of model were tested on 2178French soil samples for SIC prediction.•Peak at 2510 cm−1 on Test soils samples was used to select suitable model.•SIC was accurately predicted by a LR and PLSR coupling.
Mid-Infrared reflectance spectroscopy (MIRS, 4000–400 cm−1) is being considered to provide accurate estimations of soil inorganic carbon (SIC) contents, based on prediction models when the test dataset is well represented by the calibration set, with similar SIC range and distribution and pedological context. This work addresses the case where the test dataset, here originating from France, is poorly represented by the calibration set, here originating from Tunisia, with different SIC distributions and pedological contexts. It aimed to demonstrate the usefulness of 1) classifying test samples according to SIC level based on the height of the carbonate absorbance peak at 2510 cm−1, and then 2) selecting a suitable prediction model according to SIC level. Two regression methods were tested: Linear Regression using the height of the carbonate peak at 2510 cm−1, called Peak-LR model; and Partial Least Squares Regression using the entire MIR spectrum, called Full-PLSR model. First, our results showed that Full-PLSR was 1) more accurate than Peak-LR on the Tunisian validation set (R2val = 0.99 vs. 0.86 and RMSEval = 3.0 vs. 9.7 g kg−1, respectively), but 2) less accurate than Peak-LR when applied on the French dataset (R2test = 0.70 vs. 0.91 and RMSEtest = 13.7 vs. 4.9 g kg−1, respectively). Secondly, on the French dataset, predictions on SIC-poor samples tended to be more accurate using Peak-LR, while predictions on SIC-rich samples tended to be more accurate using Full-PLSR. Thirdly, the height of the carbonate absorbance peak at 2510 cm−1 might be used to discriminate SIC-poor and SIC-rich test samples (<5 vs. > 5 g kg−1): when this height was > 0, Full-PLSR was applied; otherwise Peak-LR was applied. Coupling Peak-LR and Full-PLSR models depending on the carbonate peak yielded the best predictions on the French dataset (R2test = 0.95 and RMSEtest = 3.7 g kg−1). This study underlined the interest of using a carbonate peak to select suitable regression approach for predicting SIC content in a database with different distribution than the calibration database.</description><identifier>ISSN: 0016-7061</identifier><identifier>EISSN: 1872-6259</identifier><identifier>DOI: 10.1016/j.geoderma.2021.115403</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Agricultural sciences ; Environmental Sciences ; Life Sciences ; Linear regression ; Mid-infrared reflectance spectroscopy ; National dataset ; Partial least squares regression ; Soil inorganic carbon ; Soil study</subject><ispartof>Geoderma, 2022-01, Vol.405, p.115403, Article 115403</ispartof><rights>2021 Elsevier B.V.</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c394t-195a65560fef02bcf8c25fd3fa8bec041e86cb1d7ced92dcde70ada077291833</citedby><cites>FETCH-LOGICAL-c394t-195a65560fef02bcf8c25fd3fa8bec041e86cb1d7ced92dcde70ada077291833</cites><orcidid>0000-0001-8285-3856 ; 0000-0002-6878-6498 ; 0000-0002-2986-430X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.geoderma.2021.115403$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>230,314,780,784,885,3548,27923,27924,45994</link.rule.ids><backlink>$$Uhttps://hal.inrae.fr/hal-03332031$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Gomez, Cécile</creatorcontrib><creatorcontrib>Chevallier, Tiphaine</creatorcontrib><creatorcontrib>Moulin, Patricia</creatorcontrib><creatorcontrib>Arrouays, Dominique</creatorcontrib><creatorcontrib>Barthès, Bernard G.</creatorcontrib><title>Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy</title><title>Geoderma</title><description>•96Tunisian soil samples were used to calibrate and validate SIC prediction models.•MIR absorption peak-based LR and full spectra-based PLSR models were used.•Both types of model were tested on 2178French soil samples for SIC prediction.•Peak at 2510 cm−1 on Test soils samples was used to select suitable model.•SIC was accurately predicted by a LR and PLSR coupling.
Mid-Infrared reflectance spectroscopy (MIRS, 4000–400 cm−1) is being considered to provide accurate estimations of soil inorganic carbon (SIC) contents, based on prediction models when the test dataset is well represented by the calibration set, with similar SIC range and distribution and pedological context. This work addresses the case where the test dataset, here originating from France, is poorly represented by the calibration set, here originating from Tunisia, with different SIC distributions and pedological contexts. It aimed to demonstrate the usefulness of 1) classifying test samples according to SIC level based on the height of the carbonate absorbance peak at 2510 cm−1, and then 2) selecting a suitable prediction model according to SIC level. Two regression methods were tested: Linear Regression using the height of the carbonate peak at 2510 cm−1, called Peak-LR model; and Partial Least Squares Regression using the entire MIR spectrum, called Full-PLSR model. First, our results showed that Full-PLSR was 1) more accurate than Peak-LR on the Tunisian validation set (R2val = 0.99 vs. 0.86 and RMSEval = 3.0 vs. 9.7 g kg−1, respectively), but 2) less accurate than Peak-LR when applied on the French dataset (R2test = 0.70 vs. 0.91 and RMSEtest = 13.7 vs. 4.9 g kg−1, respectively). Secondly, on the French dataset, predictions on SIC-poor samples tended to be more accurate using Peak-LR, while predictions on SIC-rich samples tended to be more accurate using Full-PLSR. Thirdly, the height of the carbonate absorbance peak at 2510 cm−1 might be used to discriminate SIC-poor and SIC-rich test samples (<5 vs. > 5 g kg−1): when this height was > 0, Full-PLSR was applied; otherwise Peak-LR was applied. Coupling Peak-LR and Full-PLSR models depending on the carbonate peak yielded the best predictions on the French dataset (R2test = 0.95 and RMSEtest = 3.7 g kg−1). This study underlined the interest of using a carbonate peak to select suitable regression approach for predicting SIC content in a database with different distribution than the calibration database.</description><subject>Agricultural sciences</subject><subject>Environmental Sciences</subject><subject>Life Sciences</subject><subject>Linear regression</subject><subject>Mid-infrared reflectance spectroscopy</subject><subject>National dataset</subject><subject>Partial least squares regression</subject><subject>Soil inorganic carbon</subject><subject>Soil study</subject><issn>0016-7061</issn><issn>1872-6259</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNqFUctuWyEURFUjxXXzCxHbLq7Lw_e1q2W1cSVL2bhrdICDjXt9sYBG8jflJ8utnW6zAg4zc2Y0hDxytuCMN1-Piz0Gi_EEC8EEX3BeL5n8QGa8a0XViLr_SGasIKuWNfyefErpWJ4tE2xGXn8lP-6pgajDCBkp6BSihtEgPSP8pjnQhAOaTPMB6SmkTNMfn0EPSCPuI6bkw1g-LA5UowuxECNab_IknIIfqB9D3MPozW0PNaHojzlCnrj6Qk_eVn50EQqzyLpp4T8P6VxuMSQTzpfP5M7BkPDhds7J7sf33XpTbZ-ffq5X28rIfpkr3tfQ1HXDHDomtHGdEbWz0kGn0bAlx64xmtvWoO2FNRZbBhZY24qed1LOyZer7AEGdY7-BPGiAni1WW3VNGNSSsEkf-EF21yxpnhMxfh_Amdqakcd1Vs7ampHXdspxG9XIpYgLx6jSsZjSWx9LImVDf49ib9pl6Gu</recordid><startdate>20220101</startdate><enddate>20220101</enddate><creator>Gomez, Cécile</creator><creator>Chevallier, Tiphaine</creator><creator>Moulin, Patricia</creator><creator>Arrouays, Dominique</creator><creator>Barthès, Bernard G.</creator><general>Elsevier B.V</general><general>Elsevier</general><scope>AAYXX</scope><scope>CITATION</scope><scope>1XC</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0001-8285-3856</orcidid><orcidid>https://orcid.org/0000-0002-6878-6498</orcidid><orcidid>https://orcid.org/0000-0002-2986-430X</orcidid></search><sort><creationdate>20220101</creationdate><title>Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy</title><author>Gomez, Cécile ; Chevallier, Tiphaine ; Moulin, Patricia ; Arrouays, Dominique ; Barthès, Bernard G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c394t-195a65560fef02bcf8c25fd3fa8bec041e86cb1d7ced92dcde70ada077291833</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Agricultural sciences</topic><topic>Environmental Sciences</topic><topic>Life Sciences</topic><topic>Linear regression</topic><topic>Mid-infrared reflectance spectroscopy</topic><topic>National dataset</topic><topic>Partial least squares regression</topic><topic>Soil inorganic carbon</topic><topic>Soil study</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gomez, Cécile</creatorcontrib><creatorcontrib>Chevallier, Tiphaine</creatorcontrib><creatorcontrib>Moulin, Patricia</creatorcontrib><creatorcontrib>Arrouays, Dominique</creatorcontrib><creatorcontrib>Barthès, Bernard G.</creatorcontrib><collection>CrossRef</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>Geoderma</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gomez, Cécile</au><au>Chevallier, Tiphaine</au><au>Moulin, Patricia</au><au>Arrouays, Dominique</au><au>Barthès, Bernard G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy</atitle><jtitle>Geoderma</jtitle><date>2022-01-01</date><risdate>2022</risdate><volume>405</volume><spage>115403</spage><pages>115403-</pages><artnum>115403</artnum><issn>0016-7061</issn><eissn>1872-6259</eissn><abstract>•96Tunisian soil samples were used to calibrate and validate SIC prediction models.•MIR absorption peak-based LR and full spectra-based PLSR models were used.•Both types of model were tested on 2178French soil samples for SIC prediction.•Peak at 2510 cm−1 on Test soils samples was used to select suitable model.•SIC was accurately predicted by a LR and PLSR coupling.
Mid-Infrared reflectance spectroscopy (MIRS, 4000–400 cm−1) is being considered to provide accurate estimations of soil inorganic carbon (SIC) contents, based on prediction models when the test dataset is well represented by the calibration set, with similar SIC range and distribution and pedological context. This work addresses the case where the test dataset, here originating from France, is poorly represented by the calibration set, here originating from Tunisia, with different SIC distributions and pedological contexts. It aimed to demonstrate the usefulness of 1) classifying test samples according to SIC level based on the height of the carbonate absorbance peak at 2510 cm−1, and then 2) selecting a suitable prediction model according to SIC level. Two regression methods were tested: Linear Regression using the height of the carbonate peak at 2510 cm−1, called Peak-LR model; and Partial Least Squares Regression using the entire MIR spectrum, called Full-PLSR model. First, our results showed that Full-PLSR was 1) more accurate than Peak-LR on the Tunisian validation set (R2val = 0.99 vs. 0.86 and RMSEval = 3.0 vs. 9.7 g kg−1, respectively), but 2) less accurate than Peak-LR when applied on the French dataset (R2test = 0.70 vs. 0.91 and RMSEtest = 13.7 vs. 4.9 g kg−1, respectively). Secondly, on the French dataset, predictions on SIC-poor samples tended to be more accurate using Peak-LR, while predictions on SIC-rich samples tended to be more accurate using Full-PLSR. Thirdly, the height of the carbonate absorbance peak at 2510 cm−1 might be used to discriminate SIC-poor and SIC-rich test samples (<5 vs. > 5 g kg−1): when this height was > 0, Full-PLSR was applied; otherwise Peak-LR was applied. Coupling Peak-LR and Full-PLSR models depending on the carbonate peak yielded the best predictions on the French dataset (R2test = 0.95 and RMSEtest = 3.7 g kg−1). This study underlined the interest of using a carbonate peak to select suitable regression approach for predicting SIC content in a database with different distribution than the calibration database.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.geoderma.2021.115403</doi><orcidid>https://orcid.org/0000-0001-8285-3856</orcidid><orcidid>https://orcid.org/0000-0002-6878-6498</orcidid><orcidid>https://orcid.org/0000-0002-2986-430X</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0016-7061 |
ispartof | Geoderma, 2022-01, Vol.405, p.115403, Article 115403 |
issn | 0016-7061 1872-6259 |
language | eng |
recordid | cdi_hal_primary_oai_HAL_hal_03332031v1 |
source | ScienceDirect Journals (5 years ago - present) |
subjects | Agricultural sciences Environmental Sciences Life Sciences Linear regression Mid-infrared reflectance spectroscopy National dataset Partial least squares regression Soil inorganic carbon Soil study |
title | Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T02%3A12%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-hal_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Using%20carbonate%20absorbance%20peak%20to%20select%20the%20most%20suitable%20regression%20model%20before%20predicting%20soil%20inorganic%20carbon%20concentration%20by%20mid-infrared%20reflectance%20spectroscopy&rft.jtitle=Geoderma&rft.au=Gomez,%20C%C3%A9cile&rft.date=2022-01-01&rft.volume=405&rft.spage=115403&rft.pages=115403-&rft.artnum=115403&rft.issn=0016-7061&rft.eissn=1872-6259&rft_id=info:doi/10.1016/j.geoderma.2021.115403&rft_dat=%3Chal_cross%3Eoai_HAL_hal_03332031v1%3C/hal_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_els_id=S0016706121004833&rfr_iscdi=true |