Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques

The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classific...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Science of the total environment 2023-05, Vol.873, p.162371-162371, Article 162371
Hauptverfasser: Zheng, Jiatong, Wang, Peng, Shi, Hangyuan, Zhuang, Changwei, Deng, Yirong, Yang, Xiaojun, Huang, Fei, Xiao, Rongbo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 162371
container_issue
container_start_page 162371
container_title The Science of the total environment
container_volume 873
creator Zheng, Jiatong
Wang, Peng
Shi, Hangyuan
Zhuang, Changwei
Deng, Yirong
Yang, Xiaojun
Huang, Fei
Xiao, Rongbo
description The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution. [Display omitted] •A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources.
doi_str_mv 10.1016/j.scitotenv.2023.162371
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2849893972</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0048969723009877</els_id><sourcerecordid>2849893972</sourcerecordid><originalsourceid>FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</originalsourceid><addsrcrecordid>eNqFUU1v1DAUtBCILoW_AD5yyWLHWX8cq4oCUiWEBGfrxX5hvUqcxXYiVfx5HG3ptb5YejPzRm-GkA-c7Tnj8tNpn10oc8G47lvWij2XrVD8BdlxrUzDWStfkh1jnW6MNOqKvMn5xOpTmr8mV0LqVjMpd-TvjwViCQVKWJHmeUkOKZzPcyphjhPGQiF66lOFEw2-DsIQHGwonYeqCCM9IqwPdMICY6ZLDvE3Bb9CdOjpBO4YItIRIcUNKeiOMfxZML8lr4aqwHeP_zX5dff55-3X5v77l2-3N_eN61hXGu4OUsheqw67vlfe1Os0Z8xxACHAH4TXG6q1aZkRB-i7OgQ-gNRukL24Jh8ve89p3nyLnUJ2OI4QcV6ybXVntBFGtc9Tla4ZyhpypaoL1aU554SDPacwQXqwnNmtJHuyTyXZrSR7Kakq3z-aLP2E_kn3v5VKuLkQsKayBkzbItziDAldsX4Oz5r8AzKCqkE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2780076187</pqid></control><display><type>article</type><title>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Zheng, Jiatong ; Wang, Peng ; Shi, Hangyuan ; Zhuang, Changwei ; Deng, Yirong ; Yang, Xiaojun ; Huang, Fei ; Xiao, Rongbo</creator><creatorcontrib>Zheng, Jiatong ; Wang, Peng ; Shi, Hangyuan ; Zhuang, Changwei ; Deng, Yirong ; Yang, Xiaojun ; Huang, Fei ; Xiao, Rongbo</creatorcontrib><description>The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution. [Display omitted] •A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources.</description><identifier>ISSN: 0048-9697</identifier><identifier>EISSN: 1879-1026</identifier><identifier>DOI: 10.1016/j.scitotenv.2023.162371</identifier><identifier>PMID: 36828066</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>atmospheric deposition ; decision support systems ; environment ; environmental protection ; Heavy metals ; irrigation ; Machine learning ; Positive matrix factorization ; risk ; sewage ; soil ; Soil pollution ; Source apportionment</subject><ispartof>The Science of the total environment, 2023-05, Vol.873, p.162371-162371, Article 162371</ispartof><rights>2023 Elsevier B.V.</rights><rights>Copyright © 2023 Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</citedby><cites>FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0048969723009877$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36828066$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zheng, Jiatong</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Shi, Hangyuan</creatorcontrib><creatorcontrib>Zhuang, Changwei</creatorcontrib><creatorcontrib>Deng, Yirong</creatorcontrib><creatorcontrib>Yang, Xiaojun</creatorcontrib><creatorcontrib>Huang, Fei</creatorcontrib><creatorcontrib>Xiao, Rongbo</creatorcontrib><title>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</title><title>The Science of the total environment</title><addtitle>Sci Total Environ</addtitle><description>The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution. [Display omitted] •A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources.</description><subject>atmospheric deposition</subject><subject>decision support systems</subject><subject>environment</subject><subject>environmental protection</subject><subject>Heavy metals</subject><subject>irrigation</subject><subject>Machine learning</subject><subject>Positive matrix factorization</subject><subject>risk</subject><subject>sewage</subject><subject>soil</subject><subject>Soil pollution</subject><subject>Source apportionment</subject><issn>0048-9697</issn><issn>1879-1026</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNqFUU1v1DAUtBCILoW_AD5yyWLHWX8cq4oCUiWEBGfrxX5hvUqcxXYiVfx5HG3ptb5YejPzRm-GkA-c7Tnj8tNpn10oc8G47lvWij2XrVD8BdlxrUzDWStfkh1jnW6MNOqKvMn5xOpTmr8mV0LqVjMpd-TvjwViCQVKWJHmeUkOKZzPcyphjhPGQiF66lOFEw2-DsIQHGwonYeqCCM9IqwPdMICY6ZLDvE3Bb9CdOjpBO4YItIRIcUNKeiOMfxZML8lr4aqwHeP_zX5dff55-3X5v77l2-3N_eN61hXGu4OUsheqw67vlfe1Os0Z8xxACHAH4TXG6q1aZkRB-i7OgQ-gNRukL24Jh8ve89p3nyLnUJ2OI4QcV6ybXVntBFGtc9Tla4ZyhpypaoL1aU554SDPacwQXqwnNmtJHuyTyXZrSR7Kakq3z-aLP2E_kn3v5VKuLkQsKayBkzbItziDAldsX4Oz5r8AzKCqkE</recordid><startdate>20230515</startdate><enddate>20230515</enddate><creator>Zheng, Jiatong</creator><creator>Wang, Peng</creator><creator>Shi, Hangyuan</creator><creator>Zhuang, Changwei</creator><creator>Deng, Yirong</creator><creator>Yang, Xiaojun</creator><creator>Huang, Fei</creator><creator>Xiao, Rongbo</creator><general>Elsevier B.V</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7S9</scope><scope>L.6</scope></search><sort><creationdate>20230515</creationdate><title>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</title><author>Zheng, Jiatong ; Wang, Peng ; Shi, Hangyuan ; Zhuang, Changwei ; Deng, Yirong ; Yang, Xiaojun ; Huang, Fei ; Xiao, Rongbo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>atmospheric deposition</topic><topic>decision support systems</topic><topic>environment</topic><topic>environmental protection</topic><topic>Heavy metals</topic><topic>irrigation</topic><topic>Machine learning</topic><topic>Positive matrix factorization</topic><topic>risk</topic><topic>sewage</topic><topic>soil</topic><topic>Soil pollution</topic><topic>Source apportionment</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zheng, Jiatong</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Shi, Hangyuan</creatorcontrib><creatorcontrib>Zhuang, Changwei</creatorcontrib><creatorcontrib>Deng, Yirong</creatorcontrib><creatorcontrib>Yang, Xiaojun</creatorcontrib><creatorcontrib>Huang, Fei</creatorcontrib><creatorcontrib>Xiao, Rongbo</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>AGRICOLA</collection><collection>AGRICOLA - Academic</collection><jtitle>The Science of the total environment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zheng, Jiatong</au><au>Wang, Peng</au><au>Shi, Hangyuan</au><au>Zhuang, Changwei</au><au>Deng, Yirong</au><au>Yang, Xiaojun</au><au>Huang, Fei</au><au>Xiao, Rongbo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</atitle><jtitle>The Science of the total environment</jtitle><addtitle>Sci Total Environ</addtitle><date>2023-05-15</date><risdate>2023</risdate><volume>873</volume><spage>162371</spage><epage>162371</epage><pages>162371-162371</pages><artnum>162371</artnum><issn>0048-9697</issn><eissn>1879-1026</eissn><abstract>The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution. [Display omitted] •A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>36828066</pmid><doi>10.1016/j.scitotenv.2023.162371</doi><tpages>1</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0048-9697
ispartof The Science of the total environment, 2023-05, Vol.873, p.162371-162371, Article 162371
issn 0048-9697
1879-1026
language eng
recordid cdi_proquest_miscellaneous_2849893972
source Elsevier ScienceDirect Journals Complete
subjects atmospheric deposition
decision support systems
environment
environmental protection
Heavy metals
irrigation
Machine learning
Positive matrix factorization
risk
sewage
soil
Soil pollution
Source apportionment
title Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T15%3A26%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Quantitative%20source%20apportionment%20and%20driver%20identification%20of%20soil%20heavy%20metals%20using%20advanced%20machine%20learning%20techniques&rft.jtitle=The%20Science%20of%20the%20total%20environment&rft.au=Zheng,%20Jiatong&rft.date=2023-05-15&rft.volume=873&rft.spage=162371&rft.epage=162371&rft.pages=162371-162371&rft.artnum=162371&rft.issn=0048-9697&rft.eissn=1879-1026&rft_id=info:doi/10.1016/j.scitotenv.2023.162371&rft_dat=%3Cproquest_cross%3E2849893972%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2780076187&rft_id=info:pmid/36828066&rft_els_id=S0048969723009877&rfr_iscdi=true