Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques
The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classific...
Gespeichert in:
Veröffentlicht in: | The Science of the total environment 2023-05, Vol.873, p.162371-162371, Article 162371 |
---|---|
Hauptverfasser: | , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 162371 |
---|---|
container_issue | |
container_start_page | 162371 |
container_title | The Science of the total environment |
container_volume | 873 |
creator | Zheng, Jiatong Wang, Peng Shi, Hangyuan Zhuang, Changwei Deng, Yirong Yang, Xiaojun Huang, Fei Xiao, Rongbo |
description | The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution.
[Display omitted]
•A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources. |
doi_str_mv | 10.1016/j.scitotenv.2023.162371 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2849893972</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0048969723009877</els_id><sourcerecordid>2849893972</sourcerecordid><originalsourceid>FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</originalsourceid><addsrcrecordid>eNqFUU1v1DAUtBCILoW_AD5yyWLHWX8cq4oCUiWEBGfrxX5hvUqcxXYiVfx5HG3ptb5YejPzRm-GkA-c7Tnj8tNpn10oc8G47lvWij2XrVD8BdlxrUzDWStfkh1jnW6MNOqKvMn5xOpTmr8mV0LqVjMpd-TvjwViCQVKWJHmeUkOKZzPcyphjhPGQiF66lOFEw2-DsIQHGwonYeqCCM9IqwPdMICY6ZLDvE3Bb9CdOjpBO4YItIRIcUNKeiOMfxZML8lr4aqwHeP_zX5dff55-3X5v77l2-3N_eN61hXGu4OUsheqw67vlfe1Os0Z8xxACHAH4TXG6q1aZkRB-i7OgQ-gNRukL24Jh8ve89p3nyLnUJ2OI4QcV6ybXVntBFGtc9Tla4ZyhpypaoL1aU554SDPacwQXqwnNmtJHuyTyXZrSR7Kakq3z-aLP2E_kn3v5VKuLkQsKayBkzbItziDAldsX4Oz5r8AzKCqkE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2780076187</pqid></control><display><type>article</type><title>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Zheng, Jiatong ; Wang, Peng ; Shi, Hangyuan ; Zhuang, Changwei ; Deng, Yirong ; Yang, Xiaojun ; Huang, Fei ; Xiao, Rongbo</creator><creatorcontrib>Zheng, Jiatong ; Wang, Peng ; Shi, Hangyuan ; Zhuang, Changwei ; Deng, Yirong ; Yang, Xiaojun ; Huang, Fei ; Xiao, Rongbo</creatorcontrib><description>The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution.
[Display omitted]
•A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources.</description><identifier>ISSN: 0048-9697</identifier><identifier>EISSN: 1879-1026</identifier><identifier>DOI: 10.1016/j.scitotenv.2023.162371</identifier><identifier>PMID: 36828066</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>atmospheric deposition ; decision support systems ; environment ; environmental protection ; Heavy metals ; irrigation ; Machine learning ; Positive matrix factorization ; risk ; sewage ; soil ; Soil pollution ; Source apportionment</subject><ispartof>The Science of the total environment, 2023-05, Vol.873, p.162371-162371, Article 162371</ispartof><rights>2023 Elsevier B.V.</rights><rights>Copyright © 2023 Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</citedby><cites>FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0048969723009877$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36828066$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zheng, Jiatong</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Shi, Hangyuan</creatorcontrib><creatorcontrib>Zhuang, Changwei</creatorcontrib><creatorcontrib>Deng, Yirong</creatorcontrib><creatorcontrib>Yang, Xiaojun</creatorcontrib><creatorcontrib>Huang, Fei</creatorcontrib><creatorcontrib>Xiao, Rongbo</creatorcontrib><title>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</title><title>The Science of the total environment</title><addtitle>Sci Total Environ</addtitle><description>The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution.
[Display omitted]
•A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources.</description><subject>atmospheric deposition</subject><subject>decision support systems</subject><subject>environment</subject><subject>environmental protection</subject><subject>Heavy metals</subject><subject>irrigation</subject><subject>Machine learning</subject><subject>Positive matrix factorization</subject><subject>risk</subject><subject>sewage</subject><subject>soil</subject><subject>Soil pollution</subject><subject>Source apportionment</subject><issn>0048-9697</issn><issn>1879-1026</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNqFUU1v1DAUtBCILoW_AD5yyWLHWX8cq4oCUiWEBGfrxX5hvUqcxXYiVfx5HG3ptb5YejPzRm-GkA-c7Tnj8tNpn10oc8G47lvWij2XrVD8BdlxrUzDWStfkh1jnW6MNOqKvMn5xOpTmr8mV0LqVjMpd-TvjwViCQVKWJHmeUkOKZzPcyphjhPGQiF66lOFEw2-DsIQHGwonYeqCCM9IqwPdMICY6ZLDvE3Bb9CdOjpBO4YItIRIcUNKeiOMfxZML8lr4aqwHeP_zX5dff55-3X5v77l2-3N_eN61hXGu4OUsheqw67vlfe1Os0Z8xxACHAH4TXG6q1aZkRB-i7OgQ-gNRukL24Jh8ve89p3nyLnUJ2OI4QcV6ybXVntBFGtc9Tla4ZyhpypaoL1aU554SDPacwQXqwnNmtJHuyTyXZrSR7Kakq3z-aLP2E_kn3v5VKuLkQsKayBkzbItziDAldsX4Oz5r8AzKCqkE</recordid><startdate>20230515</startdate><enddate>20230515</enddate><creator>Zheng, Jiatong</creator><creator>Wang, Peng</creator><creator>Shi, Hangyuan</creator><creator>Zhuang, Changwei</creator><creator>Deng, Yirong</creator><creator>Yang, Xiaojun</creator><creator>Huang, Fei</creator><creator>Xiao, Rongbo</creator><general>Elsevier B.V</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7S9</scope><scope>L.6</scope></search><sort><creationdate>20230515</creationdate><title>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</title><author>Zheng, Jiatong ; Wang, Peng ; Shi, Hangyuan ; Zhuang, Changwei ; Deng, Yirong ; Yang, Xiaojun ; Huang, Fei ; Xiao, Rongbo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c404t-1c5636b874e4bb7d91028100c1aa33ad53d8b87488920935ab4ad5a1fa68cf6b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>atmospheric deposition</topic><topic>decision support systems</topic><topic>environment</topic><topic>environmental protection</topic><topic>Heavy metals</topic><topic>irrigation</topic><topic>Machine learning</topic><topic>Positive matrix factorization</topic><topic>risk</topic><topic>sewage</topic><topic>soil</topic><topic>Soil pollution</topic><topic>Source apportionment</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zheng, Jiatong</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Shi, Hangyuan</creatorcontrib><creatorcontrib>Zhuang, Changwei</creatorcontrib><creatorcontrib>Deng, Yirong</creatorcontrib><creatorcontrib>Yang, Xiaojun</creatorcontrib><creatorcontrib>Huang, Fei</creatorcontrib><creatorcontrib>Xiao, Rongbo</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>AGRICOLA</collection><collection>AGRICOLA - Academic</collection><jtitle>The Science of the total environment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zheng, Jiatong</au><au>Wang, Peng</au><au>Shi, Hangyuan</au><au>Zhuang, Changwei</au><au>Deng, Yirong</au><au>Yang, Xiaojun</au><au>Huang, Fei</au><au>Xiao, Rongbo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques</atitle><jtitle>The Science of the total environment</jtitle><addtitle>Sci Total Environ</addtitle><date>2023-05-15</date><risdate>2023</risdate><volume>873</volume><spage>162371</spage><epage>162371</epage><pages>162371-162371</pages><artnum>162371</artnum><issn>0048-9697</issn><eissn>1879-1026</eissn><abstract>The accurate identification of pollution sources is essential for the prevention and control of possible pollution from soil heavy metals (SHMs). However, the positive matrix factorisation (PMF) model has been widely used as a conventional method for pollution source apportionment, and the classification of source apportionment results mainly relies on existing research and expert experience, which can result in high subjectivity in the source interpretation. To address this limitation, a comprehensive source apportionment framework was developed based on advanced machine learning techniques that combine self-organizing mapping and PMF with a gradient boosting decision tree (GBDT) model. Analysis of Cd, Pb, Zn, Cu, Cr, and Ni in 272 topsoils showed that the average contents of six heavy metals were 1.72–13.79 times greater than corresponding background values, among which Cd pollution was relatively serious, with 66.91 % of the sites having higher values than the specified soil risk screening values. The PMF results revealed that 79.43 % of Pb was related to vehicle emissions and atmospheric deposition, 79.32 % of Cd and 38.84 % of Zn were related to sewage irrigation, and 85.97 % of Cr and 85.50 % of Ni were from natural sources. Moreover, the GBDT detected that industrial network density, water network density, and Fe2O3 content were the major drivers influencing each pollution source. Overall, the novelty of this study lies in the development of an improved framework based on advanced machine learning techniques that led to the accurate identification of the sources of SHM pollution, which can provide more detailed support for environmental protection departments to propose targeted control measures for soil pollution.
[Display omitted]
•A comprehensive method based on advanced machine learning techniques was proposed.•12 explanatory variables were incorporated into the source apportionment process.•Gradient boosting decision tree reliably identified nonlinear effect of driving variable on the pollution sources.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>36828066</pmid><doi>10.1016/j.scitotenv.2023.162371</doi><tpages>1</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0048-9697 |
ispartof | The Science of the total environment, 2023-05, Vol.873, p.162371-162371, Article 162371 |
issn | 0048-9697 1879-1026 |
language | eng |
recordid | cdi_proquest_miscellaneous_2849893972 |
source | Elsevier ScienceDirect Journals Complete |
subjects | atmospheric deposition decision support systems environment environmental protection Heavy metals irrigation Machine learning Positive matrix factorization risk sewage soil Soil pollution Source apportionment |
title | Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T15%3A26%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Quantitative%20source%20apportionment%20and%20driver%20identification%20of%20soil%20heavy%20metals%20using%20advanced%20machine%20learning%20techniques&rft.jtitle=The%20Science%20of%20the%20total%20environment&rft.au=Zheng,%20Jiatong&rft.date=2023-05-15&rft.volume=873&rft.spage=162371&rft.epage=162371&rft.pages=162371-162371&rft.artnum=162371&rft.issn=0048-9697&rft.eissn=1879-1026&rft_id=info:doi/10.1016/j.scitotenv.2023.162371&rft_dat=%3Cproquest_cross%3E2849893972%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2780076187&rft_id=info:pmid/36828066&rft_els_id=S0048969723009877&rfr_iscdi=true |