Large-scale prediction of stream water quality using an interpretable deep learning approach
Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to th...
Gespeichert in:
Veröffentlicht in: | Journal of environmental management 2023-04, Vol.331, p.117309-117309, Article 117309 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 117309 |
---|---|
container_issue | |
container_start_page | 117309 |
container_title | Journal of environmental management |
container_volume | 331 |
creator | Zheng, Hang Liu, Yueyi Wan, Wenhua Zhao, Jianshi Xie, Guanti |
description | Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively.
[Display omitted]
•An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors. |
doi_str_mv | 10.1016/j.jenvman.2023.117309 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3153640938</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S030147972300097X</els_id><sourcerecordid>3153640938</sourcerecordid><originalsourceid>FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</originalsourceid><addsrcrecordid>eNqFkc1v1DAQxS0EokvhTwD5yCWLvx2fEKr4qLQSF7ghWRNnUrxKnNROWvW_x2UXrj2NNPN786T3CHnL2Z4zbj4c90dMdxOkvWBC7jm3krlnZMeZ001rJHtOdkwy3ijr7AV5VcqRMSYFty_JhTRGW8HUjvw6QL7BpgQYkS4Z-xjWOCc6D7SsGWGi97BiprcbjHF9oFuJ6YZCojHVdRWs0FVlj7jQESGnv-dlyTOE36_JiwHGgm_O85L8_PL5x9W35vD96_XVp0MTpGvXBhx0qDtAMCCC0iZwYDoMQpm27RXTLXImOqeEUWhZpwdhUBsUwnZGOZSX5P3pb7W93bCsfool4DhCwnkrXnItjWJOtk-iwpq2_nXWVVSf0JDnUjIOfslxgvzgOfOPHfijP3fgHzvwpw6q7t3ZYusm7P-r_oVegY8nAGsmdxGzLyFiCjX8jGH1_RyfsPgDbZua-g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2768227979</pqid></control><display><type>article</type><title>Large-scale prediction of stream water quality using an interpretable deep learning approach</title><source>MEDLINE</source><source>ScienceDirect Journals (5 years ago - present)</source><creator>Zheng, Hang ; Liu, Yueyi ; Wan, Wenhua ; Zhao, Jianshi ; Xie, Guanti</creator><creatorcontrib>Zheng, Hang ; Liu, Yueyi ; Wan, Wenhua ; Zhao, Jianshi ; Xie, Guanti</creatorcontrib><description>Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively.
[Display omitted]
•An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors.</description><identifier>ISSN: 0301-4797</identifier><identifier>EISSN: 1095-8630</identifier><identifier>DOI: 10.1016/j.jenvman.2023.117309</identifier><identifier>PMID: 36657204</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>air temperature ; ammonium nitrogen ; chemical oxygen demand ; China ; Deep Learning ; dissolved oxygen ; environmental management ; Environmental Monitoring - methods ; forests ; hydrology ; Interpretable ; land use ; Large scale ; mechanics ; model validation ; Nitrogen - analysis ; Phosphorus - analysis ; population density ; Prediction ; rain ; Rivers ; streams ; subwatersheds ; total nitrogen ; total phosphorus ; turbidity ; urban areas ; Water Pollutants, Chemical - analysis ; Water Quality</subject><ispartof>Journal of environmental management, 2023-04, Vol.331, p.117309-117309, Article 117309</ispartof><rights>2023 Elsevier Ltd</rights><rights>Copyright © 2023 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</citedby><cites>FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</cites><orcidid>0000-0002-1356-0467 ; 0000-0001-9789-8555</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jenvman.2023.117309$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27911,27912,45982</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36657204$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zheng, Hang</creatorcontrib><creatorcontrib>Liu, Yueyi</creatorcontrib><creatorcontrib>Wan, Wenhua</creatorcontrib><creatorcontrib>Zhao, Jianshi</creatorcontrib><creatorcontrib>Xie, Guanti</creatorcontrib><title>Large-scale prediction of stream water quality using an interpretable deep learning approach</title><title>Journal of environmental management</title><addtitle>J Environ Manage</addtitle><description>Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively.
[Display omitted]
•An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors.</description><subject>air temperature</subject><subject>ammonium nitrogen</subject><subject>chemical oxygen demand</subject><subject>China</subject><subject>Deep Learning</subject><subject>dissolved oxygen</subject><subject>environmental management</subject><subject>Environmental Monitoring - methods</subject><subject>forests</subject><subject>hydrology</subject><subject>Interpretable</subject><subject>land use</subject><subject>Large scale</subject><subject>mechanics</subject><subject>model validation</subject><subject>Nitrogen - analysis</subject><subject>Phosphorus - analysis</subject><subject>population density</subject><subject>Prediction</subject><subject>rain</subject><subject>Rivers</subject><subject>streams</subject><subject>subwatersheds</subject><subject>total nitrogen</subject><subject>total phosphorus</subject><subject>turbidity</subject><subject>urban areas</subject><subject>Water Pollutants, Chemical - analysis</subject><subject>Water Quality</subject><issn>0301-4797</issn><issn>1095-8630</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkc1v1DAQxS0EokvhTwD5yCWLvx2fEKr4qLQSF7ghWRNnUrxKnNROWvW_x2UXrj2NNPN786T3CHnL2Z4zbj4c90dMdxOkvWBC7jm3krlnZMeZ001rJHtOdkwy3ijr7AV5VcqRMSYFty_JhTRGW8HUjvw6QL7BpgQYkS4Z-xjWOCc6D7SsGWGi97BiprcbjHF9oFuJ6YZCojHVdRWs0FVlj7jQESGnv-dlyTOE36_JiwHGgm_O85L8_PL5x9W35vD96_XVp0MTpGvXBhx0qDtAMCCC0iZwYDoMQpm27RXTLXImOqeEUWhZpwdhUBsUwnZGOZSX5P3pb7W93bCsfool4DhCwnkrXnItjWJOtk-iwpq2_nXWVVSf0JDnUjIOfslxgvzgOfOPHfijP3fgHzvwpw6q7t3ZYusm7P-r_oVegY8nAGsmdxGzLyFiCjX8jGH1_RyfsPgDbZua-g</recordid><startdate>20230401</startdate><enddate>20230401</enddate><creator>Zheng, Hang</creator><creator>Liu, Yueyi</creator><creator>Wan, Wenhua</creator><creator>Zhao, Jianshi</creator><creator>Xie, Guanti</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7S9</scope><scope>L.6</scope><orcidid>https://orcid.org/0000-0002-1356-0467</orcidid><orcidid>https://orcid.org/0000-0001-9789-8555</orcidid></search><sort><creationdate>20230401</creationdate><title>Large-scale prediction of stream water quality using an interpretable deep learning approach</title><author>Zheng, Hang ; Liu, Yueyi ; Wan, Wenhua ; Zhao, Jianshi ; Xie, Guanti</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>air temperature</topic><topic>ammonium nitrogen</topic><topic>chemical oxygen demand</topic><topic>China</topic><topic>Deep Learning</topic><topic>dissolved oxygen</topic><topic>environmental management</topic><topic>Environmental Monitoring - methods</topic><topic>forests</topic><topic>hydrology</topic><topic>Interpretable</topic><topic>land use</topic><topic>Large scale</topic><topic>mechanics</topic><topic>model validation</topic><topic>Nitrogen - analysis</topic><topic>Phosphorus - analysis</topic><topic>population density</topic><topic>Prediction</topic><topic>rain</topic><topic>Rivers</topic><topic>streams</topic><topic>subwatersheds</topic><topic>total nitrogen</topic><topic>total phosphorus</topic><topic>turbidity</topic><topic>urban areas</topic><topic>Water Pollutants, Chemical - analysis</topic><topic>Water Quality</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zheng, Hang</creatorcontrib><creatorcontrib>Liu, Yueyi</creatorcontrib><creatorcontrib>Wan, Wenhua</creatorcontrib><creatorcontrib>Zhao, Jianshi</creatorcontrib><creatorcontrib>Xie, Guanti</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>AGRICOLA</collection><collection>AGRICOLA - Academic</collection><jtitle>Journal of environmental management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zheng, Hang</au><au>Liu, Yueyi</au><au>Wan, Wenhua</au><au>Zhao, Jianshi</au><au>Xie, Guanti</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Large-scale prediction of stream water quality using an interpretable deep learning approach</atitle><jtitle>Journal of environmental management</jtitle><addtitle>J Environ Manage</addtitle><date>2023-04-01</date><risdate>2023</risdate><volume>331</volume><spage>117309</spage><epage>117309</epage><pages>117309-117309</pages><artnum>117309</artnum><issn>0301-4797</issn><eissn>1095-8630</eissn><abstract>Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively.
[Display omitted]
•An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>36657204</pmid><doi>10.1016/j.jenvman.2023.117309</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-1356-0467</orcidid><orcidid>https://orcid.org/0000-0001-9789-8555</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0301-4797 |
ispartof | Journal of environmental management, 2023-04, Vol.331, p.117309-117309, Article 117309 |
issn | 0301-4797 1095-8630 |
language | eng |
recordid | cdi_proquest_miscellaneous_3153640938 |
source | MEDLINE; ScienceDirect Journals (5 years ago - present) |
subjects | air temperature ammonium nitrogen chemical oxygen demand China Deep Learning dissolved oxygen environmental management Environmental Monitoring - methods forests hydrology Interpretable land use Large scale mechanics model validation Nitrogen - analysis Phosphorus - analysis population density Prediction rain Rivers streams subwatersheds total nitrogen total phosphorus turbidity urban areas Water Pollutants, Chemical - analysis Water Quality |
title | Large-scale prediction of stream water quality using an interpretable deep learning approach |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T11%3A55%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Large-scale%20prediction%20of%20stream%20water%20quality%20using%20an%20interpretable%20deep%20learning%20approach&rft.jtitle=Journal%20of%20environmental%20management&rft.au=Zheng,%20Hang&rft.date=2023-04-01&rft.volume=331&rft.spage=117309&rft.epage=117309&rft.pages=117309-117309&rft.artnum=117309&rft.issn=0301-4797&rft.eissn=1095-8630&rft_id=info:doi/10.1016/j.jenvman.2023.117309&rft_dat=%3Cproquest_cross%3E3153640938%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2768227979&rft_id=info:pmid/36657204&rft_els_id=S030147972300097X&rfr_iscdi=true |