Large-scale prediction of stream water quality using an interpretable deep learning approach

Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to th...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of environmental management 2023-04, Vol.331, p.117309-117309, Article 117309
Hauptverfasser: Zheng, Hang, Liu, Yueyi, Wan, Wenhua, Zhao, Jianshi, Xie, Guanti
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 117309
container_issue
container_start_page 117309
container_title Journal of environmental management
container_volume 331
creator Zheng, Hang
Liu, Yueyi
Wan, Wenhua
Zhao, Jianshi
Xie, Guanti
description Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively. [Display omitted] •An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors.
doi_str_mv 10.1016/j.jenvman.2023.117309
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3153640938</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S030147972300097X</els_id><sourcerecordid>3153640938</sourcerecordid><originalsourceid>FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</originalsourceid><addsrcrecordid>eNqFkc1v1DAQxS0EokvhTwD5yCWLvx2fEKr4qLQSF7ghWRNnUrxKnNROWvW_x2UXrj2NNPN786T3CHnL2Z4zbj4c90dMdxOkvWBC7jm3krlnZMeZ001rJHtOdkwy3ijr7AV5VcqRMSYFty_JhTRGW8HUjvw6QL7BpgQYkS4Z-xjWOCc6D7SsGWGi97BiprcbjHF9oFuJ6YZCojHVdRWs0FVlj7jQESGnv-dlyTOE36_JiwHGgm_O85L8_PL5x9W35vD96_XVp0MTpGvXBhx0qDtAMCCC0iZwYDoMQpm27RXTLXImOqeEUWhZpwdhUBsUwnZGOZSX5P3pb7W93bCsfool4DhCwnkrXnItjWJOtk-iwpq2_nXWVVSf0JDnUjIOfslxgvzgOfOPHfijP3fgHzvwpw6q7t3ZYusm7P-r_oVegY8nAGsmdxGzLyFiCjX8jGH1_RyfsPgDbZua-g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2768227979</pqid></control><display><type>article</type><title>Large-scale prediction of stream water quality using an interpretable deep learning approach</title><source>MEDLINE</source><source>ScienceDirect Journals (5 years ago - present)</source><creator>Zheng, Hang ; Liu, Yueyi ; Wan, Wenhua ; Zhao, Jianshi ; Xie, Guanti</creator><creatorcontrib>Zheng, Hang ; Liu, Yueyi ; Wan, Wenhua ; Zhao, Jianshi ; Xie, Guanti</creatorcontrib><description>Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively. [Display omitted] •An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors.</description><identifier>ISSN: 0301-4797</identifier><identifier>EISSN: 1095-8630</identifier><identifier>DOI: 10.1016/j.jenvman.2023.117309</identifier><identifier>PMID: 36657204</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>air temperature ; ammonium nitrogen ; chemical oxygen demand ; China ; Deep Learning ; dissolved oxygen ; environmental management ; Environmental Monitoring - methods ; forests ; hydrology ; Interpretable ; land use ; Large scale ; mechanics ; model validation ; Nitrogen - analysis ; Phosphorus - analysis ; population density ; Prediction ; rain ; Rivers ; streams ; subwatersheds ; total nitrogen ; total phosphorus ; turbidity ; urban areas ; Water Pollutants, Chemical - analysis ; Water Quality</subject><ispartof>Journal of environmental management, 2023-04, Vol.331, p.117309-117309, Article 117309</ispartof><rights>2023 Elsevier Ltd</rights><rights>Copyright © 2023 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</citedby><cites>FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</cites><orcidid>0000-0002-1356-0467 ; 0000-0001-9789-8555</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jenvman.2023.117309$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27911,27912,45982</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36657204$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zheng, Hang</creatorcontrib><creatorcontrib>Liu, Yueyi</creatorcontrib><creatorcontrib>Wan, Wenhua</creatorcontrib><creatorcontrib>Zhao, Jianshi</creatorcontrib><creatorcontrib>Xie, Guanti</creatorcontrib><title>Large-scale prediction of stream water quality using an interpretable deep learning approach</title><title>Journal of environmental management</title><addtitle>J Environ Manage</addtitle><description>Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively. [Display omitted] •An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors.</description><subject>air temperature</subject><subject>ammonium nitrogen</subject><subject>chemical oxygen demand</subject><subject>China</subject><subject>Deep Learning</subject><subject>dissolved oxygen</subject><subject>environmental management</subject><subject>Environmental Monitoring - methods</subject><subject>forests</subject><subject>hydrology</subject><subject>Interpretable</subject><subject>land use</subject><subject>Large scale</subject><subject>mechanics</subject><subject>model validation</subject><subject>Nitrogen - analysis</subject><subject>Phosphorus - analysis</subject><subject>population density</subject><subject>Prediction</subject><subject>rain</subject><subject>Rivers</subject><subject>streams</subject><subject>subwatersheds</subject><subject>total nitrogen</subject><subject>total phosphorus</subject><subject>turbidity</subject><subject>urban areas</subject><subject>Water Pollutants, Chemical - analysis</subject><subject>Water Quality</subject><issn>0301-4797</issn><issn>1095-8630</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkc1v1DAQxS0EokvhTwD5yCWLvx2fEKr4qLQSF7ghWRNnUrxKnNROWvW_x2UXrj2NNPN786T3CHnL2Z4zbj4c90dMdxOkvWBC7jm3krlnZMeZ001rJHtOdkwy3ijr7AV5VcqRMSYFty_JhTRGW8HUjvw6QL7BpgQYkS4Z-xjWOCc6D7SsGWGi97BiprcbjHF9oFuJ6YZCojHVdRWs0FVlj7jQESGnv-dlyTOE36_JiwHGgm_O85L8_PL5x9W35vD96_XVp0MTpGvXBhx0qDtAMCCC0iZwYDoMQpm27RXTLXImOqeEUWhZpwdhUBsUwnZGOZSX5P3pb7W93bCsfool4DhCwnkrXnItjWJOtk-iwpq2_nXWVVSf0JDnUjIOfslxgvzgOfOPHfijP3fgHzvwpw6q7t3ZYusm7P-r_oVegY8nAGsmdxGzLyFiCjX8jGH1_RyfsPgDbZua-g</recordid><startdate>20230401</startdate><enddate>20230401</enddate><creator>Zheng, Hang</creator><creator>Liu, Yueyi</creator><creator>Wan, Wenhua</creator><creator>Zhao, Jianshi</creator><creator>Xie, Guanti</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7S9</scope><scope>L.6</scope><orcidid>https://orcid.org/0000-0002-1356-0467</orcidid><orcidid>https://orcid.org/0000-0001-9789-8555</orcidid></search><sort><creationdate>20230401</creationdate><title>Large-scale prediction of stream water quality using an interpretable deep learning approach</title><author>Zheng, Hang ; Liu, Yueyi ; Wan, Wenhua ; Zhao, Jianshi ; Xie, Guanti</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c398t-a9abe5baea6a2c456c1a05cf24688d4058e102b94264e70b5f26e56e227b649e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>air temperature</topic><topic>ammonium nitrogen</topic><topic>chemical oxygen demand</topic><topic>China</topic><topic>Deep Learning</topic><topic>dissolved oxygen</topic><topic>environmental management</topic><topic>Environmental Monitoring - methods</topic><topic>forests</topic><topic>hydrology</topic><topic>Interpretable</topic><topic>land use</topic><topic>Large scale</topic><topic>mechanics</topic><topic>model validation</topic><topic>Nitrogen - analysis</topic><topic>Phosphorus - analysis</topic><topic>population density</topic><topic>Prediction</topic><topic>rain</topic><topic>Rivers</topic><topic>streams</topic><topic>subwatersheds</topic><topic>total nitrogen</topic><topic>total phosphorus</topic><topic>turbidity</topic><topic>urban areas</topic><topic>Water Pollutants, Chemical - analysis</topic><topic>Water Quality</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zheng, Hang</creatorcontrib><creatorcontrib>Liu, Yueyi</creatorcontrib><creatorcontrib>Wan, Wenhua</creatorcontrib><creatorcontrib>Zhao, Jianshi</creatorcontrib><creatorcontrib>Xie, Guanti</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>AGRICOLA</collection><collection>AGRICOLA - Academic</collection><jtitle>Journal of environmental management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zheng, Hang</au><au>Liu, Yueyi</au><au>Wan, Wenhua</au><au>Zhao, Jianshi</au><au>Xie, Guanti</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Large-scale prediction of stream water quality using an interpretable deep learning approach</atitle><jtitle>Journal of environmental management</jtitle><addtitle>J Environ Manage</addtitle><date>2023-04-01</date><risdate>2023</risdate><volume>331</volume><spage>117309</spage><epage>117309</epage><pages>117309-117309</pages><artnum>117309</artnum><issn>0301-4797</issn><eissn>1095-8630</eissn><abstract>Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3–N), TN, TP, and turbidity in the stream water in the case area, respectively. [Display omitted] •An innovative interpretable deep learning method on water quality predictions.•The SHapley Additive exPlanations method could interpret the prediction results.•Economic categorical data could explain water quality variations at a large scale.•The prediction accuracy could be improved by involving land-use predictors.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>36657204</pmid><doi>10.1016/j.jenvman.2023.117309</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-1356-0467</orcidid><orcidid>https://orcid.org/0000-0001-9789-8555</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0301-4797
ispartof Journal of environmental management, 2023-04, Vol.331, p.117309-117309, Article 117309
issn 0301-4797
1095-8630
language eng
recordid cdi_proquest_miscellaneous_3153640938
source MEDLINE; ScienceDirect Journals (5 years ago - present)
subjects air temperature
ammonium nitrogen
chemical oxygen demand
China
Deep Learning
dissolved oxygen
environmental management
Environmental Monitoring - methods
forests
hydrology
Interpretable
land use
Large scale
mechanics
model validation
Nitrogen - analysis
Phosphorus - analysis
population density
Prediction
rain
Rivers
streams
subwatersheds
total nitrogen
total phosphorus
turbidity
urban areas
Water Pollutants, Chemical - analysis
Water Quality
title Large-scale prediction of stream water quality using an interpretable deep learning approach
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T11%3A55%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Large-scale%20prediction%20of%20stream%20water%20quality%20using%20an%20interpretable%20deep%20learning%20approach&rft.jtitle=Journal%20of%20environmental%20management&rft.au=Zheng,%20Hang&rft.date=2023-04-01&rft.volume=331&rft.spage=117309&rft.epage=117309&rft.pages=117309-117309&rft.artnum=117309&rft.issn=0301-4797&rft.eissn=1095-8630&rft_id=info:doi/10.1016/j.jenvman.2023.117309&rft_dat=%3Cproquest_cross%3E3153640938%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2768227979&rft_id=info:pmid/36657204&rft_els_id=S030147972300097X&rfr_iscdi=true