Influential point detection diagnostics in the context of hydrological model calibration
•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more gen...
Gespeichert in:
Veröffentlicht in: | Journal of hydrology (Amsterdam) 2015-08, Vol.527, p.1161-1172 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1172 |
---|---|
container_issue | |
container_start_page | 1161 |
container_title | Journal of hydrology (Amsterdam) |
container_volume | 527 |
creator | Wright, David P. Thyer, Mark Westra, Seth |
description | •Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general.
Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hy |
doi_str_mv | 10.1016/j.jhydrol.2015.05.047 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1778013744</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0022169415003972</els_id><sourcerecordid>1778013744</sourcerecordid><originalsourceid>FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</originalsourceid><addsrcrecordid>eNqNUE1LJDEQDbKCs-pPEHLcS4-pTneSPomIXyDsZQVvIZ1Ua4aeZEwyov_ezI53LR5UQb33inqEnAFbAgNxvlquXj5civOyZdAvWUUnD8gClByaVjL5iywYa9sGxNAdkd85r1gtzrsFeboP07zFULyZ6Sb6UKjDgrb4GKjz5jnEXLzN1AdaXpDaGAq-Fxonur8Zn72t0nV0ONM6-TGZnfiEHE5mznj61Y_J4831v6u75uHv7f3V5UNj-KBK07dqGiZrgQvE1nGYHBgDIxc9l2awdTWqEUAKMUrHeiVdL4xFCaDc1Ap-TP7sfTcpvm4xF7322eI8m4BxmzVIqRhw2XU_oHZCKdH_d-33VJtizgknvUl-bdKHBqZ3oeuV_gpd70LXrKKTVXex12F9-c1j0tl6DBadTzVU7aL_xuETleKPYg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1746886526</pqid></control><display><type>article</type><title>Influential point detection diagnostics in the context of hydrological model calibration</title><source>Elsevier ScienceDirect Journals</source><creator>Wright, David P. ; Thyer, Mark ; Westra, Seth</creator><creatorcontrib>Wright, David P. ; Thyer, Mark ; Westra, Seth</creatorcontrib><description>•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general.
Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hydrological model calibration.</description><identifier>ISSN: 0022-1694</identifier><identifier>EISSN: 1879-2707</identifier><identifier>DOI: 10.1016/j.jhydrol.2015.05.047</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Calibration ; Case-deletion ; Catchments ; Cook’s distance ; Diagnostic systems ; Heating ; Hydrologic calibration ; Hydrology models ; Influence diagnostics ; Mahalanobis distance ; Mathematical analysis ; Mathematical models ; Nonlinearity</subject><ispartof>Journal of hydrology (Amsterdam), 2015-08, Vol.527, p.1161-1172</ispartof><rights>2015 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</citedby><cites>FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jhydrol.2015.05.047$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27907,27908,45978</link.rule.ids></links><search><creatorcontrib>Wright, David P.</creatorcontrib><creatorcontrib>Thyer, Mark</creatorcontrib><creatorcontrib>Westra, Seth</creatorcontrib><title>Influential point detection diagnostics in the context of hydrological model calibration</title><title>Journal of hydrology (Amsterdam)</title><description>•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general.
Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hydrological model calibration.</description><subject>Calibration</subject><subject>Case-deletion</subject><subject>Catchments</subject><subject>Cook’s distance</subject><subject>Diagnostic systems</subject><subject>Heating</subject><subject>Hydrologic calibration</subject><subject>Hydrology models</subject><subject>Influence diagnostics</subject><subject>Mahalanobis distance</subject><subject>Mathematical analysis</subject><subject>Mathematical models</subject><subject>Nonlinearity</subject><issn>0022-1694</issn><issn>1879-2707</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNqNUE1LJDEQDbKCs-pPEHLcS4-pTneSPomIXyDsZQVvIZ1Ua4aeZEwyov_ezI53LR5UQb33inqEnAFbAgNxvlquXj5civOyZdAvWUUnD8gClByaVjL5iywYa9sGxNAdkd85r1gtzrsFeboP07zFULyZ6Sb6UKjDgrb4GKjz5jnEXLzN1AdaXpDaGAq-Fxonur8Zn72t0nV0ONM6-TGZnfiEHE5mznj61Y_J4831v6u75uHv7f3V5UNj-KBK07dqGiZrgQvE1nGYHBgDIxc9l2awdTWqEUAKMUrHeiVdL4xFCaDc1Ap-TP7sfTcpvm4xF7322eI8m4BxmzVIqRhw2XU_oHZCKdH_d-33VJtizgknvUl-bdKHBqZ3oeuV_gpd70LXrKKTVXex12F9-c1j0tl6DBadTzVU7aL_xuETleKPYg</recordid><startdate>20150801</startdate><enddate>20150801</enddate><creator>Wright, David P.</creator><creator>Thyer, Mark</creator><creator>Westra, Seth</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QH</scope><scope>7ST</scope><scope>7TG</scope><scope>7UA</scope><scope>C1K</scope><scope>F1W</scope><scope>H96</scope><scope>KL.</scope><scope>L.G</scope><scope>SOI</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope></search><sort><creationdate>20150801</creationdate><title>Influential point detection diagnostics in the context of hydrological model calibration</title><author>Wright, David P. ; Thyer, Mark ; Westra, Seth</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Calibration</topic><topic>Case-deletion</topic><topic>Catchments</topic><topic>Cook’s distance</topic><topic>Diagnostic systems</topic><topic>Heating</topic><topic>Hydrologic calibration</topic><topic>Hydrology models</topic><topic>Influence diagnostics</topic><topic>Mahalanobis distance</topic><topic>Mathematical analysis</topic><topic>Mathematical models</topic><topic>Nonlinearity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wright, David P.</creatorcontrib><creatorcontrib>Thyer, Mark</creatorcontrib><creatorcontrib>Westra, Seth</creatorcontrib><collection>CrossRef</collection><collection>Aqualine</collection><collection>Environment Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Water Resources Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) Professional</collection><collection>Environment Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Journal of hydrology (Amsterdam)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wright, David P.</au><au>Thyer, Mark</au><au>Westra, Seth</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Influential point detection diagnostics in the context of hydrological model calibration</atitle><jtitle>Journal of hydrology (Amsterdam)</jtitle><date>2015-08-01</date><risdate>2015</risdate><volume>527</volume><spage>1161</spage><epage>1172</epage><pages>1161-1172</pages><issn>0022-1694</issn><eissn>1879-2707</eissn><abstract>•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general.
Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hydrological model calibration.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.jhydrol.2015.05.047</doi><tpages>12</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0022-1694 |
ispartof | Journal of hydrology (Amsterdam), 2015-08, Vol.527, p.1161-1172 |
issn | 0022-1694 1879-2707 |
language | eng |
recordid | cdi_proquest_miscellaneous_1778013744 |
source | Elsevier ScienceDirect Journals |
subjects | Calibration Case-deletion Catchments Cook’s distance Diagnostic systems Heating Hydrologic calibration Hydrology models Influence diagnostics Mahalanobis distance Mathematical analysis Mathematical models Nonlinearity |
title | Influential point detection diagnostics in the context of hydrological model calibration |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T07%3A11%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Influential%20point%20detection%20diagnostics%20in%20the%20context%20of%20hydrological%20model%20calibration&rft.jtitle=Journal%20of%20hydrology%20(Amsterdam)&rft.au=Wright,%20David%20P.&rft.date=2015-08-01&rft.volume=527&rft.spage=1161&rft.epage=1172&rft.pages=1161-1172&rft.issn=0022-1694&rft.eissn=1879-2707&rft_id=info:doi/10.1016/j.jhydrol.2015.05.047&rft_dat=%3Cproquest_cross%3E1778013744%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1746886526&rft_id=info:pmid/&rft_els_id=S0022169415003972&rfr_iscdi=true |