Influential point detection diagnostics in the context of hydrological model calibration

•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more gen...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of hydrology (Amsterdam) 2015-08, Vol.527, p.1161-1172
Hauptverfasser: Wright, David P., Thyer, Mark, Westra, Seth
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1172
container_issue
container_start_page 1161
container_title Journal of hydrology (Amsterdam)
container_volume 527
creator Wright, David P.
Thyer, Mark
Westra, Seth
description •Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general. Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hy
doi_str_mv 10.1016/j.jhydrol.2015.05.047
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1778013744</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0022169415003972</els_id><sourcerecordid>1778013744</sourcerecordid><originalsourceid>FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</originalsourceid><addsrcrecordid>eNqNUE1LJDEQDbKCs-pPEHLcS4-pTneSPomIXyDsZQVvIZ1Ua4aeZEwyov_ezI53LR5UQb33inqEnAFbAgNxvlquXj5civOyZdAvWUUnD8gClByaVjL5iywYa9sGxNAdkd85r1gtzrsFeboP07zFULyZ6Sb6UKjDgrb4GKjz5jnEXLzN1AdaXpDaGAq-Fxonur8Zn72t0nV0ONM6-TGZnfiEHE5mznj61Y_J4831v6u75uHv7f3V5UNj-KBK07dqGiZrgQvE1nGYHBgDIxc9l2awdTWqEUAKMUrHeiVdL4xFCaDc1Ap-TP7sfTcpvm4xF7322eI8m4BxmzVIqRhw2XU_oHZCKdH_d-33VJtizgknvUl-bdKHBqZ3oeuV_gpd70LXrKKTVXex12F9-c1j0tl6DBadTzVU7aL_xuETleKPYg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1746886526</pqid></control><display><type>article</type><title>Influential point detection diagnostics in the context of hydrological model calibration</title><source>Elsevier ScienceDirect Journals</source><creator>Wright, David P. ; Thyer, Mark ; Westra, Seth</creator><creatorcontrib>Wright, David P. ; Thyer, Mark ; Westra, Seth</creatorcontrib><description>•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general. Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hydrological model calibration.</description><identifier>ISSN: 0022-1694</identifier><identifier>EISSN: 1879-2707</identifier><identifier>DOI: 10.1016/j.jhydrol.2015.05.047</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Calibration ; Case-deletion ; Catchments ; Cook’s distance ; Diagnostic systems ; Heating ; Hydrologic calibration ; Hydrology models ; Influence diagnostics ; Mahalanobis distance ; Mathematical analysis ; Mathematical models ; Nonlinearity</subject><ispartof>Journal of hydrology (Amsterdam), 2015-08, Vol.527, p.1161-1172</ispartof><rights>2015 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</citedby><cites>FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jhydrol.2015.05.047$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27907,27908,45978</link.rule.ids></links><search><creatorcontrib>Wright, David P.</creatorcontrib><creatorcontrib>Thyer, Mark</creatorcontrib><creatorcontrib>Westra, Seth</creatorcontrib><title>Influential point detection diagnostics in the context of hydrological model calibration</title><title>Journal of hydrology (Amsterdam)</title><description>•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general. Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hydrological model calibration.</description><subject>Calibration</subject><subject>Case-deletion</subject><subject>Catchments</subject><subject>Cook’s distance</subject><subject>Diagnostic systems</subject><subject>Heating</subject><subject>Hydrologic calibration</subject><subject>Hydrology models</subject><subject>Influence diagnostics</subject><subject>Mahalanobis distance</subject><subject>Mathematical analysis</subject><subject>Mathematical models</subject><subject>Nonlinearity</subject><issn>0022-1694</issn><issn>1879-2707</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNqNUE1LJDEQDbKCs-pPEHLcS4-pTneSPomIXyDsZQVvIZ1Ua4aeZEwyov_ezI53LR5UQb33inqEnAFbAgNxvlquXj5civOyZdAvWUUnD8gClByaVjL5iywYa9sGxNAdkd85r1gtzrsFeboP07zFULyZ6Sb6UKjDgrb4GKjz5jnEXLzN1AdaXpDaGAq-Fxonur8Zn72t0nV0ONM6-TGZnfiEHE5mznj61Y_J4831v6u75uHv7f3V5UNj-KBK07dqGiZrgQvE1nGYHBgDIxc9l2awdTWqEUAKMUrHeiVdL4xFCaDc1Ap-TP7sfTcpvm4xF7322eI8m4BxmzVIqRhw2XU_oHZCKdH_d-33VJtizgknvUl-bdKHBqZ3oeuV_gpd70LXrKKTVXex12F9-c1j0tl6DBadTzVU7aL_xuETleKPYg</recordid><startdate>20150801</startdate><enddate>20150801</enddate><creator>Wright, David P.</creator><creator>Thyer, Mark</creator><creator>Westra, Seth</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QH</scope><scope>7ST</scope><scope>7TG</scope><scope>7UA</scope><scope>C1K</scope><scope>F1W</scope><scope>H96</scope><scope>KL.</scope><scope>L.G</scope><scope>SOI</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope></search><sort><creationdate>20150801</creationdate><title>Influential point detection diagnostics in the context of hydrological model calibration</title><author>Wright, David P. ; Thyer, Mark ; Westra, Seth</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a398t-528f9fcc136ee2d31fd1aa1b36537a9cfccb8b11766b7d0587d56ace7118df263</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Calibration</topic><topic>Case-deletion</topic><topic>Catchments</topic><topic>Cook’s distance</topic><topic>Diagnostic systems</topic><topic>Heating</topic><topic>Hydrologic calibration</topic><topic>Hydrology models</topic><topic>Influence diagnostics</topic><topic>Mahalanobis distance</topic><topic>Mathematical analysis</topic><topic>Mathematical models</topic><topic>Nonlinearity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wright, David P.</creatorcontrib><creatorcontrib>Thyer, Mark</creatorcontrib><creatorcontrib>Westra, Seth</creatorcontrib><collection>CrossRef</collection><collection>Aqualine</collection><collection>Environment Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Water Resources Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><collection>Environment Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Journal of hydrology (Amsterdam)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wright, David P.</au><au>Thyer, Mark</au><au>Westra, Seth</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Influential point detection diagnostics in the context of hydrological model calibration</atitle><jtitle>Journal of hydrology (Amsterdam)</jtitle><date>2015-08-01</date><risdate>2015</risdate><volume>527</volume><spage>1161</spage><epage>1172</epage><pages>1161-1172</pages><issn>0022-1694</issn><eissn>1879-2707</eissn><abstract>•Evaluating influence of individual data points is rare in hydrological modelling.•Highly influential points can change mean/max predictions by 13/25%.•Comparing numerical/analytical approaches identified similar influential points.•Analytical approaches are faster, numerical approaches are more general. Influential data are those that have a disproportionate impact on model performance, parameters and/or predictions. This paper evaluates two classes of diagnostics that identify influential data for hydrological model calibration: (1) numerical “case-deletion” diagnostics, which directly measure the influence of each data point on the calibrated model; and (2) analytical diagnostics based on Cook’s distance, which combine information on the model residuals with a measure of the distance of each input point from the centre of the range of the input data (i.e., the leverage). Case-deletion methods rank influence by changes in the model parameters (measured through the Mahalanobis distance), performance (using objective function displacement) and predictions (e.g. mean and maximum streamflow). For the analytical methods, both linear and nonlinear estimates of leverage are used to calculate Cook’s distance, which is used to rank influential data. We apply these diagnostics to three case studies and show that a single point could change mean/maximum streamflow predictions by 7%/9% for a rating curve model, and 13%/25%, for a hydrological model (GR4J) in an ephemeral catchment. In contrast, the influence was far less for GR4J in a humid catchment (0.2%/2.3%). Assuming the data are of high quality this indicates deficiencies in the ability of the GR4J model structure to reproduce the flow regime in the ephemeral catchment. The linear Cook’s distance-based metric produced reasonably similar rankings to the case-deletion metrics at a fraction of the computational cost (300–1000 times faster), but with less flexibility to rank influence using specific aspects of model behaviour. The nonlinear distance produced rankings that were virtually the same as the case-deletion metrics for all case studies – this highlights the importance of its use for nonlinear hydrological models. Visual assessment was not a reliable method of influence analysis as there was no direct relationship between the most influential data and the highest observed streamflows. The findings establish the feasibility and importance of including influence detection diagnostics as a standard tool in hydrological model calibration.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.jhydrol.2015.05.047</doi><tpages>12</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0022-1694
ispartof Journal of hydrology (Amsterdam), 2015-08, Vol.527, p.1161-1172
issn 0022-1694
1879-2707
language eng
recordid cdi_proquest_miscellaneous_1778013744
source Elsevier ScienceDirect Journals
subjects Calibration
Case-deletion
Catchments
Cook’s distance
Diagnostic systems
Heating
Hydrologic calibration
Hydrology models
Influence diagnostics
Mahalanobis distance
Mathematical analysis
Mathematical models
Nonlinearity
title Influential point detection diagnostics in the context of hydrological model calibration
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T07%3A11%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Influential%20point%20detection%20diagnostics%20in%20the%20context%20of%20hydrological%20model%20calibration&rft.jtitle=Journal%20of%20hydrology%20(Amsterdam)&rft.au=Wright,%20David%20P.&rft.date=2015-08-01&rft.volume=527&rft.spage=1161&rft.epage=1172&rft.pages=1161-1172&rft.issn=0022-1694&rft.eissn=1879-2707&rft_id=info:doi/10.1016/j.jhydrol.2015.05.047&rft_dat=%3Cproquest_cross%3E1778013744%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1746886526&rft_id=info:pmid/&rft_els_id=S0022169415003972&rfr_iscdi=true