Optimized Predictive Coverage by Averaging Time‐Windowed Bayesian Distributions

Hydrogeological models require reliable uncertainty intervals that honestly reflect the total uncertainties of model predictions. The operation of a conventional Bayesian framework only produces realistic (interpretable in the context of the natural system) inference results if the model structure m...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Water resources research 2024-05, Vol.60 (5), p.n/a
Hauptverfasser:	Hsueh, Han‐Fang, Guthke, Anneli, Wöhling, Thomas, Nowak, Wolfgang
Format:	Artikel
Sprache:	eng
Schlagworte:	Bayesian analysis Bayesian method Bayesian theory Calibration Components Conditioning data collection Datasets data‐driven Geology Hydrogeological models Hydrogeology Intervals Mathematical models Moisture content Parameters PDF averaging prediction Predictions predictive uncertainty Probability theory Soil moisture Soil moisture models soil water Statistical analysis Statistical inference System dynamics time series time series analysis time‐windowed Bayesian Uncertainty Windows (intervals)
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Hydrogeological models require reliable uncertainty intervals that honestly reflect the total uncertainties of model predictions. The operation of a conventional Bayesian framework only produces realistic (interpretable in the context of the natural system) inference results if the model structure matches the data‐generating process, that is, applying Bayes' theorem implicitly assumes the underlying model to be true. With an imperfect model, we may obtain a too‐narrow‐for‐its‐bias uncertainty interval when conditioning on a long time‐series of calibration data, because the assumption of a quasi‐true model becomes too strict. To overcome the problem of overconfident posteriors, we propose a non‐parametric Bayesian method, called Tau‐averaging method: it applies Bayesian analysis on sliding time windows along the data time series for calibration. Thus, it obtains so‐called transitional posteriors per time window. Then, we average these into a wider predictive posterior. With the proposed routine, we explicitly capture the time‐varying impact of model error on prediction uncertainty. The length of the calibration window is optimized to maximize goal‐oriented statistical skill scores for predictive coverage. Our method loosens the perfect‐model‐assumption by conditioning only on small windows of the data set at a time, that is, it assumes that “the model is sufficient to follow the system dynamics for a smaller duration.” We test our method on two cases of soil moisture modeling and show how it improves predictive coverage as compared to the conventional Bayesian approach. Our findings demonstrate that the proposed method convincingly overcomes the overconfidence drawback of Bayesian inference under model misspecification and long calibration time‐series. Plain Language Summary Mathematical models mimic environmental systems to match what we see, and to predict what will happen. Unfortunately, such models are always simplifications of reality, balancing their complexity between manageability and accuracy. Consequently, interpreting model‐based conclusions requires caution. Assume a model has ten adjustable parameters to make it match with a system. The best‐possible achievable fit to observations is imperfect. Yet, statistical tools indicate we knew these parameters perfectly well after adjustment, especially when adjusting on long data series. Then, we might start believing that this model's adjusted predictions are perfect. We call this “overconfidence.” Way
ISSN:	0043-1397 1944-7973
DOI:	10.1029/2022WR033280