Toward an improved ensemble of multi-source daily precipitation via joint machine learning classification and regression

Accurate estimation of precipitation at local to global scales can considerably enhance our understanding of climate system dynamics. While numerous precipitation products are available as indispensable tools for investigating precipitation and its associated processes, none can consistently provide...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Atmospheric research 2024-07, Vol.304, p.107385, Article 107385
Hauptverfasser: Chen, Hao, Wang, Tiejun, Montzka, Carsten, Gao, Huiran, Guo, Ning, Chen, Xi, Vereecken, Harry
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Accurate estimation of precipitation at local to global scales can considerably enhance our understanding of climate system dynamics. While numerous precipitation products are available as indispensable tools for investigating precipitation and its associated processes, none can consistently provide the lowest estimation error across environmental conditions. The multiple source precipitation ensemble (MSPE) methods have been considered a vital solution. A new MSPE framework is proposed here, which simultaneously uses machine learning (ML) classification and regression techniques within an automatic workflow (MSPEaml). Six precipitation products and their ensembles based on different MSPE strategies were evaluated at 2365 gauged and 800 randomly selected ungauged sites over China. Results revealed significant precision inconsistencies among the products primarily due to their different data sources and retrieval algorithms; while MSPEaml can effectively reduce the random and classification errors of estimated precipitation according to the Kling-Gupta efficiency and Heidke skill score. The improvements demonstrated the unique features of MSPEaml, particularly the necessity of the joint use of ML classifiers and regressors and assigning spatiotemporal dynamic weights for merging precipitation data. Moreover, MSPEaml can substantially improve its generalizability through a simple binning procedure, making it applicable under more complex conditions. The varying contributions of predictor variables (indicated by Shapely values) in different ML models identified the complexity of the MSPE issue and further the importance of designing proper ML models according to specific targets. The proposed MSPE framework is expected to be a suitable solution for assembling multiple precipitation data sources with different time periods and scales. •A machine learning-based framework for merging multi-source daily precipitation datasets is proposed.•It is effective in producing high-precision precipitation estimates in both gauged and ungauged regions.•The new method provides enhanced generalizability through a simple binning procedure.•It is necessary to combine machine learning classification and regression for an ensemble.•Spatiotemporal dynamically weighted ensembles based on machine learning classification have great potential.
ISSN:0169-8095
DOI:10.1016/j.atmosres.2024.107385