Effectiveness of Integrating Ensemble-Based Feature Selection and Novel Gradient Boosted Trees in Runoff Prediction: A Case Study in Vu Gia Thu Bon River Basin, Vietnam

Traditional rainfall-runoff modeling techniques require large datasets and often an exhaustive calibration process, which is challenging, especially in poorly-gauged basins and resource-limited settings. Therefore, it is necessary to examine new ways of constructing predictive models for runoff that...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pure and applied geophysics 2024-05, Vol.181 (5), p.1725-1744
Hauptverfasser: Aiyelokun, Oluwatobi, Pham, Quoc Bao, Aiyelokun, Oluwafunbi, Linh, Nguyen Thi Thuy, Roy, Tirthankar, Anh, Duong Tran, Łupikasza, Ewa
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Traditional rainfall-runoff modeling techniques require large datasets and often an exhaustive calibration process, which is challenging, especially in poorly-gauged basins and resource-limited settings. Therefore, it is necessary to examine new ways of constructing predictive models for runoff that can achieve satisfactory results, while also minimizing the data requirement and model construction time. In this study, the effectiveness of integrating the Random Forest (RF) as an important feature identifier with novel gradient boosted trees to achieve satisfactory results was examined for two adjacent catchments in Vietnam. Antecedent daily runoff in combination with daily and one-day antecedent rainfall was found to significantly influence the runoff at the outlet of the catchments. Categorical Boosting (CatBoost) and Extreme Gradient Boosting (XGBoost) were effective in predicting day-ahead runoff. For instance, CatBoost with NSE, d, r, and R 2 values of 0.92, 0.98, 0.96, and 0.92, respectively, and XGBoost with NSE, d, r, and R 2 values of 0.91, 0.98, 0.96, and 0.92, respectively, are well suited for predicting runoff. A comparative analysis of their results with previous studies revealed that the models were very effective since they were able to better reduce generalization errors at different calibration and validation phases. This study presents the integration of RF and gradient boosted trees as a simplified alternative to computationally expensive and data-intensive physically-based rainfall-runoff models. The practitioners can build upon the experimentation presented in this study to minimize the computational time requirement, construction process complexity, and data requirement, which are often serious constraints in physically-based rainfall-runoff modeling.
ISSN:0033-4553
1420-9136
DOI:10.1007/s00024-024-03486-0