Effectiveness of Integrating Ensemble-Based Feature Selection and Novel Gradient Boosted Trees in Runoff Prediction: A Case Study in Vu Gia Thu Bon River Basin, Vietnam
Traditional rainfall-runoff modeling techniques require large datasets and often an exhaustive calibration process, which is challenging, especially in poorly-gauged basins and resource-limited settings. Therefore, it is necessary to examine new ways of constructing predictive models for runoff that...
Gespeichert in:
Veröffentlicht in: | Pure and applied geophysics 2024-05, Vol.181 (5), p.1725-1744 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Traditional rainfall-runoff modeling techniques require large datasets and often an exhaustive calibration process, which is challenging, especially in poorly-gauged basins and resource-limited settings. Therefore, it is necessary to examine new ways of constructing predictive models for runoff that can achieve satisfactory results, while also minimizing the data requirement and model construction time. In this study, the effectiveness of integrating the Random Forest (RF) as an important feature identifier with novel gradient boosted trees to achieve satisfactory results was examined for two adjacent catchments in Vietnam. Antecedent daily runoff in combination with daily and one-day antecedent rainfall was found to significantly influence the runoff at the outlet of the catchments. Categorical Boosting (CatBoost) and Extreme Gradient Boosting (XGBoost) were effective in predicting day-ahead runoff. For instance, CatBoost with NSE, d, r, and R
2
values of 0.92, 0.98, 0.96, and 0.92, respectively, and XGBoost with NSE, d, r, and R
2
values of 0.91, 0.98, 0.96, and 0.92, respectively, are well suited for predicting runoff. A comparative analysis of their results with previous studies revealed that the models were very effective since they were able to better reduce generalization errors at different calibration and validation phases. This study presents the integration of RF and gradient boosted trees as a simplified alternative to computationally expensive and data-intensive physically-based rainfall-runoff models. The practitioners can build upon the experimentation presented in this study to minimize the computational time requirement, construction process complexity, and data requirement, which are often serious constraints in physically-based rainfall-runoff modeling. |
---|---|
ISSN: | 0033-4553 1420-9136 |
DOI: | 10.1007/s00024-024-03486-0 |