Enhancing flood risk assessment through integration of ensemble learning approaches and physical-based hydrological modeling

This study aims to examine three machine learning (ML) techniques, namely random forest (RF), LightGBM, and CatBoost for flooding susceptibility maps (FSMs) in the Vietnamese Vu Gia-Thu Bon (VGTB). The results of ML are compared with those of the rainfall-runoff model, and different training dataset...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Geomatics, natural hazards and risk natural hazards and risk, 2023-12, Vol.14 (1)
Hauptverfasser:	Saber, Mohamed, Boulmaiz, Tayeb, Guermoui, Mawloud, Abdrabo, Karim I., Kantoush, Sameh A., Sumi, Tetsuya, Boutaghane, Hamouda, Hori, Tomoharu, Binh, Doan Van, Nguyen, Binh Quang, Bui, Thao T. P., Vo, Ngoc Duong, Habib, Emad, Mabrouk, Emad
Format:	Artikel
Sprache:	eng
Schlagworte:	CatBoost Data points Datasets Ensemble learning Environmental risk Flood mapping Flood risk flood susceptibility mapping Flooding Floods Hydrologic models LightGBM Machine learning Modelling Performance assessment Performance testing Precipitation Rainfall rainfall-runoff inundation model Rainfall-runoff modeling Rainfall-runoff relationships random forest Risk assessment Runoff Training
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This study aims to examine three machine learning (ML) techniques, namely random forest (RF), LightGBM, and CatBoost for flooding susceptibility maps (FSMs) in the Vietnamese Vu Gia-Thu Bon (VGTB). The results of ML are compared with those of the rainfall-runoff model, and different training dataset sizes are utilized in the performance assessment. Ten independent factors are assessed. An inventory map with approximately 850 flooding sites is based on several post-flood surveys. The inventory dataset is randomly split between training (70%) and testing (30%). The AUC-ROC results are 97.9%, 99.5%, and 99.5% for CatBoost, LightGBM, and RF, respectively. The FSMs developed by the ML methods show good agreement in terms of an extension with flood inundation maps developed using the rainfall-runoff model. The models' FSMs showed 10-13% of the total area to be highly susceptible to flooding, consistent with RRI's flood map. The FSMs show that downstream areas (both urbanized and agricultural) are under high and very high levels of susceptibility. Additionally, different sizes of the input datasets are tested to determine the least number of data points having acceptable reliability. The results demonstrate that the ML methods can realistically predict FSMs, regardless of the number of training samples.
ISSN:	1947-5705 1947-5713
DOI:	10.1080/19475705.2023.2203798