Soil erosion susceptibility mapping using ensemble machine learning models: A case study of upper Congo river sub-basin

•Soil erosion susceptibility mapping in the Elila catchment (Congo) was produced.•Four machine learning algorithms (ML-AL) RF-SVM, RF-BRT, RF-LB, and RF-KNN were implemented.•Elevation, rainfall, and slope were among the highest contributed factors in soil erosion.•The RF-BRT had the highest perform...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Catena (Giessen) 2023-03, Vol.222, p.106858, Article 106858
Hauptverfasser: Cimusa Kulimushi, Luc, Bigabwa Bashagaluke, Janvier, Prasad, Pankaj, Heri-Kazi, Aimé B., Lal Kushwaha, Nand, Masroor, Md, Choudhari, Pandurang, Elbeltagi, Ahmed, Sajjad, Haroon, Mohammed, Safwan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Soil erosion susceptibility mapping in the Elila catchment (Congo) was produced.•Four machine learning algorithms (ML-AL) RF-SVM, RF-BRT, RF-LB, and RF-KNN were implemented.•Elevation, rainfall, and slope were among the highest contributed factors in soil erosion.•The RF-BRT had the highest performance in predicting soil erosion.•The ML-AL accuracy: RF-BRT > RF-SVM > RF-KNN > RF-LB. Despite its large size, the Congo Basin (CB), which spans ten countries, has remained an area of particular interest for scientific discovery due to gaps in Earth science, environmental and hydrological research. This includes mapping soil erosion, which is the dominant form of land degradation in the basin, particularly severe in its upper part. For these reasons, the use of predictive machine learning algorithms (ML-ALs) rather than conventional models is necessary. In particular, the introduction of the ensemble model which is becoming widely popular, but is still little used throughout Africa. In this study, ensemble ML-ALs were applied and aimed at evaluating the predictive power of combining different algorithms such as Support Vector Machine (SVM), Boosted Regression Trees (BRT), Logit Boost (LB) and K-Nearest Neighbor (KNN) with Random Forest (RF) as the base classifier for erosion susceptibility mapping (ESM) in the Elila catchment located in the Upper Congo sub-basin. To achieve this goal, 500 erosion sites were identified by the RUSLE model and Google Earth historical maps and then a field survey was conducted to validate the identified erosion sites. The input data were randomly divided into training and test datasets. Fifteen important features were selected using Boruta's approach to produce the ESMs. The accuracy of the models was evaluated using the area under the receiver operating characteristic curve (AUROC) and four statistical measures (sensitivity, accuracy, specificity and kappa index). The overall accuracy in terms of AUROC values shows that RF-BRT (87.26%) was superior to all other algorithms, followed by RF-SVM, RF-KNN, and RF-LB with AUROC values ranging from 86.51%, and 85.37%, to 84.21%, respectively. In conclusion, the results of this work can be used to control and prevent erosion throughout the CB, and the methodology of this work can be useful in similar geo-environmental characteristic sites.
ISSN:0341-8162
1872-6887
DOI:10.1016/j.catena.2022.106858