Estimation of the soil arsenic concentration using a geographically weighted XGBoost model based on hyperspectral data


Bibliographic details
Published in: The Science of the total environment, 2023-02, Vol. 858, p. 159798, Article 159798
Main authors: Ye, Miao; Zhu, Lin; Li, Xiaojuan; Ke, Yinghai; Huang, Yong; Chen, Beibei; Yu, Huilin; Li, Huan; Feng, Hui
Format: Article
Language: English
Description
Summary: Considering the high toxicity of arsenic (As), its contamination of soil represents an alarming environmental and public health issue. Existing soil heavy metal concentration estimation models based on hyperspectral data ignore the spatial nonstationarity of the relationship between the soil spectrum and heavy metal concentration. A novel model (the geographically weighted eXtreme gradient boosting, or GW-XGBoost, model) combining the geographically weighted regression (GWR) method with the XGBoost algorithm is proposed. The northeast district of Beijing, China, was chosen as a case study area to assess the effectiveness of the proposed model. The GW-XGBoost model was established to estimate the As concentration based on the typical spectrum of As and the spatial correlation between the spectrum and As concentration obtained using the GWR method, and the result was compared with those obtained with the XGBoost and GWR models. The GW-XGBoost model was clearly more accurate than the other models (R² = 0.90 for GW-XGBoost, 0.48 for XGBoost, and 0.74 for GWR). Therefore, the proposed model is reliable, as it considers the spatial correlation between the spectrum and As concentration.
• Spatial correlation was integrated with the XGBoost algorithm to improve the soil arsenic concentration estimation accuracy.
• The adsorption mechanism of spectrally active substances was considered to select the typical spectrum of arsenic.
• Spectrum transformation could be used to highlight the spectral absorption features of soil elements.
ISSN:0048-9697
DOI:10.1016/j.scitotenv.2022.159798
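
The summary says that GW-XGBoost feeds a GWR-derived spatial correlation into XGBoost alongside the spectrum, but it does not say how the two are coupled. The following is a minimal sketch of one possible reading, not the authors' implementation: a geographically weighted (Gaussian-kernel) least-squares fit is run at each sample location, and the resulting local intercept and slope are appended to the spectral feature. The function names, the single spectral feature, the bandwidth, and the synthetic data are all assumptions made for illustration.

```python
import math

def gaussian_weights(target, coords, bandwidth):
    """Gaussian kernel weight of every sample relative to one target location."""
    tx, ty = target
    return [math.exp(-0.5 * (math.hypot(cx - tx, cy - ty) / bandwidth) ** 2)
            for cx, cy in coords]

def local_fit(target, coords, x, y, bandwidth):
    """Weighted least squares y ~ a + b*x centred on `target`; returns (a, b)."""
    w = gaussian_weights(target, coords, bandwidth)
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw   # weighted mean of x
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw   # weighted mean of y
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx if sxx > 0 else 0.0
    return my - b * mx, b

def add_local_coefficients(coords, x, y, bandwidth):
    """Build rows [spectral value, local intercept, local slope] per sample."""
    rows = []
    for c, xi in zip(coords, x):
        a, b = local_fit(c, coords, x, y, bandwidth)
        rows.append([xi, a, b])
    return rows

def r_squared(observed, predicted):
    """Coefficient of determination, the accuracy measure quoted in the summary."""
    mean = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Synthetic transect (hypothetical data): the spectrum-to-As slope grows eastward,
# which is the kind of spatial nonstationarity a single global model would miss.
coords = [(float(i), 0.0) for i in range(10)]
x = [0.1, 0.4, 0.2, 0.5, 0.3, 0.6, 0.2, 0.7, 0.4, 0.8]   # spectral feature
y = [(1.0 + 0.5 * i) * xi for i, xi in enumerate(x)]      # As concentration
rows = add_local_coefficients(coords, x, y, bandwidth=2.0)
```

Under this reading, `rows` would then be passed as the feature matrix to a gradient-boosting regressor such as `xgboost.XGBRegressor`. For unsampled locations the local fit would have to use training samples only, so that the target value being predicted does not leak into its own features.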