Ensemble averaging using remote sensing data to model spatiotemporal PM10 concentrations in sparsely monitored South Africa

There is a paucity of air quality data in sub-Saharan African countries to inform science driven air quality management and epidemiological studies. We investigated the use of available remote-sensing aerosol optical depth (AOD) data to develop spatially and temporally resolved models to predict dai...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Environmental pollution (1987) 2022-10, Vol.310, p.119883-119883, Article 119883
Hauptverfasser: Arowosegbe, Oluwaseyi Olalekan, Röösli, Martin, Künzli, Nino, Saucy, Apolline, Adebayo-Ojo, Temitope C., Schwartz, Joel, Kebalepile, Moses, Jeebhay, Mohamed Fareed, Dalvie, Mohamed Aqiel, de Hoogh, Kees
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:There is a paucity of air quality data in sub-Saharan African countries to inform science driven air quality management and epidemiological studies. We investigated the use of available remote-sensing aerosol optical depth (AOD) data to develop spatially and temporally resolved models to predict daily particulate matter (PM10) concentrations across four provinces of South Africa (Gauteng, Mpumalanga, KwaZulu-Natal and Western Cape) for the year 2016 in a two-staged approach. In stage 1, a Random Forest (RF) model was used to impute Multiangle Implementation of Atmospheric Correction AOD data for days where it was missing. In stage 2, the machine learner algorithms RF, Gradient Boosting and Support Vector Regression were used to model the relationship between ground-monitored PM10 data, AOD and other spatial and temporal predictors. These were subsequently combined in an ensemble model to predict daily PM10 concentrations at 1 km × 1 km spatial resolution across the four provinces. An out-of-bag R2 of 0.96 was achieved for the first stage model. The stage 2 cross-validated (CV) ensemble model captured 0.84 variability in ground-monitored PM10 with a spatial CV R2 of 0.48 and temporal CV R2 of 0.80. The stage 2 model indicated an optimal performance of the daily predictions when aggregated to monthly and annual means. Our results suggest that a combination of remote sensing data, chemical transport model estimates and other spatiotemporal predictors has the potential to improve air quality exposure data in South Africa's major industrial provinces. In particular, the use of a combined ensemble approach was found to be useful for this area with limited availability of air pollution ground monitoring data. [Display omitted] •Ensemble averaging outperformed the individual machine learner's performances.•Ensemble averaging explained 81% of overall variation in PM10 concentrations.•Modelled PM10 as a predictor was more influential than satellite-derived PM10.
ISSN:0269-7491
1873-6424
DOI:10.1016/j.envpol.2022.119883