A comparison of statistical and machine learning models for spatio-temporal prediction of ambient air pollutant concentrations in Scotland

Bibliographic details
Published in:	Environmental and ecological statistics 2024-12, Vol.31 (4), p.1085-1108
Main authors:	Zhu, Qiangqiang; Lee, Duncan; Stoner, Oliver
Format:	Article
Language:	English
Subjects:	air pollutants; Air pollution; Algorithms; Biomedical and Life Sciences; Chemistry and Earth Sciences; compliance; Computer Science; Ecology; Epidemiology; Health Sciences; Land use; Learning algorithms; Life Sciences; Machine learning; Math. Appl. in Environmental Science; Medicine; Particulate matter; Physics; Pollutants; prediction; Predictions; Regression analysis; Regression models; Regulatory compliance; Scotland; Statistical analysis; Statistical models; Statistics for Engineering; Statistics for Life Sciences; Theoretical Ecology/Statistics; uncertainty
Online access:	Full text

Description
Abstract:	The spatio-temporal prediction of air pollutant concentrations is vital for assessing regulatory compliance and for producing exposure estimates in epidemiological studies. Numerous approaches have been utilised for making such predictions, including land use regression models, additive models, spatio-temporal smoothing models and machine learning prediction algorithms. However, relatively few studies have compared the predictive performance of these models thoroughly, which is one of the novel contributions of this paper. For the specific challenge of predicting monthly average concentrations of NO₂, PM₁₀ and PM₂.₅ in Scotland, we find that random forests typically outperform (or are as good as) more traditional statistical prediction approaches. Additionally, we utilise the best performing model to provide a new data resource, namely, predictions of monthly average concentrations (with uncertainty quantification) of the above pollutants on a regular 1 km² grid for all of Scotland between 2016 and 2020.
ISSN:	1352-8505 1573-3009
DOI:	10.1007/s10651-024-00635-5