Establishment of aerosol optical depth dataset in the Sichuan Basin by the random forest approach

The Sichuan Basin has become one of the four city clusters and heavy polluted regions in China. In this study, the random forest (RF) machine learning method and multiple datasets are used to establish aerosol optical depth (AOD) dataset in the cloudy Sichuan Basin. Multiple datasets include ground-...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Atmospheric pollution research 2022-05, Vol.13 (5), p.101394, Article 101394
Hauptverfasser: Jiang, Mengjiao, Chen, Zhihang, Yang, Yinshan, Ni, Changjian, Yang, Qi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The Sichuan Basin has become one of the four city clusters and heavy polluted regions in China. In this study, the random forest (RF) machine learning method and multiple datasets are used to establish aerosol optical depth (AOD) dataset in the cloudy Sichuan Basin. Multiple datasets include ground-based PM10 and PM2.5, the AOD from the Sun-sky radiometer Observation Network (SONET) and the Second Modern-Era Retrospective analysis for Research and Applications (MERRA-2) aerosol reanalysis, and several meteorological variables. The correlation analysis, variance inflation factor method, covariance test, and important scores are used to select variables for the model. Eight independent variables, including MERRA-2 AOD, PM10, PM2.5/PM10, low cloud cover, 2 m air temperature, relative humidity, wind direction and boundary layer height, and one dependent variable SONET AOD are selected for the model in Chengdu, the capital of Sichuan, and then extended to the Sichuan Basin. The 10-fold cross validation and statistical comparison of the Multi-Angle implementation of Atmospheric Correction (MAIAC) and the MERRA-2 AOD are conducted. Results show that the values of PM10 and PM2.5, and MERRA-2 AOD are highest at the bottom of the basin, followed by that at the edge of the basin, and the lowest at the plateau areas. Comparing with the SONET AOD, the MERRA-2 and MAIAC underestimate the AOD in the Sichuan Basin, with the linear regression slope of 0.57 and 0.74, respectively. The RF AOD shows the best accuracy with the 10-fold cross-validation correlation coefficient of 0.79, the smallest RMSE of 0.17 and MAE of 0.14. [Display omitted] •The AOD dataset in the cloudy Sichuan Basin is established Based on the random forest.•The AOD values are highest in winter, and lowest in summer.•The established RF AOD shows better accuracy and is suitable for the Sichuan Basin.
ISSN:1309-1042
1309-1042
DOI:10.1016/j.apr.2022.101394