Establishment of aerosol optical depth dataset in the Sichuan Basin by the random forest approach
Published in: | Atmospheric pollution research 2022-05, Vol.13 (5), p.101394, Article 101394 |
---|---|
Main authors: | , , , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Summary: | The Sichuan Basin has become one of the four city clusters and heavily polluted regions in China. In this study, the random forest (RF) machine learning method and multiple datasets are used to establish an aerosol optical depth (AOD) dataset for the cloudy Sichuan Basin. The datasets include ground-based PM10 and PM2.5, the AOD from the Sun-sky radiometer Observation Network (SONET) and from the Second Modern-Era Retrospective analysis for Research and Applications (MERRA-2) aerosol reanalysis, and several meteorological variables. Correlation analysis, the variance inflation factor method, a covariance test, and importance scores are used to select variables for the model. Eight independent variables (MERRA-2 AOD, PM10, PM2.5/PM10, low cloud cover, 2 m air temperature, relative humidity, wind direction and boundary layer height) and one dependent variable (SONET AOD) are selected for the model in Chengdu, the capital of Sichuan, and the model is then extended to the Sichuan Basin. A 10-fold cross-validation and a statistical comparison with the Multi-Angle Implementation of Atmospheric Correction (MAIAC) and MERRA-2 AOD are conducted. Results show that the values of PM10, PM2.5 and MERRA-2 AOD are highest at the bottom of the basin, followed by those at the edge of the basin, and lowest in the plateau areas. Compared with the SONET AOD, MERRA-2 and MAIAC underestimate the AOD in the Sichuan Basin, with linear regression slopes of 0.57 and 0.74, respectively. The RF AOD shows the best accuracy, with a 10-fold cross-validation correlation coefficient of 0.79, the smallest RMSE of 0.17 and an MAE of 0.14.
• The AOD dataset in the cloudy Sichuan Basin is established based on the random forest method. • The AOD values are highest in winter and lowest in summer. • The established RF AOD shows better accuracy and is suitable for the Sichuan Basin. |
---|---|
ISSN: | 1309-1042 |
DOI: | 10.1016/j.apr.2022.101394 |
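
The summary reports 10-fold cross-validation results as a regression slope, a correlation coefficient, RMSE and MAE. The sketch below shows how such scores can be computed; it is not the authors' code. To stay dependency-free it swaps in a one-variable least-squares line where the study uses a random forest, and every name (`k_fold_indices`, `fit_line`, `cross_validate`, `evaluate`) and every number is hypothetical, synthetic data made up for illustration.

```python
import math
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle the sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_line(x, y):
    """Least-squares line y = slope * x + intercept (stand-in for the RF model)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def cross_validate(x, y, k=10, seed=0):
    """Predict every sample from a model trained on the other k-1 folds."""
    predictions = [0.0] * len(x)
    for fold in k_fold_indices(len(x), k, seed):
        held_out = set(fold)
        train = [i for i in range(len(x)) if i not in held_out]
        slope, intercept = fit_line([x[i] for i in train], [y[i] for i in train])
        for i in fold:
            predictions[i] = slope * x[i] + intercept
    return predictions


def evaluate(reference, estimate):
    """Slope, Pearson r, RMSE and MAE of an estimate against a reference."""
    n = len(reference)
    mean_x, mean_y = sum(reference) / n, sum(estimate) / n
    sxx = sum((a - mean_x) ** 2 for a in reference)
    syy = sum((b - mean_y) ** 2 for b in estimate)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(reference, estimate))
    errors = [b - a for a, b in zip(reference, estimate)]
    return {
        "slope": sxy / sxx,  # regression slope of estimate on reference
        "r": sxy / math.sqrt(sxx * syy),
        "rmse": math.sqrt(sum(e * e for e in errors) / n),
        "mae": sum(abs(e) for e in errors) / n,
    }


# Synthetic illustration only: these values are NOT from the study.
rng = random.Random(42)
predictor_aod = [rng.uniform(0.1, 1.0) for _ in range(200)]  # coarse AOD input
target_aod = [1.5 * p + rng.gauss(0.0, 0.1) for p in predictor_aod]  # "ground" AOD

cv_predictions = cross_validate(predictor_aod, target_aod, k=10)
cv_scores = evaluate(target_aod, cv_predictions)
```

Pooling the held-out predictions before scoring, as done here, yields one set of statistics for the whole dataset; scoring each fold separately and averaging is an equally common choice, and the summary does not say which one the study used.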