Classification of Particulate Matter (PM2.5) Concentrations Using Feature Selection and Machine Learning Strategies

This research employed machine learning approaches to classify acceptable or non-acceptable particulate matter (PM ) concentrations using a dataset that was obtained from the Nairobi expressway road corridor. The dataset contained air quality data, traffic volume, and meteorological data. The Boruta...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:LOGI - scientific journal on transport and logistics 2024-01, Vol.15 (1), p.85-96
Hauptverfasser: Matara, Caroline Mongina, Nyambane, Simpson Osano, Yusuf, Amir Okeyo, Ochungo, Elisha Akech, Khattak, Afaq
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This research employed machine learning approaches to classify acceptable or non-acceptable particulate matter (PM ) concentrations using a dataset that was obtained from the Nairobi expressway road corridor. The dataset contained air quality data, traffic volume, and meteorological data. The Boruta Algorithm (BA) was utilized in conjunction with the Random Forests (RF) classifier to select the most appropriate features from the dataset. The findings of the BA analysis indicated that humidity was the most influential factor in determining air quality. This was closely followed by the variables of ‘day_of_week’ and the volume of traffic bound for Nairobi. The temperature of the site was determined to have a lower significance. The comparison among different machine learning classifiers for the classification of acceptable and unacceptable PM concentrations revealed that the Extreme Gradient Boosting (XGBoost) classifier displayed superior performance in terms of Sensitivity (0.774), Specificity (0.943), F1-Score (0.833), and AU-ROC (0.874). The Binary Logistic Regression (BLR) model demonstrated comparatively poorer performance in terms of Sensitivity (0.244), Specificity (0.614), F1-Score (0.455), and AU-ROC (0.508) when compared to other ML models. The prediction of PM has the potential to provide valuable insights to transport policymakers in their deliberations on urban transport policy formulation.
ISSN:2336-3037
2336-3037
DOI:10.2478/logi-2024-0008