Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine

Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and effici...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Building and environment 2021-07, Vol.199, p.107879, Article 107879
Hauptverfasser: Hay Chung, Lamuel Chi, Xie, Jing, Ren, Chao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. •We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.
ISSN:0360-1323
1873-684X
DOI:10.1016/j.buildenv.2021.107879