Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine

Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and effici...

Detailed description

Saved in:
Bibliographic details
Published in: Building and environment 2021-07, Vol. 199, p. 107879, Article 107879
Main authors: Hay Chung, Lamuel Chi; Xie, Jing; Ren, Chao
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue
container_start_page 107879
container_title Building and environment
container_volume 199
creator Hay Chung, Lamuel Chi
Xie, Jing
Ren, Chao
description Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. 
• We explored a large-scale pixel-based local climate zone mapping workflow.
• Data composition based on machine learning was tested in Google Earth Engine.
• Random forests show high accuracy and support-vector machines show high stability.
• Thermal-infrared variables influenced RF classification the most.
• Using “elite variables”, random forests offer competitive accuracy and efficiency.
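The abstract contrasts two criteria for choosing a classifier: overall accuracy and the variability of that accuracy. The following minimal sketch shows how the two quantities could be computed from confusion matrices of repeated runs. It is not the authors' workflow and does not use the Google Earth Engine API; the function names, the three-class matrices, and the labels "classifier A" and "classifier B" are all hypothetical and chosen only for illustration.

```python
from statistics import mean, stdev


def overall_accuracy(confusion):
    """Overall accuracy: correctly classified pixels divided by all pixels."""
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total


def summarize(runs):
    """Mean and sample standard deviation of overall accuracy across runs."""
    scores = [overall_accuracy(c) for c in runs]
    return mean(scores), stdev(scores)


# Hypothetical confusion matrices (rows = reference class, columns = predicted
# class), one per repeated run. These numbers are invented, not study results.
runs_a = [
    [[30, 2, 1], [3, 30, 1], [1, 2, 30]],
    [[26, 4, 3], [4, 27, 3], [3, 3, 27]],
    [[29, 2, 2], [3, 29, 2], [2, 2, 29]],
]
runs_b = [
    [[28, 3, 2], [3, 28, 3], [2, 3, 28]],
    [[28, 3, 2], [3, 27, 4], [2, 3, 28]],
    [[28, 2, 3], [3, 29, 2], [2, 3, 28]],
]

mean_a, sd_a = summarize(runs_a)
mean_b, sd_b = summarize(runs_b)
print(f"classifier A: mean OA = {mean_a:.3f}, sd = {sd_a:.3f}")
print(f"classifier B: mean OA = {mean_b:.3f}, sd = {sd_b:.3f}")
```

In this invented data, classifier A has the higher mean accuracy while classifier B varies less between runs. This is the kind of trade-off the abstract describes when it says the appropriate classifier depends on the intended use of the map.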
doi_str_mv 10.1016/j.buildenv.2021.107879
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2548693621</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0360132321002857</els_id><sourcerecordid>2548693621</sourcerecordid><originalsourceid>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</originalsourceid><addsrcrecordid>eNqFkM9KxDAQh4MouP55BQl47po03bS9KbKuguBFwVuYptPdLG1Sk-yCPoMPbcrq2dPAzO-bYT5Crjibc8blzXbe7Ezfot3Pc5bz1Cyrsj4iM16VIpNV8X5MZkxIlnGRi1NyFsKWJbAWxYx8Pw2jd3ts6QB6YyxmPYK3xq5TYxyn6jraOw091b0ZICL9chYDNZYOGL0bXW8iWAoeIdBdmBDthtEFk7JL8HFDXRPQ7yEaZ2kLESZ45dy6_wss7TrdviAnHfQBL3_rOXl7WL7eP2bPL6un-7vnTBdMxkzkDSxq1slOML4Qugbe1KzlwHjdSY1Moyyqsm3KklWcN4tOszINoWi0BC7EObk-7E2vf-wwRLV1O2_TSZUviiqZkTlPKXlIae9C8Nip0ScB_lNxpibzaqv-zKvJvDqYT-DtAcT0w96gV0EbtBpb41FH1Trz34ofmamStA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2548693621</pqid></control><display><type>article</type><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><source>Elsevier ScienceDirect Journals</source><creator>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</creator><creatorcontrib>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</creatorcontrib><description>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. 
Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. 
•We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</description><identifier>ISSN: 0360-1323</identifier><identifier>EISSN: 1873-684X</identifier><identifier>DOI: 10.1016/j.buildenv.2021.107879</identifier><language>eng</language><publisher>Oxford: Elsevier Ltd</publisher><subject>Accuracy ; Algorithms ; Classification ; Classifiers ; Computer applications ; Datasets ; Earth observation ; Environmental studies ; Google earth engine ; Greater bay area ; Land-use and land-cover ; Learning algorithms ; Local climate zone ; Machine learning ; Metropolitan areas ; Pixels ; Remote sensing ; Support vector machines ; Workflow</subject><ispartof>Building and environment, 2021-07, Vol.199, p.107879, Article 107879</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Jul 15, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</citedby><cites>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</cites><orcidid>0000-0003-2442-7984 ; 0000-0002-8494-2585 ; 0000-0003-3787-9648</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0360132321002857$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Hay Chung, Lamuel Chi</creatorcontrib><creatorcontrib>Xie, Jing</creatorcontrib><creatorcontrib>Ren, 
Chao</creatorcontrib><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><title>Building and environment</title><description>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. 
As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. •We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Computer applications</subject><subject>Datasets</subject><subject>Earth observation</subject><subject>Environmental studies</subject><subject>Google earth engine</subject><subject>Greater bay area</subject><subject>Land-use and land-cover</subject><subject>Learning algorithms</subject><subject>Local climate zone</subject><subject>Machine learning</subject><subject>Metropolitan areas</subject><subject>Pixels</subject><subject>Remote sensing</subject><subject>Support vector machines</subject><subject>Workflow</subject><issn>0360-1323</issn><issn>1873-684X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqFkM9KxDAQh4MouP55BQl47po03bS9KbKuguBFwVuYptPdLG1Sk-yCPoMPbcrq2dPAzO-bYT5Crjibc8blzXbe7Ezfot3Pc5bz1Cyrsj4iM16VIpNV8X5MZkxIlnGRi1NyFsKWJbAWxYx8Pw2jd3ts6QB6YyxmPYK3xq5TYxyn6jraOw091b0ZICL9chYDNZYOGL0bXW8iWAoeIdBdmBDthtEFk7JL8HFDXRPQ7yEaZ2kLESZ45dy6_wss7TrdviAnHfQBL3_rOXl7WL7eP2bPL6un-7vnTBdMxkzkDSxq1slOML4Qugbe1KzlwHjdSY1Moyyqsm3KklWcN4tOszINoWi0BC7EObk-7E2vf-wwRLV1O2_TSZUviiqZkTlPKXlIae9C8Nip0ScB_lNxpibzaqv-zKvJvDqYT-DtAcT0w96gV0EbtBpb41FH1Trz34ofmamStA</recordid><startdate>20210715</startdate><enddate>20210715</enddate><creator>Hay Chung, Lamuel Chi</creator><creator>Xie, Jing</creator><creator>Ren, 
Chao</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7ST</scope><scope>8FD</scope><scope>C1K</scope><scope>F28</scope><scope>FR3</scope><scope>KR7</scope><scope>SOI</scope><orcidid>https://orcid.org/0000-0003-2442-7984</orcidid><orcidid>https://orcid.org/0000-0002-8494-2585</orcidid><orcidid>https://orcid.org/0000-0003-3787-9648</orcidid></search><sort><creationdate>20210715</creationdate><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><author>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Computer applications</topic><topic>Datasets</topic><topic>Earth observation</topic><topic>Environmental studies</topic><topic>Google earth engine</topic><topic>Greater bay area</topic><topic>Land-use and land-cover</topic><topic>Learning algorithms</topic><topic>Local climate zone</topic><topic>Machine learning</topic><topic>Metropolitan areas</topic><topic>Pixels</topic><topic>Remote sensing</topic><topic>Support vector machines</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hay Chung, Lamuel Chi</creatorcontrib><creatorcontrib>Xie, Jing</creatorcontrib><creatorcontrib>Ren, Chao</creatorcontrib><collection>CrossRef</collection><collection>Environment Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ANTE: Abstracts in New Technology &amp; 
Engineering</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><collection>Environment Abstracts</collection><jtitle>Building and environment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hay Chung, Lamuel Chi</au><au>Xie, Jing</au><au>Ren, Chao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</atitle><jtitle>Building and environment</jtitle><date>2021-07-15</date><risdate>2021</risdate><volume>199</volume><spage>107879</spage><pages>107879-</pages><artnum>107879</artnum><issn>0360-1323</issn><eissn>1873-684X</eissn><abstract>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. 
In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. •We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</abstract><cop>Oxford</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.buildenv.2021.107879</doi><orcidid>https://orcid.org/0000-0003-2442-7984</orcidid><orcidid>https://orcid.org/0000-0002-8494-2585</orcidid><orcidid>https://orcid.org/0000-0003-3787-9648</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0360-1323
ispartof Building and environment, 2021-07, Vol.199, p.107879, Article 107879
issn 0360-1323
1873-684X
language eng
recordid cdi_proquest_journals_2548693621
source Elsevier ScienceDirect Journals
subjects Accuracy
Algorithms
Classification
Classifiers
Computer applications
Datasets
Earth observation
Environmental studies
Google earth engine
Greater bay area
Land-use and land-cover
Learning algorithms
Local climate zone
Machine learning
Metropolitan areas
Pixels
Remote sensing
Support vector machines
Workflow
title Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T15%3A15%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20machine-learning%20mapping%20of%20local%20climate%20zones%20in%20metropolitan%20areas%20using%20composite%20Earth%20observation%20data%20in%20Google%20Earth%20Engine&rft.jtitle=Building%20and%20environment&rft.au=Hay%20Chung,%20Lamuel%20Chi&rft.date=2021-07-15&rft.volume=199&rft.spage=107879&rft.pages=107879-&rft.artnum=107879&rft.issn=0360-1323&rft.eissn=1873-684X&rft_id=info:doi/10.1016/j.buildenv.2021.107879&rft_dat=%3Cproquest_cross%3E2548693621%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2548693621&rft_id=info:pmid/&rft_els_id=S0360132321002857&rfr_iscdi=true