Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine

Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and effici...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Building and environment 2021-07, Vol.199, p.107879, Article 107879
Hauptverfasser:	Hay Chung, Lamuel Chi, Xie, Jing, Ren, Chao
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Algorithms Classification Classifiers Computer applications Datasets Earth observation Environmental studies Google earth engine Greater bay area Land-use and land-cover Learning algorithms Local climate zone Machine learning Metropolitan areas Pixels Remote sensing Support vector machines Workflow
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page	107879
container_title	Building and environment
container_volume	199
creator	Hay Chung, Lamuel Chi Xie, Jing Ren, Chao
description	Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. •We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.
doi_str_mv	10.1016/j.buildenv.2021.107879
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2548693621</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0360132321002857</els_id><sourcerecordid>2548693621</sourcerecordid><originalsourceid>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</originalsourceid><addsrcrecordid>eNqFkM9KxDAQh4MouP55BQl47po03bS9KbKuguBFwVuYptPdLG1Sk-yCPoMPbcrq2dPAzO-bYT5Crjibc8blzXbe7Ezfot3Pc5bz1Cyrsj4iM16VIpNV8X5MZkxIlnGRi1NyFsKWJbAWxYx8Pw2jd3ts6QB6YyxmPYK3xq5TYxyn6jraOw091b0ZICL9chYDNZYOGL0bXW8iWAoeIdBdmBDthtEFk7JL8HFDXRPQ7yEaZ2kLESZ45dy6_wss7TrdviAnHfQBL3_rOXl7WL7eP2bPL6un-7vnTBdMxkzkDSxq1slOML4Qugbe1KzlwHjdSY1Moyyqsm3KklWcN4tOszINoWi0BC7EObk-7E2vf-wwRLV1O2_TSZUviiqZkTlPKXlIae9C8Nip0ScB_lNxpibzaqv-zKvJvDqYT-DtAcT0w96gV0EbtBpb41FH1Trz34ofmamStA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2548693621</pqid></control><display><type>article</type><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><source>Elsevier ScienceDirect Journals</source><creator>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</creator><creatorcontrib>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</creatorcontrib><description>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. •We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</description><identifier>ISSN: 0360-1323</identifier><identifier>EISSN: 1873-684X</identifier><identifier>DOI: 10.1016/j.buildenv.2021.107879</identifier><language>eng</language><publisher>Oxford: Elsevier Ltd</publisher><subject>Accuracy ; Algorithms ; Classification ; Classifiers ; Computer applications ; Datasets ; Earth observation ; Environmental studies ; Google earth engine ; Greater bay area ; Land-use and land-cover ; Learning algorithms ; Local climate zone ; Machine learning ; Metropolitan areas ; Pixels ; Remote sensing ; Support vector machines ; Workflow</subject><ispartof>Building and environment, 2021-07, Vol.199, p.107879, Article 107879</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Jul 15, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</citedby><cites>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</cites><orcidid>0000-0003-2442-7984 ; 0000-0002-8494-2585 ; 0000-0003-3787-9648</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0360132321002857$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Hay Chung, Lamuel Chi</creatorcontrib><creatorcontrib>Xie, Jing</creatorcontrib><creatorcontrib>Ren, Chao</creatorcontrib><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><title>Building and environment</title><description>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. •We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Computer applications</subject><subject>Datasets</subject><subject>Earth observation</subject><subject>Environmental studies</subject><subject>Google earth engine</subject><subject>Greater bay area</subject><subject>Land-use and land-cover</subject><subject>Learning algorithms</subject><subject>Local climate zone</subject><subject>Machine learning</subject><subject>Metropolitan areas</subject><subject>Pixels</subject><subject>Remote sensing</subject><subject>Support vector machines</subject><subject>Workflow</subject><issn>0360-1323</issn><issn>1873-684X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqFkM9KxDAQh4MouP55BQl47po03bS9KbKuguBFwVuYptPdLG1Sk-yCPoMPbcrq2dPAzO-bYT5Crjibc8blzXbe7Ezfot3Pc5bz1Cyrsj4iM16VIpNV8X5MZkxIlnGRi1NyFsKWJbAWxYx8Pw2jd3ts6QB6YyxmPYK3xq5TYxyn6jraOw091b0ZICL9chYDNZYOGL0bXW8iWAoeIdBdmBDthtEFk7JL8HFDXRPQ7yEaZ2kLESZ45dy6_wss7TrdviAnHfQBL3_rOXl7WL7eP2bPL6un-7vnTBdMxkzkDSxq1slOML4Qugbe1KzlwHjdSY1Moyyqsm3KklWcN4tOszINoWi0BC7EObk-7E2vf-wwRLV1O2_TSZUviiqZkTlPKXlIae9C8Nip0ScB_lNxpibzaqv-zKvJvDqYT-DtAcT0w96gV0EbtBpb41FH1Trz34ofmamStA</recordid><startdate>20210715</startdate><enddate>20210715</enddate><creator>Hay Chung, Lamuel Chi</creator><creator>Xie, Jing</creator><creator>Ren, Chao</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7ST</scope><scope>8FD</scope><scope>C1K</scope><scope>F28</scope><scope>FR3</scope><scope>KR7</scope><scope>SOI</scope><orcidid>https://orcid.org/0000-0003-2442-7984</orcidid><orcidid>https://orcid.org/0000-0002-8494-2585</orcidid><orcidid>https://orcid.org/0000-0003-3787-9648</orcidid></search><sort><creationdate>20210715</creationdate><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><author>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Computer applications</topic><topic>Datasets</topic><topic>Earth observation</topic><topic>Environmental studies</topic><topic>Google earth engine</topic><topic>Greater bay area</topic><topic>Land-use and land-cover</topic><topic>Learning algorithms</topic><topic>Local climate zone</topic><topic>Machine learning</topic><topic>Metropolitan areas</topic><topic>Pixels</topic><topic>Remote sensing</topic><topic>Support vector machines</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hay Chung, Lamuel Chi</creatorcontrib><creatorcontrib>Xie, Jing</creatorcontrib><creatorcontrib>Ren, Chao</creatorcontrib><collection>CrossRef</collection><collection>Environment Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><collection>Environment Abstracts</collection><jtitle>Building and environment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hay Chung, Lamuel Chi</au><au>Xie, Jing</au><au>Ren, Chao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</atitle><jtitle>Building and environment</jtitle><date>2021-07-15</date><risdate>2021</risdate><volume>199</volume><spage>107879</spage><pages>107879-</pages><artnum>107879</artnum><issn>0360-1323</issn><eissn>1873-684X</eissn><abstract>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally. •We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</abstract><cop>Oxford</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.buildenv.2021.107879</doi><orcidid>https://orcid.org/0000-0003-2442-7984</orcidid><orcidid>https://orcid.org/0000-0002-8494-2585</orcidid><orcidid>https://orcid.org/0000-0003-3787-9648</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0360-1323
ispartof	Building and environment, 2021-07, Vol.199, p.107879, Article 107879
issn	0360-1323 1873-684X
language	eng
recordid	cdi_proquest_journals_2548693621
source	Elsevier ScienceDirect Journals
subjects	Accuracy Algorithms Classification Classifiers Computer applications Datasets Earth observation Environmental studies Google earth engine Greater bay area Land-use and land-cover Learning algorithms Local climate zone Machine learning Metropolitan areas Pixels Remote sensing Support vector machines Workflow
title	Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T15%3A15%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20machine-learning%20mapping%20of%20local%20climate%20zones%20in%20metropolitan%20areas%20using%20composite%20Earth%20observation%20data%20in%20Google%20Earth%20Engine&rft.jtitle=Building%20and%20environment&rft.au=Hay%20Chung,%20Lamuel%20Chi&rft.date=2021-07-15&rft.volume=199&rft.spage=107879&rft.pages=107879-&rft.artnum=107879&rft.issn=0360-1323&rft.eissn=1873-684X&rft_id=info:doi/10.1016/j.buildenv.2021.107879&rft_dat=%3Cproquest_cross%3E2548693621%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2548693621&rft_id=info:pmid/&rft_els_id=S0360132321002857&rfr_iscdi=true