Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine
Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and effici...
Gespeichert in:
Veröffentlicht in: | Building and environment 2021-07, Vol.199, p.107879, Article 107879 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 107879 |
container_title | Building and environment |
container_volume | 199 |
creator | Hay Chung, Lamuel Chi Xie, Jing Ren, Chao |
description | Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally.
•We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency. |
doi_str_mv | 10.1016/j.buildenv.2021.107879 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2548693621</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0360132321002857</els_id><sourcerecordid>2548693621</sourcerecordid><originalsourceid>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</originalsourceid><addsrcrecordid>eNqFkM9KxDAQh4MouP55BQl47po03bS9KbKuguBFwVuYptPdLG1Sk-yCPoMPbcrq2dPAzO-bYT5Crjibc8blzXbe7Ezfot3Pc5bz1Cyrsj4iM16VIpNV8X5MZkxIlnGRi1NyFsKWJbAWxYx8Pw2jd3ts6QB6YyxmPYK3xq5TYxyn6jraOw091b0ZICL9chYDNZYOGL0bXW8iWAoeIdBdmBDthtEFk7JL8HFDXRPQ7yEaZ2kLESZ45dy6_wss7TrdviAnHfQBL3_rOXl7WL7eP2bPL6un-7vnTBdMxkzkDSxq1slOML4Qugbe1KzlwHjdSY1Moyyqsm3KklWcN4tOszINoWi0BC7EObk-7E2vf-wwRLV1O2_TSZUviiqZkTlPKXlIae9C8Nip0ScB_lNxpibzaqv-zKvJvDqYT-DtAcT0w96gV0EbtBpb41FH1Trz34ofmamStA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2548693621</pqid></control><display><type>article</type><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><source>Elsevier ScienceDirect Journals</source><creator>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</creator><creatorcontrib>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</creatorcontrib><description>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally.
•We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</description><identifier>ISSN: 0360-1323</identifier><identifier>EISSN: 1873-684X</identifier><identifier>DOI: 10.1016/j.buildenv.2021.107879</identifier><language>eng</language><publisher>Oxford: Elsevier Ltd</publisher><subject>Accuracy ; Algorithms ; Classification ; Classifiers ; Computer applications ; Datasets ; Earth observation ; Environmental studies ; Google earth engine ; Greater bay area ; Land-use and land-cover ; Learning algorithms ; Local climate zone ; Machine learning ; Metropolitan areas ; Pixels ; Remote sensing ; Support vector machines ; Workflow</subject><ispartof>Building and environment, 2021-07, Vol.199, p.107879, Article 107879</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Jul 15, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</citedby><cites>FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</cites><orcidid>0000-0003-2442-7984 ; 0000-0002-8494-2585 ; 0000-0003-3787-9648</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0360132321002857$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Hay Chung, Lamuel Chi</creatorcontrib><creatorcontrib>Xie, Jing</creatorcontrib><creatorcontrib>Ren, Chao</creatorcontrib><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><title>Building and environment</title><description>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally.
•We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Computer applications</subject><subject>Datasets</subject><subject>Earth observation</subject><subject>Environmental studies</subject><subject>Google earth engine</subject><subject>Greater bay area</subject><subject>Land-use and land-cover</subject><subject>Learning algorithms</subject><subject>Local climate zone</subject><subject>Machine learning</subject><subject>Metropolitan areas</subject><subject>Pixels</subject><subject>Remote sensing</subject><subject>Support vector machines</subject><subject>Workflow</subject><issn>0360-1323</issn><issn>1873-684X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqFkM9KxDAQh4MouP55BQl47po03bS9KbKuguBFwVuYptPdLG1Sk-yCPoMPbcrq2dPAzO-bYT5Crjibc8blzXbe7Ezfot3Pc5bz1Cyrsj4iM16VIpNV8X5MZkxIlnGRi1NyFsKWJbAWxYx8Pw2jd3ts6QB6YyxmPYK3xq5TYxyn6jraOw091b0ZICL9chYDNZYOGL0bXW8iWAoeIdBdmBDthtEFk7JL8HFDXRPQ7yEaZ2kLESZ45dy6_wss7TrdviAnHfQBL3_rOXl7WL7eP2bPL6un-7vnTBdMxkzkDSxq1slOML4Qugbe1KzlwHjdSY1Moyyqsm3KklWcN4tOszINoWi0BC7EObk-7E2vf-wwRLV1O2_TSZUviiqZkTlPKXlIae9C8Nip0ScB_lNxpibzaqv-zKvJvDqYT-DtAcT0w96gV0EbtBpb41FH1Trz34ofmamStA</recordid><startdate>20210715</startdate><enddate>20210715</enddate><creator>Hay Chung, Lamuel Chi</creator><creator>Xie, Jing</creator><creator>Ren, Chao</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7ST</scope><scope>8FD</scope><scope>C1K</scope><scope>F28</scope><scope>FR3</scope><scope>KR7</scope><scope>SOI</scope><orcidid>https://orcid.org/0000-0003-2442-7984</orcidid><orcidid>https://orcid.org/0000-0002-8494-2585</orcidid><orcidid>https://orcid.org/0000-0003-3787-9648</orcidid></search><sort><creationdate>20210715</creationdate><title>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</title><author>Hay Chung, Lamuel Chi ; Xie, Jing ; Ren, Chao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c406t-32ba590f6f30153c9a1b90d1a019f6ce0ce6487db770811b5fc07a01a4bc6a133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Computer applications</topic><topic>Datasets</topic><topic>Earth observation</topic><topic>Environmental studies</topic><topic>Google earth engine</topic><topic>Greater bay area</topic><topic>Land-use and land-cover</topic><topic>Learning algorithms</topic><topic>Local climate zone</topic><topic>Machine learning</topic><topic>Metropolitan areas</topic><topic>Pixels</topic><topic>Remote sensing</topic><topic>Support vector machines</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hay Chung, Lamuel Chi</creatorcontrib><creatorcontrib>Xie, Jing</creatorcontrib><creatorcontrib>Ren, Chao</creatorcontrib><collection>CrossRef</collection><collection>Environment Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><collection>Environment Abstracts</collection><jtitle>Building and environment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hay Chung, Lamuel Chi</au><au>Xie, Jing</au><au>Ren, Chao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine</atitle><jtitle>Building and environment</jtitle><date>2021-07-15</date><risdate>2021</risdate><volume>199</volume><spage>107879</spage><pages>107879-</pages><artnum>107879</artnum><issn>0360-1323</issn><eissn>1873-684X</eissn><abstract>Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally.
•We explored a large-scale pixel-based local climate zone mapping workflow.•Data composition based on machine learning was tested in Google Earth Engine.•Random forests show high accuracy and support-vector machines show high stability.•Thermal-infrared variables influenced RF classification the most.•Using “elite variables”, random forests offer competitive accuracy and efficiency.</abstract><cop>Oxford</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.buildenv.2021.107879</doi><orcidid>https://orcid.org/0000-0003-2442-7984</orcidid><orcidid>https://orcid.org/0000-0002-8494-2585</orcidid><orcidid>https://orcid.org/0000-0003-3787-9648</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0360-1323 |
ispartof | Building and environment, 2021-07, Vol.199, p.107879, Article 107879 |
issn | 0360-1323 1873-684X |
language | eng |
recordid | cdi_proquest_journals_2548693621 |
source | Elsevier ScienceDirect Journals |
subjects | Accuracy Algorithms Classification Classifiers Computer applications Datasets Earth observation Environmental studies Google earth engine Greater bay area Land-use and land-cover Learning algorithms Local climate zone Machine learning Metropolitan areas Pixels Remote sensing Support vector machines Workflow |
title | Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T15%3A15%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20machine-learning%20mapping%20of%20local%20climate%20zones%20in%20metropolitan%20areas%20using%20composite%20Earth%20observation%20data%20in%20Google%20Earth%20Engine&rft.jtitle=Building%20and%20environment&rft.au=Hay%20Chung,%20Lamuel%20Chi&rft.date=2021-07-15&rft.volume=199&rft.spage=107879&rft.pages=107879-&rft.artnum=107879&rft.issn=0360-1323&rft.eissn=1873-684X&rft_id=info:doi/10.1016/j.buildenv.2021.107879&rft_dat=%3Cproquest_cross%3E2548693621%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2548693621&rft_id=info:pmid/&rft_els_id=S0360132321002857&rfr_iscdi=true |