Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations

Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distan...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of environmental management 2022-03, Vol.306, p.114434-114434, Article 114434
Hauptverfasser: Zulkepli, Nur Fariha Syaqina, Noorani, Mohd Salmi Md, Razak, Fatimah Abdul, Ismail, Munira, Alias, Mohd Almie
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 114434
container_issue
container_start_page 114434
container_title Journal of environmental management
container_volume 306
creator Zulkepli, Nur Fariha Syaqina
Noorani, Mohd Salmi Md
Razak, Fatimah Abdul
Ismail, Munira
Alias, Mohd Almie
description Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations. •A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas.
doi_str_mv 10.1016/j.jenvman.2022.114434
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2622281215</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S030147972200007X</els_id><sourcerecordid>2622281215</sourcerecordid><originalsourceid>FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</originalsourceid><addsrcrecordid>eNqFkU9v1DAQxS0EokvbjwDykUsW24md7AmhqlCkSr2Us-U_k8arxN56nFbbMx-cbHfhymmkN7-Zp5lHyEfO1pxx9WW73kJ8mkxcCybEmvOmqZs3ZMXZRladqtlbsmI141XTbtoz8gFxyxirBW_fk7NaMiVrJVbk983e5uDDiykhRZp6OgTIJrshODNSN85YIIf4QJ9DGegOMoZFiYUOaUpjetjTEKlBBMQDNZgXoLALmDwgtVCeAZZ-yPRxNmMoezqlGEp6XYnl1RUvyLvejAiXp3pOfn2_vr-6qW7vfvy8-nZbuYaLUsneMt8Kya1qjPAKbO1714D0m84ZD1x4sEpxBq0RznfOW2Nk29lFZa7j9Tn5fNy7y-lxBix6CuhgHE2ENKMWSgjRccHlgsoj6nJCzNDrXQ6TyXvNmT4EoLf6FIA-BKCPASxzn04Ws53A_5v6-_EF-HoEYDn0afm1RhcgOvAhgyvap_Afiz-yj58y</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2622281215</pqid></control><display><type>article</type><title>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Zulkepli, Nur Fariha Syaqina ; Noorani, Mohd Salmi Md ; Razak, Fatimah Abdul ; Ismail, Munira ; Alias, Mohd Almie</creator><creatorcontrib>Zulkepli, Nur Fariha Syaqina ; Noorani, Mohd Salmi Md ; Razak, Fatimah Abdul ; Ismail, Munira ; Alias, Mohd Almie</creatorcontrib><description>Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations. •A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas.</description><identifier>ISSN: 0301-4797</identifier><identifier>EISSN: 1095-8630</identifier><identifier>DOI: 10.1016/j.jenvman.2022.114434</identifier><identifier>PMID: 35065362</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Air Pollutants - analysis ; Air Pollution - analysis ; Cluster Analysis ; Environmental Monitoring ; Haze ; Hierarchical clustering ; Particulate Matter - analysis ; Persistent homology ; Time delay embedding ; Topological data analysis</subject><ispartof>Journal of environmental management, 2022-03, Vol.306, p.114434-114434, Article 114434</ispartof><rights>2022 Elsevier Ltd</rights><rights>Copyright © 2022 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</citedby><cites>FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</cites><orcidid>0000-0002-4204-2618 ; 0000-0002-3045-7963</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jenvman.2022.114434$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35065362$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zulkepli, Nur Fariha Syaqina</creatorcontrib><creatorcontrib>Noorani, Mohd Salmi Md</creatorcontrib><creatorcontrib>Razak, Fatimah Abdul</creatorcontrib><creatorcontrib>Ismail, Munira</creatorcontrib><creatorcontrib>Alias, Mohd Almie</creatorcontrib><title>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</title><title>Journal of environmental management</title><addtitle>J Environ Manage</addtitle><description>Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations. •A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas.</description><subject>Air Pollutants - analysis</subject><subject>Air Pollution - analysis</subject><subject>Cluster Analysis</subject><subject>Environmental Monitoring</subject><subject>Haze</subject><subject>Hierarchical clustering</subject><subject>Particulate Matter - analysis</subject><subject>Persistent homology</subject><subject>Time delay embedding</subject><subject>Topological data analysis</subject><issn>0301-4797</issn><issn>1095-8630</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkU9v1DAQxS0EokvbjwDykUsW24md7AmhqlCkSr2Us-U_k8arxN56nFbbMx-cbHfhymmkN7-Zp5lHyEfO1pxx9WW73kJ8mkxcCybEmvOmqZs3ZMXZRladqtlbsmI141XTbtoz8gFxyxirBW_fk7NaMiVrJVbk983e5uDDiykhRZp6OgTIJrshODNSN85YIIf4QJ9DGegOMoZFiYUOaUpjetjTEKlBBMQDNZgXoLALmDwgtVCeAZZ-yPRxNmMoezqlGEp6XYnl1RUvyLvejAiXp3pOfn2_vr-6qW7vfvy8-nZbuYaLUsneMt8Kya1qjPAKbO1714D0m84ZD1x4sEpxBq0RznfOW2Nk29lFZa7j9Tn5fNy7y-lxBix6CuhgHE2ENKMWSgjRccHlgsoj6nJCzNDrXQ6TyXvNmT4EoLf6FIA-BKCPASxzn04Ws53A_5v6-_EF-HoEYDn0afm1RhcgOvAhgyvap_Afiz-yj58y</recordid><startdate>20220315</startdate><enddate>20220315</enddate><creator>Zulkepli, Nur Fariha Syaqina</creator><creator>Noorani, Mohd Salmi Md</creator><creator>Razak, Fatimah Abdul</creator><creator>Ismail, Munira</creator><creator>Alias, Mohd Almie</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4204-2618</orcidid><orcidid>https://orcid.org/0000-0002-3045-7963</orcidid></search><sort><creationdate>20220315</creationdate><title>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</title><author>Zulkepli, Nur Fariha Syaqina ; Noorani, Mohd Salmi Md ; Razak, Fatimah Abdul ; Ismail, Munira ; Alias, Mohd Almie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Air Pollutants - analysis</topic><topic>Air Pollution - analysis</topic><topic>Cluster Analysis</topic><topic>Environmental Monitoring</topic><topic>Haze</topic><topic>Hierarchical clustering</topic><topic>Particulate Matter - analysis</topic><topic>Persistent homology</topic><topic>Time delay embedding</topic><topic>Topological data analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zulkepli, Nur Fariha Syaqina</creatorcontrib><creatorcontrib>Noorani, Mohd Salmi Md</creatorcontrib><creatorcontrib>Razak, Fatimah Abdul</creatorcontrib><creatorcontrib>Ismail, Munira</creatorcontrib><creatorcontrib>Alias, Mohd Almie</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of environmental management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zulkepli, Nur Fariha Syaqina</au><au>Noorani, Mohd Salmi Md</au><au>Razak, Fatimah Abdul</au><au>Ismail, Munira</au><au>Alias, Mohd Almie</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</atitle><jtitle>Journal of environmental management</jtitle><addtitle>J Environ Manage</addtitle><date>2022-03-15</date><risdate>2022</risdate><volume>306</volume><spage>114434</spage><epage>114434</epage><pages>114434-114434</pages><artnum>114434</artnum><issn>0301-4797</issn><eissn>1095-8630</eissn><abstract>Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations. •A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>35065362</pmid><doi>10.1016/j.jenvman.2022.114434</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-4204-2618</orcidid><orcidid>https://orcid.org/0000-0002-3045-7963</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0301-4797
ispartof Journal of environmental management, 2022-03, Vol.306, p.114434-114434, Article 114434
issn 0301-4797
1095-8630
language eng
recordid cdi_proquest_miscellaneous_2622281215
source MEDLINE; Elsevier ScienceDirect Journals Complete
subjects Air Pollutants - analysis
Air Pollution - analysis
Cluster Analysis
Environmental Monitoring
Haze
Hierarchical clustering
Particulate Matter - analysis
Persistent homology
Time delay embedding
Topological data analysis
title Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T05%3A11%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Hybridization%20of%20hierarchical%20clustering%20with%20persistent%20homology%20in%20assessing%20haze%20episodes%20between%20air%20quality%20monitoring%20stations&rft.jtitle=Journal%20of%20environmental%20management&rft.au=Zulkepli,%20Nur%20Fariha%20Syaqina&rft.date=2022-03-15&rft.volume=306&rft.spage=114434&rft.epage=114434&rft.pages=114434-114434&rft.artnum=114434&rft.issn=0301-4797&rft.eissn=1095-8630&rft_id=info:doi/10.1016/j.jenvman.2022.114434&rft_dat=%3Cproquest_cross%3E2622281215%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2622281215&rft_id=info:pmid/35065362&rft_els_id=S030147972200007X&rfr_iscdi=true