Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations
Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distan...
Gespeichert in:
Veröffentlicht in: | Journal of environmental management 2022-03, Vol.306, p.114434-114434, Article 114434 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 114434 |
---|---|
container_issue | |
container_start_page | 114434 |
container_title | Journal of environmental management |
container_volume | 306 |
creator | Zulkepli, Nur Fariha Syaqina Noorani, Mohd Salmi Md Razak, Fatimah Abdul Ismail, Munira Alias, Mohd Almie |
description | Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations.
•A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas. |
doi_str_mv | 10.1016/j.jenvman.2022.114434 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2622281215</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S030147972200007X</els_id><sourcerecordid>2622281215</sourcerecordid><originalsourceid>FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</originalsourceid><addsrcrecordid>eNqFkU9v1DAQxS0EokvbjwDykUsW24md7AmhqlCkSr2Us-U_k8arxN56nFbbMx-cbHfhymmkN7-Zp5lHyEfO1pxx9WW73kJ8mkxcCybEmvOmqZs3ZMXZRladqtlbsmI141XTbtoz8gFxyxirBW_fk7NaMiVrJVbk983e5uDDiykhRZp6OgTIJrshODNSN85YIIf4QJ9DGegOMoZFiYUOaUpjetjTEKlBBMQDNZgXoLALmDwgtVCeAZZ-yPRxNmMoezqlGEp6XYnl1RUvyLvejAiXp3pOfn2_vr-6qW7vfvy8-nZbuYaLUsneMt8Kya1qjPAKbO1714D0m84ZD1x4sEpxBq0RznfOW2Nk29lFZa7j9Tn5fNy7y-lxBix6CuhgHE2ENKMWSgjRccHlgsoj6nJCzNDrXQ6TyXvNmT4EoLf6FIA-BKCPASxzn04Ws53A_5v6-_EF-HoEYDn0afm1RhcgOvAhgyvap_Afiz-yj58y</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2622281215</pqid></control><display><type>article</type><title>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Zulkepli, Nur Fariha Syaqina ; Noorani, Mohd Salmi Md ; Razak, Fatimah Abdul ; Ismail, Munira ; Alias, Mohd Almie</creator><creatorcontrib>Zulkepli, Nur Fariha Syaqina ; Noorani, Mohd Salmi Md ; Razak, Fatimah Abdul ; Ismail, Munira ; Alias, Mohd Almie</creatorcontrib><description>Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations.
•A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas.</description><identifier>ISSN: 0301-4797</identifier><identifier>EISSN: 1095-8630</identifier><identifier>DOI: 10.1016/j.jenvman.2022.114434</identifier><identifier>PMID: 35065362</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Air Pollutants - analysis ; Air Pollution - analysis ; Cluster Analysis ; Environmental Monitoring ; Haze ; Hierarchical clustering ; Particulate Matter - analysis ; Persistent homology ; Time delay embedding ; Topological data analysis</subject><ispartof>Journal of environmental management, 2022-03, Vol.306, p.114434-114434, Article 114434</ispartof><rights>2022 Elsevier Ltd</rights><rights>Copyright © 2022 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</citedby><cites>FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</cites><orcidid>0000-0002-4204-2618 ; 0000-0002-3045-7963</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jenvman.2022.114434$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35065362$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zulkepli, Nur Fariha Syaqina</creatorcontrib><creatorcontrib>Noorani, Mohd Salmi Md</creatorcontrib><creatorcontrib>Razak, Fatimah Abdul</creatorcontrib><creatorcontrib>Ismail, Munira</creatorcontrib><creatorcontrib>Alias, Mohd Almie</creatorcontrib><title>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</title><title>Journal of environmental management</title><addtitle>J Environ Manage</addtitle><description>Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations.
•A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas.</description><subject>Air Pollutants - analysis</subject><subject>Air Pollution - analysis</subject><subject>Cluster Analysis</subject><subject>Environmental Monitoring</subject><subject>Haze</subject><subject>Hierarchical clustering</subject><subject>Particulate Matter - analysis</subject><subject>Persistent homology</subject><subject>Time delay embedding</subject><subject>Topological data analysis</subject><issn>0301-4797</issn><issn>1095-8630</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkU9v1DAQxS0EokvbjwDykUsW24md7AmhqlCkSr2Us-U_k8arxN56nFbbMx-cbHfhymmkN7-Zp5lHyEfO1pxx9WW73kJ8mkxcCybEmvOmqZs3ZMXZRladqtlbsmI141XTbtoz8gFxyxirBW_fk7NaMiVrJVbk983e5uDDiykhRZp6OgTIJrshODNSN85YIIf4QJ9DGegOMoZFiYUOaUpjetjTEKlBBMQDNZgXoLALmDwgtVCeAZZ-yPRxNmMoezqlGEp6XYnl1RUvyLvejAiXp3pOfn2_vr-6qW7vfvy8-nZbuYaLUsneMt8Kya1qjPAKbO1714D0m84ZD1x4sEpxBq0RznfOW2Nk29lFZa7j9Tn5fNy7y-lxBix6CuhgHE2ENKMWSgjRccHlgsoj6nJCzNDrXQ6TyXvNmT4EoLf6FIA-BKCPASxzn04Ws53A_5v6-_EF-HoEYDn0afm1RhcgOvAhgyvap_Afiz-yj58y</recordid><startdate>20220315</startdate><enddate>20220315</enddate><creator>Zulkepli, Nur Fariha Syaqina</creator><creator>Noorani, Mohd Salmi Md</creator><creator>Razak, Fatimah Abdul</creator><creator>Ismail, Munira</creator><creator>Alias, Mohd Almie</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4204-2618</orcidid><orcidid>https://orcid.org/0000-0002-3045-7963</orcidid></search><sort><creationdate>20220315</creationdate><title>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</title><author>Zulkepli, Nur Fariha Syaqina ; Noorani, Mohd Salmi Md ; Razak, Fatimah Abdul ; Ismail, Munira ; Alias, Mohd Almie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c412t-5fb0d7251b64a2d6eb3dfc4e5d98cade12deb6610e7a2cd8cdbaa578beb60c813</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Air Pollutants - analysis</topic><topic>Air Pollution - analysis</topic><topic>Cluster Analysis</topic><topic>Environmental Monitoring</topic><topic>Haze</topic><topic>Hierarchical clustering</topic><topic>Particulate Matter - analysis</topic><topic>Persistent homology</topic><topic>Time delay embedding</topic><topic>Topological data analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zulkepli, Nur Fariha Syaqina</creatorcontrib><creatorcontrib>Noorani, Mohd Salmi Md</creatorcontrib><creatorcontrib>Razak, Fatimah Abdul</creatorcontrib><creatorcontrib>Ismail, Munira</creatorcontrib><creatorcontrib>Alias, Mohd Almie</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of environmental management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zulkepli, Nur Fariha Syaqina</au><au>Noorani, Mohd Salmi Md</au><au>Razak, Fatimah Abdul</au><au>Ismail, Munira</au><au>Alias, Mohd Almie</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations</atitle><jtitle>Journal of environmental management</jtitle><addtitle>J Environ Manage</addtitle><date>2022-03-15</date><risdate>2022</risdate><volume>306</volume><spage>114434</spage><epage>114434</epage><pages>114434-114434</pages><artnum>114434</artnum><issn>0301-4797</issn><eissn>1095-8630</eissn><abstract>Haze has been a major issue afflicting Southeast Asian countries, including Malaysia, for the past few decades. Hierarchical agglomerative cluster analysis (HACA) is commonly used to evaluate the spatial behavior between areas in which pollutants interact. Typically, using HACA, the Euclidean distance acts as the dissimilarity measure and air quality monitoring stations are grouped according to this measure, thus revealing the most polluted areas. In this study, a framework for the hybridization of the HACA technique is proposed by considering the topological similarity (Wasserstein distance) between stations to evaluate the spatial patterns of the affected areas by haze episodes. For this, a tool in the topological data analysis (TDA), namely, persistent homology, is used to extract essential topological features hidden in the dataset. The performance of the proposed method is compared with that of traditional HACA and evaluated based on its ability to categorize areas according to the exceedance level of the particulate matter (PM10). Results show that additional topological features have yielded better accuracy compared to without the case that does not consider topological features. The cluster validity indices are computed to verify the results, and the proposed method outperforms the traditional method, suggesting a practical alternative approach for assessing the similarity in air pollution behaviors based on topological characterizations.
•A framework of hybrid hierarchical clustering (HACA) with persistent homology is proposed.•Hybrid HACA outperformed traditional HACA in categorizing stations according to haze severity.•The superiority of hybrid HACA is verified by cluster validity indices.•The most affected areas could be identified through hybrid HACA.•A better air quality management can be achieved by using the new approach to categorize affected areas.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>35065362</pmid><doi>10.1016/j.jenvman.2022.114434</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-4204-2618</orcidid><orcidid>https://orcid.org/0000-0002-3045-7963</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0301-4797 |
ispartof | Journal of environmental management, 2022-03, Vol.306, p.114434-114434, Article 114434 |
issn | 0301-4797 1095-8630 |
language | eng |
recordid | cdi_proquest_miscellaneous_2622281215 |
source | MEDLINE; Elsevier ScienceDirect Journals Complete |
subjects | Air Pollutants - analysis Air Pollution - analysis Cluster Analysis Environmental Monitoring Haze Hierarchical clustering Particulate Matter - analysis Persistent homology Time delay embedding Topological data analysis |
title | Hybridization of hierarchical clustering with persistent homology in assessing haze episodes between air quality monitoring stations |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T05%3A11%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Hybridization%20of%20hierarchical%20clustering%20with%20persistent%20homology%20in%20assessing%20haze%20episodes%20between%20air%20quality%20monitoring%20stations&rft.jtitle=Journal%20of%20environmental%20management&rft.au=Zulkepli,%20Nur%20Fariha%20Syaqina&rft.date=2022-03-15&rft.volume=306&rft.spage=114434&rft.epage=114434&rft.pages=114434-114434&rft.artnum=114434&rft.issn=0301-4797&rft.eissn=1095-8630&rft_id=info:doi/10.1016/j.jenvman.2022.114434&rft_dat=%3Cproquest_cross%3E2622281215%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2622281215&rft_id=info:pmid/35065362&rft_els_id=S030147972200007X&rfr_iscdi=true |