MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm
•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative expe...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2021-08, Vol.175, p.114830, Article 114830 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 114830 |
container_title | Expert systems with applications |
container_volume | 175 |
creator | Tran, Vanha Wang, Lizhen Chen, Hongmei Xiao, Qing |
description | •A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments.
Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms. |
doi_str_mv | 10.1016/j.eswa.2021.114830 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2554665706</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417421002712</els_id><sourcerecordid>2554665706</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhi0EEqXwB5gsMafY8WcQS1UBRQKxlNly7UvrKk2K7Rb496QKYmS65Xnv7n0QuqZkQgmVt5sJpE87KUlJJ5RyzcgJGlGtWCFVxU7RiFRCFZwqfo4uUtoQQhUhaoSWr7P54g5P8dZ-ha1tsGvCxx6wbT1e27TG2S4bKJY2gf9jdhEOtoE2Y9cVTedsDl2LdzZniC3ehja0K2ybVRdDXm8v0VltmwRXv3OM3h8fFrN58fL29DybvhSOlToXqnKWVkJzwhXXQkPlNfMUJHglifay8sBU6YQXUmhJGSe-5pYxUtcVF56N0c2wdxe7vkLKZtPtY9ufNKUQXEqhiOypcqBc7FKKUJtd7EvFb0OJObo0G3N0aY4uzeCyD90PIej_PwSIJrkArQMfIrhsfBf-i_8ATil8Nw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2554665706</pqid></control><display><type>article</type><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><source>Elsevier ScienceDirect Journals</source><creator>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</creator><creatorcontrib>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</creatorcontrib><description>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments.
Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2021.114830</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Algorithms ; Boolean algebra ; Computing time ; Data mining ; Datasets ; Hash table ; Maximal clique ; Maximal co-location pattern ; Model testing ; Pattern analysis ; Spatial data ; Spatial data mining</subject><ispartof>Expert systems with applications, 2021-08, Vol.175, p.114830, Article 114830</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Aug 1, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</citedby><cites>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2021.114830$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,777,781,3537,27905,27906,45976</link.rule.ids></links><search><creatorcontrib>Tran, Vanha</creatorcontrib><creatorcontrib>Wang, Lizhen</creatorcontrib><creatorcontrib>Chen, Hongmei</creatorcontrib><creatorcontrib>Xiao, Qing</creatorcontrib><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><title>Expert systems with applications</title><description>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments.
Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</description><subject>Algorithms</subject><subject>Boolean algebra</subject><subject>Computing time</subject><subject>Data mining</subject><subject>Datasets</subject><subject>Hash table</subject><subject>Maximal clique</subject><subject>Maximal co-location pattern</subject><subject>Model testing</subject><subject>Pattern analysis</subject><subject>Spatial data</subject><subject>Spatial data mining</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhi0EEqXwB5gsMafY8WcQS1UBRQKxlNly7UvrKk2K7Rb496QKYmS65Xnv7n0QuqZkQgmVt5sJpE87KUlJJ5RyzcgJGlGtWCFVxU7RiFRCFZwqfo4uUtoQQhUhaoSWr7P54g5P8dZ-ha1tsGvCxx6wbT1e27TG2S4bKJY2gf9jdhEOtoE2Y9cVTedsDl2LdzZniC3ehja0K2ybVRdDXm8v0VltmwRXv3OM3h8fFrN58fL29DybvhSOlToXqnKWVkJzwhXXQkPlNfMUJHglifay8sBU6YQXUmhJGSe-5pYxUtcVF56N0c2wdxe7vkLKZtPtY9ufNKUQXEqhiOypcqBc7FKKUJtd7EvFb0OJObo0G3N0aY4uzeCyD90PIej_PwSIJrkArQMfIrhsfBf-i_8ATil8Nw</recordid><startdate>20210801</startdate><enddate>20210801</enddate><creator>Tran, Vanha</creator><creator>Wang, Lizhen</creator><creator>Chen, Hongmei</creator><creator>Xiao, Qing</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20210801</creationdate><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><author>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Boolean algebra</topic><topic>Computing time</topic><topic>Data mining</topic><topic>Datasets</topic><topic>Hash table</topic><topic>Maximal clique</topic><topic>Maximal co-location pattern</topic><topic>Model testing</topic><topic>Pattern analysis</topic><topic>Spatial data</topic><topic>Spatial data mining</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tran, Vanha</creatorcontrib><creatorcontrib>Wang, Lizhen</creatorcontrib><creatorcontrib>Chen, Hongmei</creatorcontrib><creatorcontrib>Xiao, Qing</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tran, Vanha</au><au>Wang, Lizhen</au><au>Chen, Hongmei</au><au>Xiao, Qing</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</atitle><jtitle>Expert systems with applications</jtitle><date>2021-08-01</date><risdate>2021</risdate><volume>175</volume><spage>114830</spage><pages>114830-</pages><artnum>114830</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments.
Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2021.114830</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2021-08, Vol.175, p.114830, Article 114830 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_journals_2554665706 |
source | Elsevier ScienceDirect Journals |
subjects | Algorithms Boolean algebra Computing time Data mining Datasets Hash table Maximal clique Maximal co-location pattern Model testing Pattern analysis Spatial data Spatial data mining |
title | MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T23%3A06%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MCHT:%20A%20maximal%20clique%20and%20hash%20table-based%20maximal%20prevalent%20co-location%20pattern%20mining%20algorithm&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Tran,%20Vanha&rft.date=2021-08-01&rft.volume=175&rft.spage=114830&rft.pages=114830-&rft.artnum=114830&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2021.114830&rft_dat=%3Cproquest_cross%3E2554665706%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2554665706&rft_id=info:pmid/&rft_els_id=S0957417421002712&rfr_iscdi=true |