MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm

•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative expe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2021-08, Vol.175, p.114830, Article 114830
Hauptverfasser: Tran, Vanha, Wang, Lizhen, Chen, Hongmei, Xiao, Qing
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 114830
container_title Expert systems with applications
container_volume 175
creator Tran, Vanha
Wang, Lizhen
Chen, Hongmei
Xiao, Qing
description •A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments. Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.
doi_str_mv 10.1016/j.eswa.2021.114830
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2554665706</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417421002712</els_id><sourcerecordid>2554665706</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhi0EEqXwB5gsMafY8WcQS1UBRQKxlNly7UvrKk2K7Rb496QKYmS65Xnv7n0QuqZkQgmVt5sJpE87KUlJJ5RyzcgJGlGtWCFVxU7RiFRCFZwqfo4uUtoQQhUhaoSWr7P54g5P8dZ-ha1tsGvCxx6wbT1e27TG2S4bKJY2gf9jdhEOtoE2Y9cVTedsDl2LdzZniC3ehja0K2ybVRdDXm8v0VltmwRXv3OM3h8fFrN58fL29DybvhSOlToXqnKWVkJzwhXXQkPlNfMUJHglifay8sBU6YQXUmhJGSe-5pYxUtcVF56N0c2wdxe7vkLKZtPtY9ufNKUQXEqhiOypcqBc7FKKUJtd7EvFb0OJObo0G3N0aY4uzeCyD90PIej_PwSIJrkArQMfIrhsfBf-i_8ATil8Nw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2554665706</pqid></control><display><type>article</type><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><source>Elsevier ScienceDirect Journals</source><creator>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</creator><creatorcontrib>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</creatorcontrib><description>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments. Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2021.114830</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Algorithms ; Boolean algebra ; Computing time ; Data mining ; Datasets ; Hash table ; Maximal clique ; Maximal co-location pattern ; Model testing ; Pattern analysis ; Spatial data ; Spatial data mining</subject><ispartof>Expert systems with applications, 2021-08, Vol.175, p.114830, Article 114830</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Aug 1, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</citedby><cites>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2021.114830$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,777,781,3537,27905,27906,45976</link.rule.ids></links><search><creatorcontrib>Tran, Vanha</creatorcontrib><creatorcontrib>Wang, Lizhen</creatorcontrib><creatorcontrib>Chen, Hongmei</creatorcontrib><creatorcontrib>Xiao, Qing</creatorcontrib><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><title>Expert systems with applications</title><description>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments. Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</description><subject>Algorithms</subject><subject>Boolean algebra</subject><subject>Computing time</subject><subject>Data mining</subject><subject>Datasets</subject><subject>Hash table</subject><subject>Maximal clique</subject><subject>Maximal co-location pattern</subject><subject>Model testing</subject><subject>Pattern analysis</subject><subject>Spatial data</subject><subject>Spatial data mining</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhi0EEqXwB5gsMafY8WcQS1UBRQKxlNly7UvrKk2K7Rb496QKYmS65Xnv7n0QuqZkQgmVt5sJpE87KUlJJ5RyzcgJGlGtWCFVxU7RiFRCFZwqfo4uUtoQQhUhaoSWr7P54g5P8dZ-ha1tsGvCxx6wbT1e27TG2S4bKJY2gf9jdhEOtoE2Y9cVTedsDl2LdzZniC3ehja0K2ybVRdDXm8v0VltmwRXv3OM3h8fFrN58fL29DybvhSOlToXqnKWVkJzwhXXQkPlNfMUJHglifay8sBU6YQXUmhJGSe-5pYxUtcVF56N0c2wdxe7vkLKZtPtY9ufNKUQXEqhiOypcqBc7FKKUJtd7EvFb0OJObo0G3N0aY4uzeCyD90PIej_PwSIJrkArQMfIrhsfBf-i_8ATil8Nw</recordid><startdate>20210801</startdate><enddate>20210801</enddate><creator>Tran, Vanha</creator><creator>Wang, Lizhen</creator><creator>Chen, Hongmei</creator><creator>Xiao, Qing</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20210801</creationdate><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><author>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Boolean algebra</topic><topic>Computing time</topic><topic>Data mining</topic><topic>Datasets</topic><topic>Hash table</topic><topic>Maximal clique</topic><topic>Maximal co-location pattern</topic><topic>Model testing</topic><topic>Pattern analysis</topic><topic>Spatial data</topic><topic>Spatial data mining</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tran, Vanha</creatorcontrib><creatorcontrib>Wang, Lizhen</creatorcontrib><creatorcontrib>Chen, Hongmei</creatorcontrib><creatorcontrib>Xiao, Qing</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tran, Vanha</au><au>Wang, Lizhen</au><au>Chen, Hongmei</au><au>Xiao, Qing</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</atitle><jtitle>Expert systems with applications</jtitle><date>2021-08-01</date><risdate>2021</risdate><volume>175</volume><spage>114830</spage><pages>114830-</pages><artnum>114830</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments. Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. The current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. The main execution time of this model is occupied by collecting co-location instances of candidates, which makes discovering maximal co-location patterns is still very challenging when data is big and/or dense. To take up the challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques that can compactly represent neighbor relationships between instances of a spatial data set are enumerated. The advantages of bit string operations are fully utilized to speed up the process of enumerating maximal cliques. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2021.114830</doi></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2021-08, Vol.175, p.114830, Article 114830
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_journals_2554665706
source Elsevier ScienceDirect Journals
subjects Algorithms
Boolean algebra
Computing time
Data mining
Datasets
Hash table
Maximal clique
Maximal co-location pattern
Model testing
Pattern analysis
Spatial data
Spatial data mining
title MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T23%3A06%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MCHT:%20A%20maximal%20clique%20and%20hash%20table-based%20maximal%20prevalent%20co-location%20pattern%20mining%20algorithm&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Tran,%20Vanha&rft.date=2021-08-01&rft.volume=175&rft.spage=114830&rft.pages=114830-&rft.artnum=114830&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2021.114830&rft_dat=%3Cproquest_cross%3E2554665706%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2554665706&rft_id=info:pmid/&rft_els_id=S0957417421002712&rfr_iscdi=true