MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm

•A novel maximal prevalent co-location pattern mining framework is presented.
•The time and space costs are reduced efficiently by maximal cliques and hash tables.
•Enumerating maximal cliques is accelerated by bit string operations.
•The performance of the proposed method is demonstrated by comparative expe...
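
As a rough illustration of the bit string idea mentioned in the highlights, the sketch below enumerates maximal cliques with each vertex set stored as one Python integer, so that set intersection and difference become single bitwise operations. The record does not say which enumeration algorithm the paper uses; Bron-Kerbosch with pivoting is swapped in here as an assumption, and the names maximal_cliques, bits and adj are hypothetical.

```python
def maximal_cliques(adj):
    """Enumerate maximal cliques of an undirected graph.

    adj[i] is an int used as a bit string: bit j is set iff instances i and j
    are neighbors (no self-loops). Returns each clique as a bit string.
    NOTE: Bron-Kerbosch with pivoting is an assumed stand-in, not
    necessarily the enumeration scheme used in the paper.
    """
    n = len(adj)
    cliques = []

    def expand(r, p, x):
        # r: current clique, p: candidates, x: already-processed vertices
        if p == 0 and x == 0:
            cliques.append(r)
            return
        # Pick the pivot in p|x with the most neighbors inside p.
        pivot, best = -1, -1
        m = p | x
        while m:
            low = m & -m                      # lowest set bit
            u = low.bit_length() - 1
            cnt = bin(p & adj[u]).count("1")  # popcount of p AND N(u)
            if cnt > best:
                pivot, best = u, cnt
            m ^= low
        # Only candidates that are not neighbors of the pivot need expanding.
        candidates = p & ~adj[pivot]
        while candidates:
            low = candidates & -candidates
            v = low.bit_length() - 1
            expand(r | low, p & adj[v], x & adj[v])
            p &= ~low                         # move v from p ...
            x |= low                          # ... to x
            candidates ^= low

    expand(0, (1 << n) - 1, 0)
    return cliques


def bits(mask):
    """Decode a bit string into a sorted list of vertex indices."""
    return [i for i in range(mask.bit_length()) if mask >> i & 1]


# Toy neighbor graph: edges 0-1, 0-2, 1-2, 2-3
adj = [0b0110, 0b0101, 0b1011, 0b0100]
print(sorted(bits(c) for c in maximal_cliques(adj)))  # [[0, 1, 2], [2, 3]]
```

Storing neighborhoods as integers keeps the inner loop to a handful of AND, OR and XOR operations, which is the kind of saving the highlights attribute to bit string operations.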

Bibliographic details
Published in:	Expert systems with applications, 2021-08, Vol.175, p.114830, Article 114830
Main authors:	Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing
Format:	Article
Language:	eng
Subjects:	Algorithms ; Boolean algebra ; Computing time ; Data mining ; Datasets ; Hash table ; Maximal clique ; Maximal co-location pattern ; Model testing ; Pattern analysis ; Spatial data ; Spatial data mining
Online access:	Full text

container_end_page
container_issue
container_start_page	114830
container_title	Expert systems with applications
container_volume	175
creator	Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing
description	•A novel maximal prevalent co-location pattern mining framework is presented. •The time and space costs are reduced efficiently by maximal cliques and hash tables. •Enumerating maximal cliques is accelerated by bit string operations. •The performance of the proposed method is demonstrated by comparative experiments. Co-location patterns are subsets of Boolean spatial features whose instances frequently appear together in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lets users absorb results and make meaningful inferences more easily. Current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. Most of this model's execution time is spent collecting co-location instances of candidates, which makes discovering maximal co-location patterns still very challenging when data is big and/or dense. To address this challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques, which compactly represent neighbor relationships between instances of a spatial data set, are enumerated. Bit string operations are used to speed up the enumeration. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both computational time and memory consumption compared with existing algorithms.
doi_str_mv	10.1016/j.eswa.2021.114830
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2554665706</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417421002712</els_id><sourcerecordid>2554665706</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhi0EEqXwB5gsMafY8WcQS1UBRQKxlNly7UvrKk2K7Rb496QKYmS65Xnv7n0QuqZkQgmVt5sJpE87KUlJJ5RyzcgJGlGtWCFVxU7RiFRCFZwqfo4uUtoQQhUhaoSWr7P54g5P8dZ-ha1tsGvCxx6wbT1e27TG2S4bKJY2gf9jdhEOtoE2Y9cVTedsDl2LdzZniC3ehja0K2ybVRdDXm8v0VltmwRXv3OM3h8fFrN58fL29DybvhSOlToXqnKWVkJzwhXXQkPlNfMUJHglifay8sBU6YQXUmhJGSe-5pYxUtcVF56N0c2wdxe7vkLKZtPtY9ufNKUQXEqhiOypcqBc7FKKUJtd7EvFb0OJObo0G3N0aY4uzeCyD90PIej_PwSIJrkArQMfIrhsfBf-i_8ATil8Nw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2554665706</pqid></control><display><type>article</type><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><source>Elsevier ScienceDirect Journals</source><creator>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</creator><creatorcontrib>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, Qing</creatorcontrib><description>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is proved by comparative experiments. Co-location patterns refer to subsets of Boolean spatial features with instances of these features frequently appear in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lead users more easily to absorb results and make meaningful inferences. 
Current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. Most of this model's execution time is spent collecting co-location instances of candidates, which makes discovering maximal co-location patterns still very challenging when data is big and/or dense. To address this challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques, which compactly represent neighbor relationships between instances of a spatial data set, are enumerated. Bit string operations are used to speed up the enumeration. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. 
Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2021.114830</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Algorithms ; Boolean algebra ; Computing time ; Data mining ; Datasets ; Hash table ; Maximal clique ; Maximal co-location pattern ; Model testing ; Pattern analysis ; Spatial data ; Spatial data mining</subject><ispartof>Expert systems with applications, 2021-08, Vol.175, p.114830, Article 114830</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Aug 1, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</citedby><cites>FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2021.114830$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,777,781,3537,27905,27906,45976</link.rule.ids></links><search><creatorcontrib>Tran, Vanha</creatorcontrib><creatorcontrib>Wang, Lizhen</creatorcontrib><creatorcontrib>Chen, Hongmei</creatorcontrib><creatorcontrib>Xiao, Qing</creatorcontrib><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><title>Expert systems with applications</title><description>•A novel maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal 
cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is demonstrated by comparative experiments. Co-location patterns are subsets of Boolean spatial features whose instances frequently appear together in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lets users absorb results and make meaningful inferences more easily. Current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. Most of this model's execution time is spent collecting co-location instances of candidates, which makes discovering maximal co-location patterns still very challenging when data is big and/or dense. To address this challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques, which compactly represent neighbor relationships between instances of a spatial data set, are enumerated. Bit string operations are used to speed up the enumeration. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. 
Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</description><subject>Algorithms</subject><subject>Boolean algebra</subject><subject>Computing time</subject><subject>Data mining</subject><subject>Datasets</subject><subject>Hash table</subject><subject>Maximal clique</subject><subject>Maximal co-location pattern</subject><subject>Model testing</subject><subject>Pattern analysis</subject><subject>Spatial data</subject><subject>Spatial data mining</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhi0EEqXwB5gsMafY8WcQS1UBRQKxlNly7UvrKk2K7Rb496QKYmS65Xnv7n0QuqZkQgmVt5sJpE87KUlJJ5RyzcgJGlGtWCFVxU7RiFRCFZwqfo4uUtoQQhUhaoSWr7P54g5P8dZ-ha1tsGvCxx6wbT1e27TG2S4bKJY2gf9jdhEOtoE2Y9cVTedsDl2LdzZniC3ehja0K2ybVRdDXm8v0VltmwRXv3OM3h8fFrN58fL29DybvhSOlToXqnKWVkJzwhXXQkPlNfMUJHglifay8sBU6YQXUmhJGSe-5pYxUtcVF56N0c2wdxe7vkLKZtPtY9ufNKUQXEqhiOypcqBc7FKKUJtd7EvFb0OJObo0G3N0aY4uzeCyD90PIej_PwSIJrkArQMfIrhsfBf-i_8ATil8Nw</recordid><startdate>20210801</startdate><enddate>20210801</enddate><creator>Tran, Vanha</creator><creator>Wang, Lizhen</creator><creator>Chen, Hongmei</creator><creator>Xiao, Qing</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20210801</creationdate><title>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</title><author>Tran, Vanha ; Wang, Lizhen ; Chen, Hongmei ; Xiao, 
Qing</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-79ca195840474858e9d83d1e6ed7608d69de372c5d565861340df4a330ff945d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Boolean algebra</topic><topic>Computing time</topic><topic>Data mining</topic><topic>Datasets</topic><topic>Hash table</topic><topic>Maximal clique</topic><topic>Maximal co-location pattern</topic><topic>Model testing</topic><topic>Pattern analysis</topic><topic>Spatial data</topic><topic>Spatial data mining</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tran, Vanha</creatorcontrib><creatorcontrib>Wang, Lizhen</creatorcontrib><creatorcontrib>Chen, Hongmei</creatorcontrib><creatorcontrib>Xiao, Qing</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tran, Vanha</au><au>Wang, Lizhen</au><au>Chen, Hongmei</au><au>Xiao, Qing</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm</atitle><jtitle>Expert systems with applications</jtitle><date>2021-08-01</date><risdate>2021</risdate><volume>175</volume><spage>114830</spage><pages>114830-</pages><artnum>114830</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•A novel 
maximal prevalent co-location pattern mining framework is presented.•The time and space costs are reduced efficiently by maximal cliques and hash tables.•Enumerating maximal cliques is accelerated by bit string operations.•The performance of the proposed method is demonstrated by comparative experiments. Co-location patterns are subsets of Boolean spatial features whose instances frequently appear together in nearby geographic space. Maximal co-location patterns are a compact representation of these patterns that lets users absorb results and make meaningful inferences more easily. Current algorithms for maximal co-location pattern mining are based on a generate-test candidate model. Most of this model's execution time is spent collecting co-location instances of candidates, which makes discovering maximal co-location patterns still very challenging when data is big and/or dense. To address this challenge, a novel maximal co-location pattern mining framework based on maximal cliques and hash tables (MCHT) is developed in this study. First, all maximal cliques, which compactly represent neighbor relationships between instances of a spatial data set, are enumerated. Bit string operations are used to speed up the enumeration. Next, a participating instance hash table structure is constructed based on these maximal cliques. Then information about the co-location instances of maximal patterns can be queried and collected efficiently from the hash table. After that, by calculating participation indexes of these patterns to measure their prevalence, maximal prevalent co-location patterns can be filtered efficiently. 
Finally, a series of experiments is conducted on both synthetic and real-facility data sets to demonstrate that the proposed algorithm can efficiently reduce both the computational time and the memory consumption compared with the existing algorithms.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2021.114830</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 0957-4174
ispartof	Expert systems with applications, 2021-08, Vol.175, p.114830, Article 114830
issn	0957-4174 1873-6793
language	eng
recordid	cdi_proquest_journals_2554665706
source	Elsevier ScienceDirect Journals
subjects	Algorithms ; Boolean algebra ; Computing time ; Data mining ; Datasets ; Hash table ; Maximal clique ; Maximal co-location pattern ; Model testing ; Pattern analysis ; Spatial data ; Spatial data mining
title	MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T23%3A06%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MCHT:%20A%20maximal%20clique%20and%20hash%20table-based%20maximal%20prevalent%20co-location%20pattern%20mining%20algorithm&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Tran,%20Vanha&rft.date=2021-08-01&rft.volume=175&rft.spage=114830&rft.pages=114830-&rft.artnum=114830&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2021.114830&rft_dat=%3Cproquest_cross%3E2554665706%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2554665706&rft_id=info:pmid/&rft_els_id=S0957417421002712&rfr_iscdi=true
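
The later steps named in the description (building a participating instance hash table from the maximal cliques, then computing participation indexes) can be sketched as follows. The table layout is an assumption, since the record does not describe the paper's exact structure: keys are the feature sets of maximal cliques, values map each feature to the instance IDs seen in those cliques. The participation index uses the standard definition, the minimum over a pattern's features of the fraction of that feature's instances taking part in some co-location instance. All function and variable names are hypothetical.

```python
from collections import defaultdict


def build_instance_table(cliques):
    """Group instances of maximal cliques by the clique's feature set.

    cliques: iterable of cliques, each a list of (feature, instance_id).
    Returns {frozenset(features): {feature: set(instance_ids)}}.
    NOTE: this layout is an assumed simplification of the paper's table.
    """
    table = defaultdict(lambda: defaultdict(set))
    for clique in cliques:
        key = frozenset(feature for feature, _ in clique)
        for feature, instance_id in clique:
            table[key][feature].add(instance_id)
    return table


def participation_index(pattern, table, feature_counts):
    """Minimum participation ratio over the features of `pattern`.

    feature_counts[f] is the total number of instances of feature f.
    """
    pattern = frozenset(pattern)
    participating = {feature: set() for feature in pattern}
    for key, by_feature in table.items():
        # Any clique whose features cover the pattern contains its instances.
        if pattern <= key:
            for feature in pattern:
                participating[feature] |= by_feature[feature]
    return min(len(participating[f]) / feature_counts[f] for f in pattern)


# Toy data: instances A1, A2, B1, B2, C1 and three maximal cliques.
cliques = [
    [("A", 1), ("B", 1), ("C", 1)],
    [("A", 2), ("B", 1)],
    [("B", 2)],
]
feature_counts = {"A": 2, "B": 2, "C": 1}
table = build_instance_table(cliques)

min_prev = 0.4  # hypothetical prevalence threshold
pi_ab = participation_index({"A", "B"}, table, feature_counts)
print(pi_ab, pi_ab >= min_prev)  # 0.5 True
```

In this toy case both A instances take part in an {A, B} co-location but only one of the two B instances does, so the index is 0.5 and the pattern passes the assumed 0.4 threshold.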