Efficient spatial co-location pattern mining on multiple GPUs

•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data. In this paper, we investigate Co-location Pattern Minin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2018-03, Vol.93, p.465-483
Hauptverfasser: Andrzejewski, W., Boinski, P.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 483
container_issue
container_start_page 465
container_title Expert systems with applications
container_volume 93
creator Andrzejewski, W.
Boinski, P.
description •A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data. In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations.
doi_str_mv 10.1016/j.eswa.2017.10.025
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1982204863</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417417306991</els_id><sourcerecordid>1982204863</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</originalsourceid><addsrcrecordid>eNp9kE9LwzAYh4MoOKdfwFPBc2uS5i_oQcY2hYEe3Dk0aSIpXVuTTPHbmzLPnvLy8Pu9eXkAuEWwQhCx-66y8bupMEQ8gwpiegYWSPC6ZFzW52ABJeUlQZxcgqsYO5iDEPIFeFw75423Qyri1CTf9IUZy340eR6HIqNkw1Ac_OCHjyKTw7FPfuptsX3bx2tw4Zo-2pu_dwn2m_X76rncvW5fVk-70tRYpJIbyTSmTmsnCRTaWUwQMkhb1GqmZUONZralQrgGMeJqyRkmFDFphOCc1ktwd9o7hfHzaGNS3XgMQ_5SISkwhkSwOqfwKWXCGGOwTk3BH5rwoxBUsybVqVmTmjXNLGvKpYdTyeb7v7wNKs46jG19sCapdvT_1X8B5KNwJA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1982204863</pqid></control><display><type>article</type><title>Efficient spatial co-location pattern mining on multiple GPUs</title><source>Elsevier ScienceDirect Journals Complete - AutoHoldings</source><creator>Andrzejewski, W. ; Boinski, P.</creator><creatorcontrib>Andrzejewski, W. ; Boinski, P.</creatorcontrib><description>•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data. In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2017.10.025</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Co-location pattern mining ; Compression ; Data analysis ; Data compression ; Data mining ; Data structures ; Datasets ; Energy consumption ; Environmental science ; Epidemiology ; GPGPU ; Graphics processing units ; Parallel computing ; Pattern analysis ; Performance enhancement ; Spatial co-location ; Spatial data ; Studies</subject><ispartof>Expert systems with applications, 2018-03, Vol.93, p.465-483</ispartof><rights>2017 Elsevier Ltd</rights><rights>Copyright Elsevier BV Mar 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</citedby><cites>FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2017.10.025$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3549,27923,27924,45994</link.rule.ids></links><search><creatorcontrib>Andrzejewski, W.</creatorcontrib><creatorcontrib>Boinski, P.</creatorcontrib><title>Efficient spatial co-location pattern mining on multiple GPUs</title><title>Expert systems with applications</title><description>•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data. In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations.</description><subject>Co-location pattern mining</subject><subject>Compression</subject><subject>Data analysis</subject><subject>Data compression</subject><subject>Data mining</subject><subject>Data structures</subject><subject>Datasets</subject><subject>Energy consumption</subject><subject>Environmental science</subject><subject>Epidemiology</subject><subject>GPGPU</subject><subject>Graphics processing units</subject><subject>Parallel computing</subject><subject>Pattern analysis</subject><subject>Performance enhancement</subject><subject>Spatial co-location</subject><subject>Spatial data</subject><subject>Studies</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LwzAYh4MoOKdfwFPBc2uS5i_oQcY2hYEe3Dk0aSIpXVuTTPHbmzLPnvLy8Pu9eXkAuEWwQhCx-66y8bupMEQ8gwpiegYWSPC6ZFzW52ABJeUlQZxcgqsYO5iDEPIFeFw75423Qyri1CTf9IUZy340eR6HIqNkw1Ac_OCHjyKTw7FPfuptsX3bx2tw4Zo-2pu_dwn2m_X76rncvW5fVk-70tRYpJIbyTSmTmsnCRTaWUwQMkhb1GqmZUONZralQrgGMeJqyRkmFDFphOCc1ktwd9o7hfHzaGNS3XgMQ_5SISkwhkSwOqfwKWXCGGOwTk3BH5rwoxBUsybVqVmTmjXNLGvKpYdTyeb7v7wNKs46jG19sCapdvT_1X8B5KNwJA</recordid><startdate>20180301</startdate><enddate>20180301</enddate><creator>Andrzejewski, W.</creator><creator>Boinski, P.</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20180301</creationdate><title>Efficient spatial co-location pattern mining on multiple GPUs</title><author>Andrzejewski, W. ; Boinski, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Co-location pattern mining</topic><topic>Compression</topic><topic>Data analysis</topic><topic>Data compression</topic><topic>Data mining</topic><topic>Data structures</topic><topic>Datasets</topic><topic>Energy consumption</topic><topic>Environmental science</topic><topic>Epidemiology</topic><topic>GPGPU</topic><topic>Graphics processing units</topic><topic>Parallel computing</topic><topic>Pattern analysis</topic><topic>Performance enhancement</topic><topic>Spatial co-location</topic><topic>Spatial data</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Andrzejewski, W.</creatorcontrib><creatorcontrib>Boinski, P.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Andrzejewski, W.</au><au>Boinski, P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient spatial co-location pattern mining on multiple GPUs</atitle><jtitle>Expert systems with applications</jtitle><date>2018-03-01</date><risdate>2018</risdate><volume>93</volume><spage>465</spage><epage>483</epage><pages>465-483</pages><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data. In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2017.10.025</doi><tpages>19</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2018-03, Vol.93, p.465-483
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_journals_1982204863
source Elsevier ScienceDirect Journals Complete - AutoHoldings
subjects Co-location pattern mining
Compression
Data analysis
Data compression
Data mining
Data structures
Datasets
Energy consumption
Environmental science
Epidemiology
GPGPU
Graphics processing units
Parallel computing
Pattern analysis
Performance enhancement
Spatial co-location
Spatial data
Studies
title Efficient spatial co-location pattern mining on multiple GPUs
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T13%3A37%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20spatial%20co-location%20pattern%20mining%20on%20multiple%20GPUs&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Andrzejewski,%20W.&rft.date=2018-03-01&rft.volume=93&rft.spage=465&rft.epage=483&rft.pages=465-483&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2017.10.025&rft_dat=%3Cproquest_cross%3E1982204863%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1982204863&rft_id=info:pmid/&rft_els_id=S0957417417306991&rfr_iscdi=true