Efficient spatial co-location pattern mining on multiple GPUs
•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data. In this paper, we investigate Co-location Pattern Minin...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2018-03, Vol.93, p.465-483 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 483 |
---|---|
container_issue | |
container_start_page | 465 |
container_title | Expert systems with applications |
container_volume | 93 |
creator | Andrzejewski, W. Boinski, P. |
description | •A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data.
In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations. |
doi_str_mv | 10.1016/j.eswa.2017.10.025 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1982204863</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417417306991</els_id><sourcerecordid>1982204863</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</originalsourceid><addsrcrecordid>eNp9kE9LwzAYh4MoOKdfwFPBc2uS5i_oQcY2hYEe3Dk0aSIpXVuTTPHbmzLPnvLy8Pu9eXkAuEWwQhCx-66y8bupMEQ8gwpiegYWSPC6ZFzW52ABJeUlQZxcgqsYO5iDEPIFeFw75423Qyri1CTf9IUZy340eR6HIqNkw1Ac_OCHjyKTw7FPfuptsX3bx2tw4Zo-2pu_dwn2m_X76rncvW5fVk-70tRYpJIbyTSmTmsnCRTaWUwQMkhb1GqmZUONZralQrgGMeJqyRkmFDFphOCc1ktwd9o7hfHzaGNS3XgMQ_5SISkwhkSwOqfwKWXCGGOwTk3BH5rwoxBUsybVqVmTmjXNLGvKpYdTyeb7v7wNKs46jG19sCapdvT_1X8B5KNwJA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1982204863</pqid></control><display><type>article</type><title>Efficient spatial co-location pattern mining on multiple GPUs</title><source>Elsevier ScienceDirect Journals Complete - AutoHoldings</source><creator>Andrzejewski, W. ; Boinski, P.</creator><creatorcontrib>Andrzejewski, W. ; Boinski, P.</creatorcontrib><description>•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data.
In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2017.10.025</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Co-location pattern mining ; Compression ; Data analysis ; Data compression ; Data mining ; Data structures ; Datasets ; Energy consumption ; Environmental science ; Epidemiology ; GPGPU ; Graphics processing units ; Parallel computing ; Pattern analysis ; Performance enhancement ; Spatial co-location ; Spatial data ; Studies</subject><ispartof>Expert systems with applications, 2018-03, Vol.93, p.465-483</ispartof><rights>2017 Elsevier Ltd</rights><rights>Copyright Elsevier BV Mar 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</citedby><cites>FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2017.10.025$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3549,27923,27924,45994</link.rule.ids></links><search><creatorcontrib>Andrzejewski, W.</creatorcontrib><creatorcontrib>Boinski, P.</creatorcontrib><title>Efficient spatial co-location pattern mining on multiple GPUs</title><title>Expert systems with applications</title><description>•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data.
In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations.</description><subject>Co-location pattern mining</subject><subject>Compression</subject><subject>Data analysis</subject><subject>Data compression</subject><subject>Data mining</subject><subject>Data structures</subject><subject>Datasets</subject><subject>Energy consumption</subject><subject>Environmental science</subject><subject>Epidemiology</subject><subject>GPGPU</subject><subject>Graphics processing units</subject><subject>Parallel computing</subject><subject>Pattern analysis</subject><subject>Performance enhancement</subject><subject>Spatial co-location</subject><subject>Spatial data</subject><subject>Studies</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LwzAYh4MoOKdfwFPBc2uS5i_oQcY2hYEe3Dk0aSIpXVuTTPHbmzLPnvLy8Pu9eXkAuEWwQhCx-66y8bupMEQ8gwpiegYWSPC6ZFzW52ABJeUlQZxcgqsYO5iDEPIFeFw75423Qyri1CTf9IUZy340eR6HIqNkw1Ac_OCHjyKTw7FPfuptsX3bx2tw4Zo-2pu_dwn2m_X76rncvW5fVk-70tRYpJIbyTSmTmsnCRTaWUwQMkhb1GqmZUONZralQrgGMeJqyRkmFDFphOCc1ktwd9o7hfHzaGNS3XgMQ_5SISkwhkSwOqfwKWXCGGOwTk3BH5rwoxBUsybVqVmTmjXNLGvKpYdTyeb7v7wNKs46jG19sCapdvT_1X8B5KNwJA</recordid><startdate>20180301</startdate><enddate>20180301</enddate><creator>Andrzejewski, W.</creator><creator>Boinski, P.</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20180301</creationdate><title>Efficient spatial co-location pattern mining on multiple GPUs</title><author>Andrzejewski, W. ; Boinski, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-7c96b25fbbf9408bfe2411c1be1db6b9a5cb6ed588fa164f3976245169c887753</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Co-location pattern mining</topic><topic>Compression</topic><topic>Data analysis</topic><topic>Data compression</topic><topic>Data mining</topic><topic>Data structures</topic><topic>Datasets</topic><topic>Energy consumption</topic><topic>Environmental science</topic><topic>Epidemiology</topic><topic>GPGPU</topic><topic>Graphics processing units</topic><topic>Parallel computing</topic><topic>Pattern analysis</topic><topic>Performance enhancement</topic><topic>Spatial co-location</topic><topic>Spatial data</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Andrzejewski, W.</creatorcontrib><creatorcontrib>Boinski, P.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Andrzejewski, W.</au><au>Boinski, P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient spatial co-location pattern mining on multiple GPUs</atitle><jtitle>Expert systems with applications</jtitle><date>2018-03-01</date><risdate>2018</risdate><volume>93</volume><spage>465</spage><epage>483</epage><pages>465-483</pages><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•A generic parallel algorithm for Co-location Pattern Mining.•Support for multi GPU architectures.•A specialized variant of the algorithm optimized for the NVIDIA GPUs.•Memory-aware solution.•A dedicated algorithm for compression of input data.
In this paper, we investigate Co-location Pattern Mining (CPM) from big spatial datasets. CPM consists in searching for types of objects that are frequently located together in a spatial neighborhood. Knowledge about such patterns is very important in fields like biology, environmental sciences, epidemiology etc. However, CPM is computationally challenging, mainly due to the large number of pattern instances hidden in spatial data. In this work, we propose a new solution that can utilize the power of multiple GPUs to increase the performance of CPM. The proposed solution is also capable of coping with the GPU memory limits by dividing the work into multiple packages and compressing internal data structures. Experiments performed on large synthetic and real-world datasets prove that we can achieve an order of magnitude speedups in comparison to the efficient multithreaded CPU implementation. Our solution can greatly improve the performance of data analysis, using widely available and energy efficient graphics cards. As a result, CPM in large datasets is more viable for university researchers as well as smaller companies and organizations.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2017.10.025</doi><tpages>19</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2018-03, Vol.93, p.465-483 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_journals_1982204863 |
source | Elsevier ScienceDirect Journals Complete - AutoHoldings |
subjects | Co-location pattern mining Compression Data analysis Data compression Data mining Data structures Datasets Energy consumption Environmental science Epidemiology GPGPU Graphics processing units Parallel computing Pattern analysis Performance enhancement Spatial co-location Spatial data Studies |
title | Efficient spatial co-location pattern mining on multiple GPUs |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T13%3A37%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20spatial%20co-location%20pattern%20mining%20on%20multiple%20GPUs&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Andrzejewski,%20W.&rft.date=2018-03-01&rft.volume=93&rft.spage=465&rft.epage=483&rft.pages=465-483&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2017.10.025&rft_dat=%3Cproquest_cross%3E1982204863%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1982204863&rft_id=info:pmid/&rft_els_id=S0957417417306991&rfr_iscdi=true |