Data-parallel query processing on non-uniform data
Graphics processing units (GPUs) promise spectacular performance advantages when used as database coprocessors. Their massive compute capacity, however, is often hampered by control flow divergence caused by non-uniform data distributions. When data-parallel work items demand for different amounts o...
Gespeichert in:
Veröffentlicht in: | Proceedings of the VLDB Endowment 2020-02, Vol.13 (6), p.884-897 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 897 |
---|---|
container_issue | 6 |
container_start_page | 884 |
container_title | Proceedings of the VLDB Endowment |
container_volume | 13 |
creator | Funke, Henning Teubner, Jens |
description | Graphics processing units (GPUs) promise spectacular performance advantages when used as database coprocessors. Their massive compute capacity, however, is often hampered by
control flow divergence
caused by non-uniform data distributions. When data-parallel work items demand for different amounts or types of processing, instructions execute with lowered efficiency.
Query compilation
techniques---a recent advance in GPU-accelerated database processing---suffer from the problem even more, because divergence effects are amplified during the execution of fused pipeline operators.
In this work, we identify two types of control flow divergence---
filter divergence
and
expansion divergence
---that frequently occur in real world workloads. We quantify the problem for two poster cases and propose techniques to balance these divergence effects. By balancing divergence effects, our approach is able to restore processing efficiency even when pipelines contain heavily skewed operations. Our query compiler DogQC has a wider range of functionality than other query coprocessors
and
achieves performance improvements. We observe shorter execution times for TPC-H benchmark queries by factors up to 4.51x compared with existing GPU query compilers and by factors up to 4.54x compared with CPU-based systems. |
doi_str_mv | 10.14778/3380750.3380758 |
format | Article |
fullrecord | <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_14778_3380750_3380758</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_14778_3380750_3380758</sourcerecordid><originalsourceid>FETCH-LOGICAL-c243t-621fb885b1a8af182e7c7b21a8f1f22320d95177bd815a7a5defd3b1de311bc83</originalsourceid><addsrcrecordid>eNpNj0tLAzEUhYMoWKt7l_kDqfcmzeR2KfVRoeBG10OeMjLNjEm76L-32Fm4-s6Bw4GPsXuEBS6NoQelCIyGxZl0wWYSNQiClbn8l6_ZTa3fAA01SDMmn-zeitEW2_ex5z-HWI58LIOPtXb5iw-Z5yGLQ-7SUHY8nNa37CrZvsa7iXP2-fL8sd6I7fvr2_pxK7xcqr1oJCZHpB1asglJRuONk6eWMEmpJISVRmNcINTWWB1iCsphiArReVJzBudfX4ZaS0ztWLqdLccWof1zbifniaR-Ae18SS4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Data-parallel query processing on non-uniform data</title><source>ACM Digital Library Complete</source><creator>Funke, Henning ; Teubner, Jens</creator><creatorcontrib>Funke, Henning ; Teubner, Jens</creatorcontrib><description>Graphics processing units (GPUs) promise spectacular performance advantages when used as database coprocessors. Their massive compute capacity, however, is often hampered by
control flow divergence
caused by non-uniform data distributions. When data-parallel work items demand for different amounts or types of processing, instructions execute with lowered efficiency.
Query compilation
techniques---a recent advance in GPU-accelerated database processing---suffer from the problem even more, because divergence effects are amplified during the execution of fused pipeline operators.
In this work, we identify two types of control flow divergence---
filter divergence
and
expansion divergence
---that frequently occur in real world workloads. We quantify the problem for two poster cases and propose techniques to balance these divergence effects. By balancing divergence effects, our approach is able to restore processing efficiency even when pipelines contain heavily skewed operations. Our query compiler DogQC has a wider range of functionality than other query coprocessors
and
achieves performance improvements. We observe shorter execution times for TPC-H benchmark queries by factors up to 4.51x compared with existing GPU query compilers and by factors up to 4.54x compared with CPU-based systems.</description><identifier>ISSN: 2150-8097</identifier><identifier>EISSN: 2150-8097</identifier><identifier>DOI: 10.14778/3380750.3380758</identifier><language>eng</language><ispartof>Proceedings of the VLDB Endowment, 2020-02, Vol.13 (6), p.884-897</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c243t-621fb885b1a8af182e7c7b21a8f1f22320d95177bd815a7a5defd3b1de311bc83</citedby><cites>FETCH-LOGICAL-c243t-621fb885b1a8af182e7c7b21a8f1f22320d95177bd815a7a5defd3b1de311bc83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Funke, Henning</creatorcontrib><creatorcontrib>Teubner, Jens</creatorcontrib><title>Data-parallel query processing on non-uniform data</title><title>Proceedings of the VLDB Endowment</title><description>Graphics processing units (GPUs) promise spectacular performance advantages when used as database coprocessors. Their massive compute capacity, however, is often hampered by
control flow divergence
caused by non-uniform data distributions. When data-parallel work items demand for different amounts or types of processing, instructions execute with lowered efficiency.
Query compilation
techniques---a recent advance in GPU-accelerated database processing---suffer from the problem even more, because divergence effects are amplified during the execution of fused pipeline operators.
In this work, we identify two types of control flow divergence---
filter divergence
and
expansion divergence
---that frequently occur in real world workloads. We quantify the problem for two poster cases and propose techniques to balance these divergence effects. By balancing divergence effects, our approach is able to restore processing efficiency even when pipelines contain heavily skewed operations. Our query compiler DogQC has a wider range of functionality than other query coprocessors
and
achieves performance improvements. We observe shorter execution times for TPC-H benchmark queries by factors up to 4.51x compared with existing GPU query compilers and by factors up to 4.54x compared with CPU-based systems.</description><issn>2150-8097</issn><issn>2150-8097</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNpNj0tLAzEUhYMoWKt7l_kDqfcmzeR2KfVRoeBG10OeMjLNjEm76L-32Fm4-s6Bw4GPsXuEBS6NoQelCIyGxZl0wWYSNQiClbn8l6_ZTa3fAA01SDMmn-zeitEW2_ex5z-HWI58LIOPtXb5iw-Z5yGLQ-7SUHY8nNa37CrZvsa7iXP2-fL8sd6I7fvr2_pxK7xcqr1oJCZHpB1asglJRuONk6eWMEmpJISVRmNcINTWWB1iCsphiArReVJzBudfX4ZaS0ztWLqdLccWof1zbifniaR-Ae18SS4</recordid><startdate>20200201</startdate><enddate>20200201</enddate><creator>Funke, Henning</creator><creator>Teubner, Jens</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20200201</creationdate><title>Data-parallel query processing on non-uniform data</title><author>Funke, Henning ; Teubner, Jens</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c243t-621fb885b1a8af182e7c7b21a8f1f22320d95177bd815a7a5defd3b1de311bc83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Funke, Henning</creatorcontrib><creatorcontrib>Teubner, Jens</creatorcontrib><collection>CrossRef</collection><jtitle>Proceedings of the VLDB Endowment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Funke, Henning</au><au>Teubner, Jens</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data-parallel query processing on non-uniform data</atitle><jtitle>Proceedings of the VLDB Endowment</jtitle><date>2020-02-01</date><risdate>2020</risdate><volume>13</volume><issue>6</issue><spage>884</spage><epage>897</epage><pages>884-897</pages><issn>2150-8097</issn><eissn>2150-8097</eissn><abstract>Graphics processing units (GPUs) promise spectacular performance advantages when used as database coprocessors. Their massive compute capacity, however, is often hampered by
control flow divergence
caused by non-uniform data distributions. When data-parallel work items demand for different amounts or types of processing, instructions execute with lowered efficiency.
Query compilation
techniques---a recent advance in GPU-accelerated database processing---suffer from the problem even more, because divergence effects are amplified during the execution of fused pipeline operators.
In this work, we identify two types of control flow divergence---
filter divergence
and
expansion divergence
---that frequently occur in real world workloads. We quantify the problem for two poster cases and propose techniques to balance these divergence effects. By balancing divergence effects, our approach is able to restore processing efficiency even when pipelines contain heavily skewed operations. Our query compiler DogQC has a wider range of functionality than other query coprocessors
and
achieves performance improvements. We observe shorter execution times for TPC-H benchmark queries by factors up to 4.51x compared with existing GPU query compilers and by factors up to 4.54x compared with CPU-based systems.</abstract><doi>10.14778/3380750.3380758</doi><tpages>14</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2150-8097 |
ispartof | Proceedings of the VLDB Endowment, 2020-02, Vol.13 (6), p.884-897 |
issn | 2150-8097 2150-8097 |
language | eng |
recordid | cdi_crossref_primary_10_14778_3380750_3380758 |
source | ACM Digital Library Complete |
title | Data-parallel query processing on non-uniform data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-31T00%3A27%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data-parallel%20query%20processing%20on%20non-uniform%20data&rft.jtitle=Proceedings%20of%20the%20VLDB%20Endowment&rft.au=Funke,%20Henning&rft.date=2020-02-01&rft.volume=13&rft.issue=6&rft.spage=884&rft.epage=897&rft.pages=884-897&rft.issn=2150-8097&rft.eissn=2150-8097&rft_id=info:doi/10.14778/3380750.3380758&rft_dat=%3Ccrossref%3E10_14778_3380750_3380758%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |