A piecewise-hierarchical particle count method suitable for the implementation of the unified gas-kinetic wave–particle method on graphics processing unit devices
The particle counting part in the unified gas-kinetic wave–particle method on graphics processing unit (GPU) devices is computationally intensive. This paper introduces a piecewise-hierarchical (P-H) particle counting strategy tailored for the Single Instruction Multiple Threads architecture, which...
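Published in: | Physics of fluids (1994) 2024-10, Vol.36 (10) |
---|---|
Main authors: | Yu, Hang; Xie, Wenjia; Ren, Weijie; Tian, Zhengyu |
Format: | Article |
Language: | English |
Subjects: | Counting; Graphics processing units; Performance evaluation |
Online access: | Full text |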
container_end_page | |
---|---|
container_issue | 10 |
container_start_page | |
container_title | Physics of fluids (1994) |
container_volume | 36 |
creator | Yu, Hang; Xie, Wenjia; Ren, Weijie; Tian, Zhengyu |
description | The particle counting part in the unified gas-kinetic wave–particle method on graphics processing unit (GPU) devices is computationally intensive. This paper introduces a piecewise-hierarchical (P-H) particle counting strategy tailored for the Single Instruction Multiple Threads architecture, which leverages GPU memory hierarchy to reduce access conflicts. The strategy was evaluated based on throughput, roofline performance, and computation time metrics. Compared to the global counting strategy, the P-H approach achieved a 3.37× speedup for the particle counting kernel, and the overall program experienced a performance boost of more than 30%. |
doi_str_mv | 10.1063/5.0227425 |
format | Article |
publisher | Melville: American Institute of Physics |
rights | 2024 Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
coden | PHFLE6 |
eissn | 1089-7666 |
orcidid | https://orcid.org/0000-0002-5198-8464 ; https://orcid.org/0000-0003-4664-7996 |
tpages | 9 |
toplevel | peer_reviewed; online_resources |
oa | free_for_read |
collection | AIP Open Access Journals; CrossRef; Technology Research Database; Aerospace Database; Advanced Technologies Database with Aerospace |
sourceid | proquest_cross |
fulltext | fulltext |
identifier | ISSN: 1070-6631 |
ispartof | Physics of fluids (1994), 2024-10, Vol.36 (10) |
issn | 1070-6631 1089-7666 |
language | eng |
recordid | cdi_proquest_journals_3112932353 |
source | American Institute of Physics (AIP) Journals |
subjects | Counting; Graphics processing units; Performance evaluation |
title | A piecewise-hierarchical particle count method suitable for the implementation of the unified gas-kinetic wave–particle method on graphics processing unit devices |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T11%3A21%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20piecewise-hierarchical%20particle%20count%20method%20suitable%20for%20the%20implementation%20of%20the%20unified%20gas-kinetic%20wave%E2%80%93particle%20method%20on%20graphics%20processing%20unit%20devices&rft.jtitle=Physics%20of%20fluids%20(1994)&rft.au=Yu,%20Hang&rft.date=2024-10&rft.volume=36&rft.issue=10&rft.issn=1070-6631&rft.eissn=1089-7666&rft.coden=PHFLE6&rft_id=info:doi/10.1063/5.0227425&rft_dat=%3Cproquest_cross%3E3112932353%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3112932353&rft_id=info:pmid/&rfr_iscdi=true |
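
The abstract contrasts a global counting strategy with one that uses the GPU memory hierarchy to reduce access conflicts. The record does not give the algorithm itself, so the following is only a minimal plain-Python model of the general two-level idea, not the authors' P-H method: particles are processed in fixed-size chunks, each chunk accumulates into its own local counters first, and only the per-chunk totals are merged into the shared array. The names `count_global`, `count_two_level`, and `block_size` are hypothetical, and the "update" tally merely stands in for the number of contended writes to the shared array.

```python
from collections import Counter


def count_global(cell_ids, num_cells):
    """Baseline model: every particle updates the shared counter array directly."""
    counts = [0] * num_cells
    for cell in cell_ids:
        counts[cell] += 1  # on a GPU this would be one global atomic add per particle
    return counts, len(cell_ids)  # (per-cell counts, number of shared-array updates)


def count_two_level(cell_ids, num_cells, block_size):
    """Two-level model: count inside each chunk first, then merge the chunk totals."""
    counts = [0] * num_cells
    shared_updates = 0
    for start in range(0, len(cell_ids), block_size):
        # Local counters for this chunk (an assumption: these stand in for
        # fast per-block storage such as GPU shared memory).
        local = Counter(cell_ids[start:start + block_size])
        for cell, n in local.items():
            counts[cell] += n  # one shared-array update per distinct cell in the chunk
            shared_updates += 1
    return counts, shared_updates


particles = [0, 0, 1, 0, 1, 1, 2, 2]  # cell index of each particle (made-up data)
global_counts, global_updates = count_global(particles, 3)
local_counts, local_updates = count_two_level(particles, 3, block_size=4)
```

Both functions return the same per-cell counts; the two-level variant issues fewer updates to the shared array whenever several particles in a chunk fall into the same cell. The speedup figures quoted in the abstract come from the paper's own GPU measurements and are not reproduced by this model.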