kD-Tree Traversal Implementations for Ray Tracing on Massive Multiprocessors: A Comparative Study

Current GPU computational power enables the execution of complex and parallel algorithms, such as Ray Tracing techniques supported by kD-trees for 3D scene rendering in real time. This work describes in detail the study and implementation of five different kD-Tree traversal algorithms using the para...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	dos Santos, A.L., Teixeira, J., de Farias, T.S.M.C., Teichrieb, V., Kelner, J.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Communication standards CUDA Decoding Digital video broadcasting Error correction codes Integrated circuit interconnections Iterative algorithms kD-Tree Network-on-a-chip Parity check codes Ray tracing Traversal WiMAX
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	48
container_issue
container_start_page	41
container_title
container_volume
creator	dos Santos, A.L. Teixeira, J. de Farias, T.S.M.C. Teichrieb, V. Kelner, J.
description	Current GPU computational power enables the execution of complex and parallel algorithms, such as Ray Tracing techniques supported by kD-trees for 3D scene rendering in real time. This work describes in detail the study and implementation of five different kD-Tree traversal algorithms using the parallel framework NVIDIA Compute Unified Device Architecture (CUDA), in order to point their pros and cons regarding adaptation capability to the chosen architecture. In addition, a new algorithm is proposed by the authors based on this analysis, aiming performance improvement. A performance analysis of the implemented techniques demonstrates that two of these algorithms, once adequately adapted to CUDA architecture, are capable of reaching speedup gains up to 15times when compared to former CPU implementations and up to 4times in comparison to existing and optimized parallel ones. As a consequence, interactive frame rates are possible for scenes with 1376 times 768 pixels of resolution and 1 million primitives.
doi_str_mv	10.1109/SBAC-PAD.2009.25
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5336215</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5336215</ieee_id><sourcerecordid>5336215</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-52fd9a3ae06d0749ad759a83eb2c36c107e3c4ceaa28606a34523c42ce5f094f3</originalsourceid><addsrcrecordid>eNotj8tOwzAURC0eEm1hj8TGP-BybcdOzS6kPCq1AtGwri7JDQrkJTut1L8nCFYjzZwZaRi7ljCXEtzt9j5JxWuynCsAN1fmhE2UjbTQAPKUTSG2zuiFie0Zm0hjQFij9QWbhvAFoK1ydsLweykyT8QzjwfyAWu-avqaGmoHHKquDbzsPH_D4y-RV-0n71q-wRCqA_HNvh6q3nc5hdD5cMcTnnZNj36sjvF22BfHS3ZeYh3o6l9n7P3xIUufxfrlaZUma1HJ2AzCqLJwqJHAFhBHDovYOFxo-lC5trmEmHQe5YSoFhYs6sio0VA5mRJcVOoZu_nbrYho1_uqQX_cjY-tkkb_AECUVuo</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>kD-Tree Traversal Implementations for Ray Tracing on Massive Multiprocessors: A Comparative Study</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>dos Santos, A.L. ; Teixeira, J. ; de Farias, T.S.M.C. ; Teichrieb, V. ; Kelner, J.</creator><creatorcontrib>dos Santos, A.L. ; Teixeira, J. ; de Farias, T.S.M.C. ; Teichrieb, V. ; Kelner, J.</creatorcontrib><description>Current GPU computational power enables the execution of complex and parallel algorithms, such as Ray Tracing techniques supported by kD-trees for 3D scene rendering in real time. This work describes in detail the study and implementation of five different kD-Tree traversal algorithms using the parallel framework NVIDIA Compute Unified Device Architecture (CUDA), in order to point their pros and cons regarding adaptation capability to the chosen architecture. In addition, a new algorithm is proposed by the authors based on this analysis, aiming performance improvement. A performance analysis of the implemented techniques demonstrates that two of these algorithms, once adequately adapted to CUDA architecture, are capable of reaching speedup gains up to 15times when compared to former CPU implementations and up to 4times in comparison to existing and optimized parallel ones. As a consequence, interactive frame rates are possible for scenes with 1376 times 768 pixels of resolution and 1 million primitives.</description><identifier>ISSN: 1550-6533</identifier><identifier>ISBN: 0769538576</identifier><identifier>ISBN: 9780769538570</identifier><identifier>EISSN: 2643-3001</identifier><identifier>DOI: 10.1109/SBAC-PAD.2009.25</identifier><language>eng</language><publisher>IEEE</publisher><subject>Communication standards ; CUDA ; Decoding ; Digital video broadcasting ; Error correction codes ; Integrated circuit interconnections ; Iterative algorithms ; kD-Tree ; Network-on-a-chip ; Parity check codes ; Ray tracing ; Traversal ; WiMAX</subject><ispartof>2009 21st International Symposium on Computer Architecture and High Performance Computing, 2009, p.41-48</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5336215$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5336215$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>dos Santos, A.L.</creatorcontrib><creatorcontrib>Teixeira, J.</creatorcontrib><creatorcontrib>de Farias, T.S.M.C.</creatorcontrib><creatorcontrib>Teichrieb, V.</creatorcontrib><creatorcontrib>Kelner, J.</creatorcontrib><title>kD-Tree Traversal Implementations for Ray Tracing on Massive Multiprocessors: A Comparative Study</title><title>2009 21st International Symposium on Computer Architecture and High Performance Computing</title><addtitle>SBACPAD</addtitle><description>Current GPU computational power enables the execution of complex and parallel algorithms, such as Ray Tracing techniques supported by kD-trees for 3D scene rendering in real time. This work describes in detail the study and implementation of five different kD-Tree traversal algorithms using the parallel framework NVIDIA Compute Unified Device Architecture (CUDA), in order to point their pros and cons regarding adaptation capability to the chosen architecture. In addition, a new algorithm is proposed by the authors based on this analysis, aiming performance improvement. A performance analysis of the implemented techniques demonstrates that two of these algorithms, once adequately adapted to CUDA architecture, are capable of reaching speedup gains up to 15times when compared to former CPU implementations and up to 4times in comparison to existing and optimized parallel ones. As a consequence, interactive frame rates are possible for scenes with 1376 times 768 pixels of resolution and 1 million primitives.</description><subject>Communication standards</subject><subject>CUDA</subject><subject>Decoding</subject><subject>Digital video broadcasting</subject><subject>Error correction codes</subject><subject>Integrated circuit interconnections</subject><subject>Iterative algorithms</subject><subject>kD-Tree</subject><subject>Network-on-a-chip</subject><subject>Parity check codes</subject><subject>Ray tracing</subject><subject>Traversal</subject><subject>WiMAX</subject><issn>1550-6533</issn><issn>2643-3001</issn><isbn>0769538576</isbn><isbn>9780769538570</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2009</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj8tOwzAURC0eEm1hj8TGP-BybcdOzS6kPCq1AtGwri7JDQrkJTut1L8nCFYjzZwZaRi7ljCXEtzt9j5JxWuynCsAN1fmhE2UjbTQAPKUTSG2zuiFie0Zm0hjQFij9QWbhvAFoK1ydsLweykyT8QzjwfyAWu-avqaGmoHHKquDbzsPH_D4y-RV-0n71q-wRCqA_HNvh6q3nc5hdD5cMcTnnZNj36sjvF22BfHS3ZeYh3o6l9n7P3xIUufxfrlaZUma1HJ2AzCqLJwqJHAFhBHDovYOFxo-lC5trmEmHQe5YSoFhYs6sio0VA5mRJcVOoZu_nbrYho1_uqQX_cjY-tkkb_AECUVuo</recordid><startdate>200910</startdate><enddate>200910</enddate><creator>dos Santos, A.L.</creator><creator>Teixeira, J.</creator><creator>de Farias, T.S.M.C.</creator><creator>Teichrieb, V.</creator><creator>Kelner, J.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>200910</creationdate><title>kD-Tree Traversal Implementations for Ray Tracing on Massive Multiprocessors: A Comparative Study</title><author>dos Santos, A.L. ; Teixeira, J. ; de Farias, T.S.M.C. ; Teichrieb, V. ; Kelner, J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-52fd9a3ae06d0749ad759a83eb2c36c107e3c4ceaa28606a34523c42ce5f094f3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Communication standards</topic><topic>CUDA</topic><topic>Decoding</topic><topic>Digital video broadcasting</topic><topic>Error correction codes</topic><topic>Integrated circuit interconnections</topic><topic>Iterative algorithms</topic><topic>kD-Tree</topic><topic>Network-on-a-chip</topic><topic>Parity check codes</topic><topic>Ray tracing</topic><topic>Traversal</topic><topic>WiMAX</topic><toplevel>online_resources</toplevel><creatorcontrib>dos Santos, A.L.</creatorcontrib><creatorcontrib>Teixeira, J.</creatorcontrib><creatorcontrib>de Farias, T.S.M.C.</creatorcontrib><creatorcontrib>Teichrieb, V.</creatorcontrib><creatorcontrib>Kelner, J.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>dos Santos, A.L.</au><au>Teixeira, J.</au><au>de Farias, T.S.M.C.</au><au>Teichrieb, V.</au><au>Kelner, J.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>kD-Tree Traversal Implementations for Ray Tracing on Massive Multiprocessors: A Comparative Study</atitle><btitle>2009 21st International Symposium on Computer Architecture and High Performance Computing</btitle><stitle>SBACPAD</stitle><date>2009-10</date><risdate>2009</risdate><spage>41</spage><epage>48</epage><pages>41-48</pages><issn>1550-6533</issn><eissn>2643-3001</eissn><isbn>0769538576</isbn><isbn>9780769538570</isbn><abstract>Current GPU computational power enables the execution of complex and parallel algorithms, such as Ray Tracing techniques supported by kD-trees for 3D scene rendering in real time. This work describes in detail the study and implementation of five different kD-Tree traversal algorithms using the parallel framework NVIDIA Compute Unified Device Architecture (CUDA), in order to point their pros and cons regarding adaptation capability to the chosen architecture. In addition, a new algorithm is proposed by the authors based on this analysis, aiming performance improvement. A performance analysis of the implemented techniques demonstrates that two of these algorithms, once adequately adapted to CUDA architecture, are capable of reaching speedup gains up to 15times when compared to former CPU implementations and up to 4times in comparison to existing and optimized parallel ones. As a consequence, interactive frame rates are possible for scenes with 1376 times 768 pixels of resolution and 1 million primitives.</abstract><pub>IEEE</pub><doi>10.1109/SBAC-PAD.2009.25</doi><tpages>8</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1550-6533
ispartof	2009 21st International Symposium on Computer Architecture and High Performance Computing, 2009, p.41-48
issn	1550-6533 2643-3001
language	eng
recordid	cdi_ieee_primary_5336215
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Communication standards CUDA Decoding Digital video broadcasting Error correction codes Integrated circuit interconnections Iterative algorithms kD-Tree Network-on-a-chip Parity check codes Ray tracing Traversal WiMAX
title	kD-Tree Traversal Implementations for Ray Tracing on Massive Multiprocessors: A Comparative Study
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T11%3A43%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=kD-Tree%20Traversal%20Implementations%20for%20Ray%20Tracing%20on%20Massive%20Multiprocessors:%20A%20Comparative%20Study&rft.btitle=2009%2021st%20International%20Symposium%20on%20Computer%20Architecture%20and%20High%20Performance%20Computing&rft.au=dos%20Santos,%20A.L.&rft.date=2009-10&rft.spage=41&rft.epage=48&rft.pages=41-48&rft.issn=1550-6533&rft.eissn=2643-3001&rft.isbn=0769538576&rft.isbn_list=9780769538570&rft_id=info:doi/10.1109/SBAC-PAD.2009.25&rft_dat=%3Cieee_6IE%3E5336215%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5336215&rfr_iscdi=true