Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method
Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of high...
Gespeichert in:
Veröffentlicht in: | Journal of China universities of posts and telecommunications 2015-02, Vol.22 (1), p.72-76 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 76 |
---|---|
container_issue | 1 |
container_start_page | 72 |
container_title | Journal of China universities of posts and telecommunications |
container_volume | 22 |
creator | Bing, Bai Zhongqi, Niu Yi, Niu Bing, Wei Gang, Zhao |
description | Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss. |
doi_str_mv | 10.1016/S1005-8885(15)60627-6 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1815990110</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>664734297</cqvip_id><els_id>S1005888515606276</els_id><sourcerecordid>1815990110</sourcerecordid><originalsourceid>FETCH-LOGICAL-c232t-10134fc30a7cbc95b28b8ef1aa99f48c2100a7ecde159beb91fa735497d20303</originalsourceid><addsrcrecordid>eNqFkM1KAzEUhbNQsFYfQQiu6mI0yfyvRIpVoeDC7kMmc9NGMpM2yRT69mba0q2rS27OOZfzIfRAyTMltHj5oYTkSVVV-YzmTwUpWJkUV2hyWd-gW-9_CckYI8UEDQvhA271Xntt-0Q5ALwVThgDBvvgBhkGB1hZh6Xt99YMIerwFpwCGcwBdyLIDbTYiAM4rHusdK8DxEilwEEvAQfdxbftRPztIGxse4eulTAe7s9zilaL99X8M1l-f3zN35aJZCkLSWyUZkqmRJSykXXesKqpQFEh6lpllWSxlShBtkDzuoGmpkqUaZ7VZctIStIpmp1it87uBvCBd9pLMEb0YAfPaRV9NaF0lOYnqXTWeweKb53uhDtwSvhIlh_J8hEhpzk_kuVF9L2efBBr7DU47qUeW7faRUC8tfrfhMfz5Y3t1zvdry-niyIr04zVZfoHv9uQeg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1815990110</pqid></control><display><type>article</type><title>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</title><source>Alma/SFX Local Collection</source><creator>Bing, Bai ; Zhongqi, Niu ; Yi, Niu ; Bing, Wei ; Gang, Zhao</creator><creatorcontrib>Bing, Bai ; Zhongqi, Niu ; Yi, Niu ; Bing, Wei ; Gang, Zhao</creatorcontrib><description>Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.</description><identifier>ISSN: 1005-8885</identifier><identifier>DOI: 10.1016/S1005-8885(15)60627-6</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Algorithms ; Arithmetic ; Convolution ; convolution perfectly matched layer ; Division ; division elimination ; Field programmable gate arrays ; finit difference time domain ; FPGA ; graphic processing unit ; Optimization ; parallel computing ; Perfectly matched layers ; Power consumption ; 卷积完全匹配层 ; 图形处理单元 ; 并行加速 ; 并行结构 ; 时域有限差分法 ; 现场可编程门阵列 ; 除法运算</subject><ispartof>Journal of China universities of posts and telecommunications, 2015-02, Vol.22 (1), p.72-76</ispartof><rights>2015 The Journal of China Universities of Posts and Telecommunications</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c232t-10134fc30a7cbc95b28b8ef1aa99f48c2100a7ecde159beb91fa735497d20303</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/84121X/84121X.jpg</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Bing, Bai</creatorcontrib><creatorcontrib>Zhongqi, Niu</creatorcontrib><creatorcontrib>Yi, Niu</creatorcontrib><creatorcontrib>Bing, Wei</creatorcontrib><creatorcontrib>Gang, Zhao</creatorcontrib><title>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</title><title>Journal of China universities of posts and telecommunications</title><addtitle>The Journal of China Universities of Posts and Telecommunications</addtitle><description>Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.</description><subject>Algorithms</subject><subject>Arithmetic</subject><subject>Convolution</subject><subject>convolution perfectly matched layer</subject><subject>Division</subject><subject>division elimination</subject><subject>Field programmable gate arrays</subject><subject>finit difference time domain</subject><subject>FPGA</subject><subject>graphic processing unit</subject><subject>Optimization</subject><subject>parallel computing</subject><subject>Perfectly matched layers</subject><subject>Power consumption</subject><subject>卷积完全匹配层</subject><subject>图形处理单元</subject><subject>并行加速</subject><subject>并行结构</subject><subject>时域有限差分法</subject><subject>现场可编程门阵列</subject><subject>除法运算</subject><issn>1005-8885</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNqFkM1KAzEUhbNQsFYfQQiu6mI0yfyvRIpVoeDC7kMmc9NGMpM2yRT69mba0q2rS27OOZfzIfRAyTMltHj5oYTkSVVV-YzmTwUpWJkUV2hyWd-gW-9_CckYI8UEDQvhA271Xntt-0Q5ALwVThgDBvvgBhkGB1hZh6Xt99YMIerwFpwCGcwBdyLIDbTYiAM4rHusdK8DxEilwEEvAQfdxbftRPztIGxse4eulTAe7s9zilaL99X8M1l-f3zN35aJZCkLSWyUZkqmRJSykXXesKqpQFEh6lpllWSxlShBtkDzuoGmpkqUaZ7VZctIStIpmp1it87uBvCBd9pLMEb0YAfPaRV9NaF0lOYnqXTWeweKb53uhDtwSvhIlh_J8hEhpzk_kuVF9L2efBBr7DU47qUeW7faRUC8tfrfhMfz5Y3t1zvdry-niyIr04zVZfoHv9uQeg</recordid><startdate>20150201</startdate><enddate>20150201</enddate><creator>Bing, Bai</creator><creator>Zhongqi, Niu</creator><creator>Yi, Niu</creator><creator>Bing, Wei</creator><creator>Gang, Zhao</creator><general>Elsevier Ltd</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W92</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope></search><sort><creationdate>20150201</creationdate><title>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</title><author>Bing, Bai ; Zhongqi, Niu ; Yi, Niu ; Bing, Wei ; Gang, Zhao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c232t-10134fc30a7cbc95b28b8ef1aa99f48c2100a7ecde159beb91fa735497d20303</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Algorithms</topic><topic>Arithmetic</topic><topic>Convolution</topic><topic>convolution perfectly matched layer</topic><topic>Division</topic><topic>division elimination</topic><topic>Field programmable gate arrays</topic><topic>finit difference time domain</topic><topic>FPGA</topic><topic>graphic processing unit</topic><topic>Optimization</topic><topic>parallel computing</topic><topic>Perfectly matched layers</topic><topic>Power consumption</topic><topic>卷积完全匹配层</topic><topic>图形处理单元</topic><topic>并行加速</topic><topic>并行结构</topic><topic>时域有限差分法</topic><topic>现场可编程门阵列</topic><topic>除法运算</topic><toplevel>online_resources</toplevel><creatorcontrib>Bing, Bai</creatorcontrib><creatorcontrib>Zhongqi, Niu</creatorcontrib><creatorcontrib>Yi, Niu</creatorcontrib><creatorcontrib>Bing, Wei</creatorcontrib><creatorcontrib>Gang, Zhao</creatorcontrib><collection>中文科技期刊数据库</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>中文科技期刊数据库-7.0平台</collection><collection>中文科技期刊数据库-工程技术</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>Journal of China universities of posts and telecommunications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bing, Bai</au><au>Zhongqi, Niu</au><au>Yi, Niu</au><au>Bing, Wei</au><au>Gang, Zhao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</atitle><jtitle>Journal of China universities of posts and telecommunications</jtitle><addtitle>The Journal of China Universities of Posts and Telecommunications</addtitle><date>2015-02-01</date><risdate>2015</risdate><volume>22</volume><issue>1</issue><spage>72</spage><epage>76</epage><pages>72-76</pages><issn>1005-8885</issn><abstract>Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/S1005-8885(15)60627-6</doi><tpages>5</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1005-8885 |
ispartof | Journal of China universities of posts and telecommunications, 2015-02, Vol.22 (1), p.72-76 |
issn | 1005-8885 |
language | eng |
recordid | cdi_proquest_miscellaneous_1815990110 |
source | Alma/SFX Local Collection |
subjects | Algorithms Arithmetic Convolution convolution perfectly matched layer Division division elimination Field programmable gate arrays finit difference time domain FPGA graphic processing unit Optimization parallel computing Perfectly matched layers Power consumption 卷积完全匹配层 图形处理单元 并行加速 并行结构 时域有限差分法 现场可编程门阵列 除法运算 |
title | Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T07%3A33%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fast%20division-free%20parallel%20structure%20for%20convolution%20perfectly%20matched%20layer%20in%20finite%20difference%20time%20domain%20method&rft.jtitle=Journal%20of%20China%20universities%20of%20posts%20and%20telecommunications&rft.au=Bing,%20Bai&rft.date=2015-02-01&rft.volume=22&rft.issue=1&rft.spage=72&rft.epage=76&rft.pages=72-76&rft.issn=1005-8885&rft_id=info:doi/10.1016/S1005-8885(15)60627-6&rft_dat=%3Cproquest_cross%3E1815990110%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1815990110&rft_id=info:pmid/&rft_cqvip_id=664734297&rft_els_id=S1005888515606276&rfr_iscdi=true |