Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method

Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of high...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of China universities of posts and telecommunications 2015-02, Vol.22 (1), p.72-76
Hauptverfasser: Bing, Bai, Zhongqi, Niu, Yi, Niu, Bing, Wei, Gang, Zhao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 76
container_issue 1
container_start_page 72
container_title Journal of China universities of posts and telecommunications
container_volume 22
creator Bing, Bai
Zhongqi, Niu
Yi, Niu
Bing, Wei
Gang, Zhao
description Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.
doi_str_mv 10.1016/S1005-8885(15)60627-6
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1815990110</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>664734297</cqvip_id><els_id>S1005888515606276</els_id><sourcerecordid>1815990110</sourcerecordid><originalsourceid>FETCH-LOGICAL-c232t-10134fc30a7cbc95b28b8ef1aa99f48c2100a7ecde159beb91fa735497d20303</originalsourceid><addsrcrecordid>eNqFkM1KAzEUhbNQsFYfQQiu6mI0yfyvRIpVoeDC7kMmc9NGMpM2yRT69mba0q2rS27OOZfzIfRAyTMltHj5oYTkSVVV-YzmTwUpWJkUV2hyWd-gW-9_CckYI8UEDQvhA271Xntt-0Q5ALwVThgDBvvgBhkGB1hZh6Xt99YMIerwFpwCGcwBdyLIDbTYiAM4rHusdK8DxEilwEEvAQfdxbftRPztIGxse4eulTAe7s9zilaL99X8M1l-f3zN35aJZCkLSWyUZkqmRJSykXXesKqpQFEh6lpllWSxlShBtkDzuoGmpkqUaZ7VZctIStIpmp1it87uBvCBd9pLMEb0YAfPaRV9NaF0lOYnqXTWeweKb53uhDtwSvhIlh_J8hEhpzk_kuVF9L2efBBr7DU47qUeW7faRUC8tfrfhMfz5Y3t1zvdry-niyIr04zVZfoHv9uQeg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1815990110</pqid></control><display><type>article</type><title>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</title><source>Alma/SFX Local Collection</source><creator>Bing, Bai ; Zhongqi, Niu ; Yi, Niu ; Bing, Wei ; Gang, Zhao</creator><creatorcontrib>Bing, Bai ; Zhongqi, Niu ; Yi, Niu ; Bing, Wei ; Gang, Zhao</creatorcontrib><description>Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.</description><identifier>ISSN: 1005-8885</identifier><identifier>DOI: 10.1016/S1005-8885(15)60627-6</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Algorithms ; Arithmetic ; Convolution ; convolution perfectly matched layer ; Division ; division elimination ; Field programmable gate arrays ; finit difference time domain ; FPGA ; graphic processing unit ; Optimization ; parallel computing ; Perfectly matched layers ; Power consumption ; 卷积完全匹配层 ; 图形处理单元 ; 并行加速 ; 并行结构 ; 时域有限差分法 ; 现场可编程门阵列 ; 除法运算</subject><ispartof>Journal of China universities of posts and telecommunications, 2015-02, Vol.22 (1), p.72-76</ispartof><rights>2015 The Journal of China Universities of Posts and Telecommunications</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c232t-10134fc30a7cbc95b28b8ef1aa99f48c2100a7ecde159beb91fa735497d20303</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/84121X/84121X.jpg</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Bing, Bai</creatorcontrib><creatorcontrib>Zhongqi, Niu</creatorcontrib><creatorcontrib>Yi, Niu</creatorcontrib><creatorcontrib>Bing, Wei</creatorcontrib><creatorcontrib>Gang, Zhao</creatorcontrib><title>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</title><title>Journal of China universities of posts and telecommunications</title><addtitle>The Journal of China Universities of Posts and Telecommunications</addtitle><description>Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.</description><subject>Algorithms</subject><subject>Arithmetic</subject><subject>Convolution</subject><subject>convolution perfectly matched layer</subject><subject>Division</subject><subject>division elimination</subject><subject>Field programmable gate arrays</subject><subject>finit difference time domain</subject><subject>FPGA</subject><subject>graphic processing unit</subject><subject>Optimization</subject><subject>parallel computing</subject><subject>Perfectly matched layers</subject><subject>Power consumption</subject><subject>卷积完全匹配层</subject><subject>图形处理单元</subject><subject>并行加速</subject><subject>并行结构</subject><subject>时域有限差分法</subject><subject>现场可编程门阵列</subject><subject>除法运算</subject><issn>1005-8885</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNqFkM1KAzEUhbNQsFYfQQiu6mI0yfyvRIpVoeDC7kMmc9NGMpM2yRT69mba0q2rS27OOZfzIfRAyTMltHj5oYTkSVVV-YzmTwUpWJkUV2hyWd-gW-9_CckYI8UEDQvhA271Xntt-0Q5ALwVThgDBvvgBhkGB1hZh6Xt99YMIerwFpwCGcwBdyLIDbTYiAM4rHusdK8DxEilwEEvAQfdxbftRPztIGxse4eulTAe7s9zilaL99X8M1l-f3zN35aJZCkLSWyUZkqmRJSykXXesKqpQFEh6lpllWSxlShBtkDzuoGmpkqUaZ7VZctIStIpmp1it87uBvCBd9pLMEb0YAfPaRV9NaF0lOYnqXTWeweKb53uhDtwSvhIlh_J8hEhpzk_kuVF9L2efBBr7DU47qUeW7faRUC8tfrfhMfz5Y3t1zvdry-niyIr04zVZfoHv9uQeg</recordid><startdate>20150201</startdate><enddate>20150201</enddate><creator>Bing, Bai</creator><creator>Zhongqi, Niu</creator><creator>Yi, Niu</creator><creator>Bing, Wei</creator><creator>Gang, Zhao</creator><general>Elsevier Ltd</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W92</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope></search><sort><creationdate>20150201</creationdate><title>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</title><author>Bing, Bai ; Zhongqi, Niu ; Yi, Niu ; Bing, Wei ; Gang, Zhao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c232t-10134fc30a7cbc95b28b8ef1aa99f48c2100a7ecde159beb91fa735497d20303</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Algorithms</topic><topic>Arithmetic</topic><topic>Convolution</topic><topic>convolution perfectly matched layer</topic><topic>Division</topic><topic>division elimination</topic><topic>Field programmable gate arrays</topic><topic>finit difference time domain</topic><topic>FPGA</topic><topic>graphic processing unit</topic><topic>Optimization</topic><topic>parallel computing</topic><topic>Perfectly matched layers</topic><topic>Power consumption</topic><topic>卷积完全匹配层</topic><topic>图形处理单元</topic><topic>并行加速</topic><topic>并行结构</topic><topic>时域有限差分法</topic><topic>现场可编程门阵列</topic><topic>除法运算</topic><toplevel>online_resources</toplevel><creatorcontrib>Bing, Bai</creatorcontrib><creatorcontrib>Zhongqi, Niu</creatorcontrib><creatorcontrib>Yi, Niu</creatorcontrib><creatorcontrib>Bing, Wei</creatorcontrib><creatorcontrib>Gang, Zhao</creatorcontrib><collection>中文科技期刊数据库</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>中文科技期刊数据库-7.0平台</collection><collection>中文科技期刊数据库-工程技术</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>Journal of China universities of posts and telecommunications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bing, Bai</au><au>Zhongqi, Niu</au><au>Yi, Niu</au><au>Bing, Wei</au><au>Gang, Zhao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method</atitle><jtitle>Journal of China universities of posts and telecommunications</jtitle><addtitle>The Journal of China Universities of Posts and Telecommunications</addtitle><date>2015-02-01</date><risdate>2015</risdate><volume>22</volume><issue>1</issue><spage>72</spage><epage>76</epage><pages>72-76</pages><issn>1005-8885</issn><abstract>Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/S1005-8885(15)60627-6</doi><tpages>5</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1005-8885
ispartof Journal of China universities of posts and telecommunications, 2015-02, Vol.22 (1), p.72-76
issn 1005-8885
language eng
recordid cdi_proquest_miscellaneous_1815990110
source Alma/SFX Local Collection
subjects Algorithms
Arithmetic
Convolution
convolution perfectly matched layer
Division
division elimination
Field programmable gate arrays
finit difference time domain
FPGA
graphic processing unit
Optimization
parallel computing
Perfectly matched layers
Power consumption
卷积完全匹配层
图形处理单元
并行加速
并行结构
时域有限差分法
现场可编程门阵列
除法运算
title Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T07%3A33%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fast%20division-free%20parallel%20structure%20for%20convolution%20perfectly%20matched%20layer%20in%20finite%20difference%20time%20domain%20method&rft.jtitle=Journal%20of%20China%20universities%20of%20posts%20and%20telecommunications&rft.au=Bing,%20Bai&rft.date=2015-02-01&rft.volume=22&rft.issue=1&rft.spage=72&rft.epage=76&rft.pages=72-76&rft.issn=1005-8885&rft_id=info:doi/10.1016/S1005-8885(15)60627-6&rft_dat=%3Cproquest_cross%3E1815990110%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1815990110&rft_id=info:pmid/&rft_cqvip_id=664734297&rft_els_id=S1005888515606276&rfr_iscdi=true