Software pipelining with CGA and proposed intrinsics on a reconfigurable processor for HEVC decoders
This work proposes several intrinsics on a reconfigurable processor intended for HEVC decoding and software pipelining algorithms with a coarse-grained array (CGA) architecture as well as the proposed intrinsic instructions. Software pipelining algorithms are developed for the CGA acceleration of in...
Gespeichert in:
Veröffentlicht in: | Journal of real-time image processing 2019-12, Vol.16 (6), p.2173-2187 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 2187 |
---|---|
container_issue | 6 |
container_start_page | 2173 |
container_title | Journal of real-time image processing |
container_volume | 16 |
creator | Ahn, Yong-Jo Yoo, Jonghun Jo, Hyun-Ho Sim, Donggyu |
description | This work proposes several intrinsics on a reconfigurable processor intended for HEVC decoding and software pipelining algorithms with a coarse-grained array (CGA) architecture as well as the proposed intrinsic instructions. Software pipelining algorithms are developed for the CGA acceleration of inverse transform, pixel reconstruction, de-blocking filter and sample adaptive offset modules. To enable efficient software pipelining, several very-long instruction-word-based intrinsics are designed in order to maximize the parallelization rather than the computational acceleration. We found that the HEVC decoder with the proposed intrinsics yields 2.3 times faster in running clock cycle than a decoder that does not use the intrinsics. In addition, the HEVC decoder with CGA pipelining algorithms executes 10.9 times faster than that without the CGA mode. |
doi_str_mv | 10.1007/s11554-017-0729-9 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2918674177</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2918674177</sourcerecordid><originalsourceid>FETCH-LOGICAL-c268t-ca28d2d2fab367c29d6c89326dadeed7151f6e074ced81891c049512b3834bb43</originalsourceid><addsrcrecordid>eNp1kE1LAzEQhoMoWKs_wFvA82omu5uPY1lqKwge_LiGbJKtKTW7JluK_96UFT15GGYOzzszPAhdA7kFQvhdAqjrqiDAC8KpLOQJmoFgUAgK8vR3JuQcXaS0JYRxVtYzZJ_7bjzo6PDgB7fzwYcNPvjxHTerBdbB4iH2Q5-cxT6M0YfkTcJ9wBpHZ_rQ-c0-6nbnjpxxKfURd7nWy7cG20xYF9MlOuv0Lrmrnz5Hr_fLl2ZdPD6tHprFY2EoE2NhNBWWWtrptmTcUGmZEbKkzGrrnOVQQ8cc4ZVxVoCQYEgla6BtKcqqbatyjm6mvfmXz71Lo9r2-xjySUVlNsAr4DxTMFEm9ilF16kh-g8dvxQQdZSpJpkqy1RHmUrmDJ0yKbNh4-Lf5v9D38vLd-k</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918674177</pqid></control><display><type>article</type><title>Software pipelining with CGA and proposed intrinsics on a reconfigurable processor for HEVC decoders</title><source>ProQuest Central UK/Ireland</source><source>SpringerLink Journals - AutoHoldings</source><source>ProQuest Central</source><creator>Ahn, Yong-Jo ; Yoo, Jonghun ; Jo, Hyun-Ho ; Sim, Donggyu</creator><creatorcontrib>Ahn, Yong-Jo ; Yoo, Jonghun ; Jo, Hyun-Ho ; Sim, Donggyu</creatorcontrib><description>This work proposes several intrinsics on a reconfigurable processor intended for HEVC decoding and software pipelining algorithms with a coarse-grained array (CGA) architecture as well as the proposed intrinsic instructions. Software pipelining algorithms are developed for the CGA acceleration of inverse transform, pixel reconstruction, de-blocking filter and sample adaptive offset modules. To enable efficient software pipelining, several very-long instruction-word-based intrinsics are designed in order to maximize the parallelization rather than the computational acceleration. We found that the HEVC decoder with the proposed intrinsics yields 2.3 times faster in running clock cycle than a decoder that does not use the intrinsics. In addition, the HEVC decoder with CGA pipelining algorithms executes 10.9 times faster than that without the CGA mode.</description><identifier>ISSN: 1861-8200</identifier><identifier>EISSN: 1861-8219</identifier><identifier>DOI: 10.1007/s11554-017-0729-9</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Adaptive sampling ; Algorithms ; Coding standards ; Computer Graphics ; Computer Science ; Decoders ; Decoding ; Digital signal processors ; Efficiency ; Image Processing and Computer Vision ; Microprocessors ; Multimedia Information Systems ; Original Research Paper ; Pattern Recognition ; Reconfiguration ; Signal,Image and Speech Processing ; Software ; Video compression</subject><ispartof>Journal of real-time image processing, 2019-12, Vol.16 (6), p.2173-2187</ispartof><rights>Springer-Verlag GmbH Germany 2017</rights><rights>Springer-Verlag GmbH Germany 2017.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c268t-ca28d2d2fab367c29d6c89326dadeed7151f6e074ced81891c049512b3834bb43</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11554-017-0729-9$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2918674177?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,21388,27924,27925,33744,41488,42557,43805,51319,64385,64389,72469</link.rule.ids></links><search><creatorcontrib>Ahn, Yong-Jo</creatorcontrib><creatorcontrib>Yoo, Jonghun</creatorcontrib><creatorcontrib>Jo, Hyun-Ho</creatorcontrib><creatorcontrib>Sim, Donggyu</creatorcontrib><title>Software pipelining with CGA and proposed intrinsics on a reconfigurable processor for HEVC decoders</title><title>Journal of real-time image processing</title><addtitle>J Real-Time Image Proc</addtitle><description>This work proposes several intrinsics on a reconfigurable processor intended for HEVC decoding and software pipelining algorithms with a coarse-grained array (CGA) architecture as well as the proposed intrinsic instructions. Software pipelining algorithms are developed for the CGA acceleration of inverse transform, pixel reconstruction, de-blocking filter and sample adaptive offset modules. To enable efficient software pipelining, several very-long instruction-word-based intrinsics are designed in order to maximize the parallelization rather than the computational acceleration. We found that the HEVC decoder with the proposed intrinsics yields 2.3 times faster in running clock cycle than a decoder that does not use the intrinsics. In addition, the HEVC decoder with CGA pipelining algorithms executes 10.9 times faster than that without the CGA mode.</description><subject>Adaptive sampling</subject><subject>Algorithms</subject><subject>Coding standards</subject><subject>Computer Graphics</subject><subject>Computer Science</subject><subject>Decoders</subject><subject>Decoding</subject><subject>Digital signal processors</subject><subject>Efficiency</subject><subject>Image Processing and Computer Vision</subject><subject>Microprocessors</subject><subject>Multimedia Information Systems</subject><subject>Original Research Paper</subject><subject>Pattern Recognition</subject><subject>Reconfiguration</subject><subject>Signal,Image and Speech Processing</subject><subject>Software</subject><subject>Video compression</subject><issn>1861-8200</issn><issn>1861-8219</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1kE1LAzEQhoMoWKs_wFvA82omu5uPY1lqKwge_LiGbJKtKTW7JluK_96UFT15GGYOzzszPAhdA7kFQvhdAqjrqiDAC8KpLOQJmoFgUAgK8vR3JuQcXaS0JYRxVtYzZJ_7bjzo6PDgB7fzwYcNPvjxHTerBdbB4iH2Q5-cxT6M0YfkTcJ9wBpHZ_rQ-c0-6nbnjpxxKfURd7nWy7cG20xYF9MlOuv0Lrmrnz5Hr_fLl2ZdPD6tHprFY2EoE2NhNBWWWtrptmTcUGmZEbKkzGrrnOVQQ8cc4ZVxVoCQYEgla6BtKcqqbatyjm6mvfmXz71Lo9r2-xjySUVlNsAr4DxTMFEm9ilF16kh-g8dvxQQdZSpJpkqy1RHmUrmDJ0yKbNh4-Lf5v9D38vLd-k</recordid><startdate>20191201</startdate><enddate>20191201</enddate><creator>Ahn, Yong-Jo</creator><creator>Yoo, Jonghun</creator><creator>Jo, Hyun-Ho</creator><creator>Sim, Donggyu</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>20191201</creationdate><title>Software pipelining with CGA and proposed intrinsics on a reconfigurable processor for HEVC decoders</title><author>Ahn, Yong-Jo ; Yoo, Jonghun ; Jo, Hyun-Ho ; Sim, Donggyu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c268t-ca28d2d2fab367c29d6c89326dadeed7151f6e074ced81891c049512b3834bb43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Adaptive sampling</topic><topic>Algorithms</topic><topic>Coding standards</topic><topic>Computer Graphics</topic><topic>Computer Science</topic><topic>Decoders</topic><topic>Decoding</topic><topic>Digital signal processors</topic><topic>Efficiency</topic><topic>Image Processing and Computer Vision</topic><topic>Microprocessors</topic><topic>Multimedia Information Systems</topic><topic>Original Research Paper</topic><topic>Pattern Recognition</topic><topic>Reconfiguration</topic><topic>Signal,Image and Speech Processing</topic><topic>Software</topic><topic>Video compression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ahn, Yong-Jo</creatorcontrib><creatorcontrib>Yoo, Jonghun</creatorcontrib><creatorcontrib>Jo, Hyun-Ho</creatorcontrib><creatorcontrib>Sim, Donggyu</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Journal of real-time image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ahn, Yong-Jo</au><au>Yoo, Jonghun</au><au>Jo, Hyun-Ho</au><au>Sim, Donggyu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Software pipelining with CGA and proposed intrinsics on a reconfigurable processor for HEVC decoders</atitle><jtitle>Journal of real-time image processing</jtitle><stitle>J Real-Time Image Proc</stitle><date>2019-12-01</date><risdate>2019</risdate><volume>16</volume><issue>6</issue><spage>2173</spage><epage>2187</epage><pages>2173-2187</pages><issn>1861-8200</issn><eissn>1861-8219</eissn><abstract>This work proposes several intrinsics on a reconfigurable processor intended for HEVC decoding and software pipelining algorithms with a coarse-grained array (CGA) architecture as well as the proposed intrinsic instructions. Software pipelining algorithms are developed for the CGA acceleration of inverse transform, pixel reconstruction, de-blocking filter and sample adaptive offset modules. To enable efficient software pipelining, several very-long instruction-word-based intrinsics are designed in order to maximize the parallelization rather than the computational acceleration. We found that the HEVC decoder with the proposed intrinsics yields 2.3 times faster in running clock cycle than a decoder that does not use the intrinsics. In addition, the HEVC decoder with CGA pipelining algorithms executes 10.9 times faster than that without the CGA mode.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s11554-017-0729-9</doi><tpages>15</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1861-8200 |
ispartof | Journal of real-time image processing, 2019-12, Vol.16 (6), p.2173-2187 |
issn | 1861-8200 1861-8219 |
language | eng |
recordid | cdi_proquest_journals_2918674177 |
source | ProQuest Central UK/Ireland; SpringerLink Journals - AutoHoldings; ProQuest Central |
subjects | Adaptive sampling Algorithms Coding standards Computer Graphics Computer Science Decoders Decoding Digital signal processors Efficiency Image Processing and Computer Vision Microprocessors Multimedia Information Systems Original Research Paper Pattern Recognition Reconfiguration Signal,Image and Speech Processing Software Video compression |
title | Software pipelining with CGA and proposed intrinsics on a reconfigurable processor for HEVC decoders |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T23%3A15%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Software%20pipelining%20with%20CGA%20and%20proposed%20intrinsics%20on%20a%20reconfigurable%20processor%20for%20HEVC%20decoders&rft.jtitle=Journal%20of%20real-time%20image%20processing&rft.au=Ahn,%20Yong-Jo&rft.date=2019-12-01&rft.volume=16&rft.issue=6&rft.spage=2173&rft.epage=2187&rft.pages=2173-2187&rft.issn=1861-8200&rft.eissn=1861-8219&rft_id=info:doi/10.1007/s11554-017-0729-9&rft_dat=%3Cproquest_cross%3E2918674177%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2918674177&rft_id=info:pmid/&rfr_iscdi=true |