Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture
In reconfigurable system, fast reconfiguration and small size of configuration contexts are strongly required to enhance the processing performance and reduce the implementation overhead. In this paper, a hierarchical representation of contexts for CGRA called HCC is proposed to satisfy the above re...
Gespeichert in:
Veröffentlicht in: | Science China. Information sciences 2013-11, Vol.56 (11), p.275-294 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 294 |
---|---|
container_issue | 11 |
container_start_page | 275 |
container_title | Science China. Information sciences |
container_volume | 56 |
creator | Wang, YanSheng Liu, LeiBo Yin, ShouYi Zhu, Min Cao, Peng Yang, Jun Wei, ShaoJun |
description | In reconfigurable system, fast reconfiguration and small size of configuration contexts are strongly required to enhance the processing performance and reduce the implementation overhead. In this paper, a hierarchical representation of contexts for CGRA called HCC is proposed to satisfy the above requirements. In HCC, the contexts are constructed in a hierarchical fashion to thoroughly eliminate the repetitive portions of the contexts, not only reducing the overall contexts storage size, but also alleviating the contexts transportation overhead. The fast context-indexing mechanism is proposed in HCC to achieve high configuration speed, since the hierarchically organized contexts can be located and accessed conveniently. HCC has been verified in a reconfigurable processor called REMUS_HP. Owing to HCC, when implementing H.264 decoding on REMUS_HP, 76.67% of the overall contexts are reduced compared with the traditional non-hierarchical one; and the configuration speed is averagely 23× increased compared with the latest reported optimized configuration mechanism on Virtex-4 FX60. REMUS_HP is implemented on a 48.9 mm^2 silicon with TSMC 65 nm technology. Simulation shows that 1920 ×1088@30 fps could be achieved for H.264 high-profile decoding when exploiting a 200 MHz working frequency. Compared with the high performance version of XPP, the performance is 181% boosted. |
doi_str_mv | 10.1007/s11432-013-4842-5 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1506377161</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>47711537</cqvip_id><sourcerecordid>2918577698</sourcerecordid><originalsourceid>FETCH-LOGICAL-c375t-c39d905496ff7b74a8c53514cf9c0f7aedefa8a61bdf65b2b2b4eec8d8dea04a3</originalsourceid><addsrcrecordid>eNp9kcFqFTEUhgdRsLR9AHcjbtzEJpNkkiylqBUKbhTchdzMyW3KTDJNMqDv4QN7LlOquDCBnJB8_38O_F33itF3jFJ1VRkTfCCUcSK0GIh81p0xPRrCDDPP8T4qQRTn3192l7XeU1yc00Hps-7XTYTiir-L3s19gbVAhdRcizn1OfQ5Efxbe59Tgx-tbxmhafOABd9CPG5lh1tcoHdp6uOyzrA8mbgCrg-5oIUrFcixuJhg-lt_mFF5mqGBb1uBi-5FcHOFy8d63n37-OHr9Q25_fLp8_X7W-K5kg1PMxkqhRlDUAclnPaSSyZ8MJ4G5WCC4LQb2WEKozwMuAWA15OewFHh-Hn3dvddS37YoDa7xOphnl2CvFXLJB25UmxkiL75B73PW0k4nR0M01Kp0Wik2E75kmstEOxa4uLKT8uoPUVl96gsRmVPUVmJmmHXVGTTEcof5_-JXj82usvp-IC6p04CB2aSK_4bg52mgQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918577698</pqid></control><display><type>article</type><title>Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture</title><source>Alma/SFX Local Collection</source><source>SpringerLink Journals - AutoHoldings</source><source>ProQuest Central</source><creator>Wang, YanSheng ; Liu, LeiBo ; Yin, ShouYi ; Zhu, Min ; Cao, Peng ; Yang, Jun ; Wei, ShaoJun</creator><creatorcontrib>Wang, YanSheng ; Liu, LeiBo ; Yin, ShouYi ; Zhu, Min ; Cao, Peng ; Yang, Jun ; Wei, ShaoJun</creatorcontrib><description>In reconfigurable system, fast reconfiguration and small size of configuration contexts are strongly required to enhance the processing performance and reduce the implementation overhead. In this paper, a hierarchical representation of contexts for CGRA called HCC is proposed to satisfy the above requirements. In HCC, the contexts are constructed in a hierarchical fashion to thoroughly eliminate the repetitive portions of the contexts, not only reducing the overall contexts storage size, but also alleviating the contexts transportation overhead. The fast context-indexing mechanism is proposed in HCC to achieve high configuration speed, since the hierarchically organized contexts can be located and accessed conveniently. HCC has been verified in a reconfigurable processor called REMUS_HP. Owing to HCC, when implementing H.264 decoding on REMUS_HP, 76.67% of the overall contexts are reduced compared with the traditional non-hierarchical one; and the configuration speed is averagely 23× increased compared with the latest reported optimized configuration mechanism on Virtex-4 FX60. REMUS_HP is implemented on a 48.9 mm^2 silicon with TSMC 65 nm technology. Simulation shows that 1920 ×1088@30 fps could be achieved for H.264 high-profile decoding when exploiting a 200 MHz working frequency. Compared with the high performance version of XPP, the performance is 181% boosted.</description><identifier>ISSN: 1674-733X</identifier><identifier>EISSN: 1869-1919</identifier><identifier>DOI: 10.1007/s11432-013-4842-5</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>China ; Computer Science ; Configurations ; Context ; Information Systems and Communication Service ; Microprocessors ; Reconfiguration ; Representations ; Research Paper ; 上下文 ; 优化配置 ; 可重构体系结构 ; 可重构系统 ; 恢复时间 ; 硅芯片 ; 粗粒度 ; 索引机制</subject><ispartof>Science China. Information sciences, 2013-11, Vol.56 (11), p.275-294</ispartof><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2013</rights><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2013.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c375t-c39d905496ff7b74a8c53514cf9c0f7aedefa8a61bdf65b2b2b4eec8d8dea04a3</citedby><cites>FETCH-LOGICAL-c375t-c39d905496ff7b74a8c53514cf9c0f7aedefa8a61bdf65b2b2b4eec8d8dea04a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/84009A/84009A.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11432-013-4842-5$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2918577698?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,776,780,21369,27903,27904,33723,33724,41467,42536,43784,51297</link.rule.ids></links><search><creatorcontrib>Wang, YanSheng</creatorcontrib><creatorcontrib>Liu, LeiBo</creatorcontrib><creatorcontrib>Yin, ShouYi</creatorcontrib><creatorcontrib>Zhu, Min</creatorcontrib><creatorcontrib>Cao, Peng</creatorcontrib><creatorcontrib>Yang, Jun</creatorcontrib><creatorcontrib>Wei, ShaoJun</creatorcontrib><title>Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture</title><title>Science China. Information sciences</title><addtitle>Sci. China Inf. Sci</addtitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><description>In reconfigurable system, fast reconfiguration and small size of configuration contexts are strongly required to enhance the processing performance and reduce the implementation overhead. In this paper, a hierarchical representation of contexts for CGRA called HCC is proposed to satisfy the above requirements. In HCC, the contexts are constructed in a hierarchical fashion to thoroughly eliminate the repetitive portions of the contexts, not only reducing the overall contexts storage size, but also alleviating the contexts transportation overhead. The fast context-indexing mechanism is proposed in HCC to achieve high configuration speed, since the hierarchically organized contexts can be located and accessed conveniently. HCC has been verified in a reconfigurable processor called REMUS_HP. Owing to HCC, when implementing H.264 decoding on REMUS_HP, 76.67% of the overall contexts are reduced compared with the traditional non-hierarchical one; and the configuration speed is averagely 23× increased compared with the latest reported optimized configuration mechanism on Virtex-4 FX60. REMUS_HP is implemented on a 48.9 mm^2 silicon with TSMC 65 nm technology. Simulation shows that 1920 ×1088@30 fps could be achieved for H.264 high-profile decoding when exploiting a 200 MHz working frequency. Compared with the high performance version of XPP, the performance is 181% boosted.</description><subject>China</subject><subject>Computer Science</subject><subject>Configurations</subject><subject>Context</subject><subject>Information Systems and Communication Service</subject><subject>Microprocessors</subject><subject>Reconfiguration</subject><subject>Representations</subject><subject>Research Paper</subject><subject>上下文</subject><subject>优化配置</subject><subject>可重构体系结构</subject><subject>可重构系统</subject><subject>恢复时间</subject><subject>硅芯片</subject><subject>粗粒度</subject><subject>索引机制</subject><issn>1674-733X</issn><issn>1869-1919</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9kcFqFTEUhgdRsLR9AHcjbtzEJpNkkiylqBUKbhTchdzMyW3KTDJNMqDv4QN7LlOquDCBnJB8_38O_F33itF3jFJ1VRkTfCCUcSK0GIh81p0xPRrCDDPP8T4qQRTn3192l7XeU1yc00Hps-7XTYTiir-L3s19gbVAhdRcizn1OfQ5Efxbe59Tgx-tbxmhafOABd9CPG5lh1tcoHdp6uOyzrA8mbgCrg-5oIUrFcixuJhg-lt_mFF5mqGBb1uBi-5FcHOFy8d63n37-OHr9Q25_fLp8_X7W-K5kg1PMxkqhRlDUAclnPaSSyZ8MJ4G5WCC4LQb2WEKozwMuAWA15OewFHh-Hn3dvddS37YoDa7xOphnl2CvFXLJB25UmxkiL75B73PW0k4nR0M01Kp0Wik2E75kmstEOxa4uLKT8uoPUVl96gsRmVPUVmJmmHXVGTTEcof5_-JXj82usvp-IC6p04CB2aSK_4bg52mgQ</recordid><startdate>20131101</startdate><enddate>20131101</enddate><creator>Wang, YanSheng</creator><creator>Liu, LeiBo</creator><creator>Yin, ShouYi</creator><creator>Zhu, Min</creator><creator>Cao, Peng</creator><creator>Yang, Jun</creator><creator>Wei, ShaoJun</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W92</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7SC</scope><scope>8FD</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20131101</creationdate><title>Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture</title><author>Wang, YanSheng ; Liu, LeiBo ; Yin, ShouYi ; Zhu, Min ; Cao, Peng ; Yang, Jun ; Wei, ShaoJun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c375t-c39d905496ff7b74a8c53514cf9c0f7aedefa8a61bdf65b2b2b4eec8d8dea04a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>China</topic><topic>Computer Science</topic><topic>Configurations</topic><topic>Context</topic><topic>Information Systems and Communication Service</topic><topic>Microprocessors</topic><topic>Reconfiguration</topic><topic>Representations</topic><topic>Research Paper</topic><topic>上下文</topic><topic>优化配置</topic><topic>可重构体系结构</topic><topic>可重构系统</topic><topic>恢复时间</topic><topic>硅芯片</topic><topic>粗粒度</topic><topic>索引机制</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, YanSheng</creatorcontrib><creatorcontrib>Liu, LeiBo</creatorcontrib><creatorcontrib>Yin, ShouYi</creatorcontrib><creatorcontrib>Zhu, Min</creatorcontrib><creatorcontrib>Cao, Peng</creatorcontrib><creatorcontrib>Yang, Jun</creatorcontrib><creatorcontrib>Wei, ShaoJun</creatorcontrib><collection>中文科技期刊数据库</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>中文科技期刊数据库-7.0平台</collection><collection>中文科技期刊数据库-工程技术</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Science China. Information sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, YanSheng</au><au>Liu, LeiBo</au><au>Yin, ShouYi</au><au>Zhu, Min</au><au>Cao, Peng</au><au>Yang, Jun</au><au>Wei, ShaoJun</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture</atitle><jtitle>Science China. Information sciences</jtitle><stitle>Sci. China Inf. Sci</stitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><date>2013-11-01</date><risdate>2013</risdate><volume>56</volume><issue>11</issue><spage>275</spage><epage>294</epage><pages>275-294</pages><issn>1674-733X</issn><eissn>1869-1919</eissn><abstract>In reconfigurable system, fast reconfiguration and small size of configuration contexts are strongly required to enhance the processing performance and reduce the implementation overhead. In this paper, a hierarchical representation of contexts for CGRA called HCC is proposed to satisfy the above requirements. In HCC, the contexts are constructed in a hierarchical fashion to thoroughly eliminate the repetitive portions of the contexts, not only reducing the overall contexts storage size, but also alleviating the contexts transportation overhead. The fast context-indexing mechanism is proposed in HCC to achieve high configuration speed, since the hierarchically organized contexts can be located and accessed conveniently. HCC has been verified in a reconfigurable processor called REMUS_HP. Owing to HCC, when implementing H.264 decoding on REMUS_HP, 76.67% of the overall contexts are reduced compared with the traditional non-hierarchical one; and the configuration speed is averagely 23× increased compared with the latest reported optimized configuration mechanism on Virtex-4 FX60. REMUS_HP is implemented on a 48.9 mm^2 silicon with TSMC 65 nm technology. Simulation shows that 1920 ×1088@30 fps could be achieved for H.264 high-profile decoding when exploiting a 200 MHz working frequency. Compared with the high performance version of XPP, the performance is 181% boosted.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s11432-013-4842-5</doi><tpages>20</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1674-733X |
ispartof | Science China. Information sciences, 2013-11, Vol.56 (11), p.275-294 |
issn | 1674-733X 1869-1919 |
language | eng |
recordid | cdi_proquest_miscellaneous_1506377161 |
source | Alma/SFX Local Collection; SpringerLink Journals - AutoHoldings; ProQuest Central |
subjects | China Computer Science Configurations Context Information Systems and Communication Service Microprocessors Reconfiguration Representations Research Paper 上下文 优化配置 可重构体系结构 可重构系统 恢复时间 硅芯片 粗粒度 索引机制 |
title | Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T03%3A48%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Hierarchical%20representation%20of%20on-chip%20context%20to%20reduce%20reconfiguration%20time%20and%20implementation%20area%20for%20coarse-grained%20reconfigurable%20architecture&rft.jtitle=Science%20China.%20Information%20sciences&rft.au=Wang,%20YanSheng&rft.date=2013-11-01&rft.volume=56&rft.issue=11&rft.spage=275&rft.epage=294&rft.pages=275-294&rft.issn=1674-733X&rft.eissn=1869-1919&rft_id=info:doi/10.1007/s11432-013-4842-5&rft_dat=%3Cproquest_cross%3E2918577698%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2918577698&rft_id=info:pmid/&rft_cqvip_id=47711537&rfr_iscdi=true |