Efficient 3-D Processor Array Reconfiguration Algorithms Based on Bucket Effect

With the progressive augmentation of the density of 3-D processor arrays, some processor elements (PEs) often fail due to overload or overheating during massively parallel computing operations. Therefore, it is necessary to take effective fault-tolerant technology to ensure the reliability of the sy...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on computer-aided design of integrated circuits and systems 2024-04, Vol.43 (4), p.1023-1036
Hauptverfasser: Ding, Hao, He, Yanlong, Zhai, Zhongyi, Li, Zhi, Qian, Junyan, Zhao, Lingzhong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1036
container_issue 4
container_start_page 1023
container_title IEEE transactions on computer-aided design of integrated circuits and systems
container_volume 43
creator Ding, Hao
He, Yanlong
Zhai, Zhongyi
Li, Zhi
Qian, Junyan
Zhao, Lingzhong
description With the progressive augmentation of the density of 3-D processor arrays, some processor elements (PEs) often fail due to overload or overheating during massively parallel computing operations. Therefore, it is necessary to take effective fault-tolerant technology to ensure the reliability of the system. This article investigates an efficient reconfiguration method to construct 3-D fault-free logical subarray with more fault-free PEs and less interconnection length (interlength). First, we propose a novel method based on the barrel effect to find the bottleneck plane of 3-D processor arrays. Second, an efficient compensation strategy is proposed to replace faulty PEs on adjacent physical planes with fault-free PEs on the bottleneck planes, which leads to more fault-free PEs that can be used to construct the subarray. Then, we propose a heuristic to construct the subarray and optimize iteration redundancy to accelerate reconstruction. Finally, a heuristic optimization algorithm is proposed to reduce the interlength between PEs, which can reduce the dynamic power consumption and communication costs. In addition, we propose a more accurate method to calculate the lower bound of the interlength to better evaluate the performance of the algorithm. Simulation experiments show that, compared to the state-of-the-arts, on 128\times 128\times 128 host array, the utilization rate of fault-free PEs can be improved up to 15.6% and the interlength redundancy can be reduced by 78.2% for random faults. On 64\times 64\times 64 host array, the average improvement of the two indicators under clustered faults can reach 93.2% and 69.3%. Moreover, for all cases considered, the proposed new lower bound and reconstruction time can be reduced by an average of 18.47% and 76.13%, respectively.
doi_str_mv 10.1109/TCAD.2023.3337196
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2969049878</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10333075</ieee_id><sourcerecordid>2969049878</sourcerecordid><originalsourceid>FETCH-LOGICAL-c246t-1112b16d032b5b97efe42219e781efb17b3c55b34bdba39829c36977d8d56b4a3</originalsourceid><addsrcrecordid>eNpNkMtOwzAQRS0EEqXwAUgsLLFO8dhJHC_TUh5SpSJU1lbsjEtKGxc7WfTvSdUuWI00c-4d6RByD2wCwNTTalY-TzjjYiKEkKDyCzICJWSSQgaXZMS4LBLGJLsmNzFuGIM042pElnPnGttg21GRPNOP4C3G6AMtQ6gO9BOtb12z7kPVNb6l5XbtQ9N97yKdVhFrOuymvf3Bjg5FaLtbcuWqbcS78xyTr5f5avaWLJav77NykVie5l0CANxAXjPBTWaURIcp56BQFoDOgDTCZpkRqalNJVTBlRW5krIu6iw3aSXG5PHUuw_-t8fY6Y3vQzu81FzliqWqkMVAwYmywccY0Ol9aHZVOGhg-uhNH73pozd99jZkHk6ZBhH_8cOdyUz8Afj9aCk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2969049878</pqid></control><display><type>article</type><title>Efficient 3-D Processor Array Reconfiguration Algorithms Based on Bucket Effect</title><source>IEEE Electronic Library (IEL)</source><creator>Ding, Hao ; He, Yanlong ; Zhai, Zhongyi ; Li, Zhi ; Qian, Junyan ; Zhao, Lingzhong</creator><creatorcontrib>Ding, Hao ; He, Yanlong ; Zhai, Zhongyi ; Li, Zhi ; Qian, Junyan ; Zhao, Lingzhong</creatorcontrib><description><![CDATA[With the progressive augmentation of the density of 3-D processor arrays, some processor elements (PEs) often fail due to overload or overheating during massively parallel computing operations. Therefore, it is necessary to take effective fault-tolerant technology to ensure the reliability of the system. This article investigates an efficient reconfiguration method to construct 3-D fault-free logical subarray with more fault-free PEs and less interconnection length (interlength). First, we propose a novel method based on the barrel effect to find the bottleneck plane of 3-D processor arrays. Second, an efficient compensation strategy is proposed to replace faulty PEs on adjacent physical planes with fault-free PEs on the bottleneck planes, which leads to more fault-free PEs that can be used to construct the subarray. Then, we propose a heuristic to construct the subarray and optimize iteration redundancy to accelerate reconstruction. Finally, a heuristic optimization algorithm is proposed to reduce the interlength between PEs, which can reduce the dynamic power consumption and communication costs. In addition, we propose a more accurate method to calculate the lower bound of the interlength to better evaluate the performance of the algorithm. Simulation experiments show that, compared to the state-of-the-arts, on <inline-formula> <tex-math notation="LaTeX">128\times 128\times 128 </tex-math></inline-formula> host array, the utilization rate of fault-free PEs can be improved up to 15.6% and the interlength redundancy can be reduced by 78.2% for random faults. On <inline-formula> <tex-math notation="LaTeX">64\times 64\times 64 </tex-math></inline-formula> host array, the average improvement of the two indicators under clustered faults can reach 93.2% and 69.3%. Moreover, for all cases considered, the proposed new lower bound and reconstruction time can be reduced by an average of 18.47% and 76.13%, respectively.]]></description><identifier>ISSN: 0278-0070</identifier><identifier>EISSN: 1937-4151</identifier><identifier>DOI: 10.1109/TCAD.2023.3337196</identifier><identifier>CODEN: ITCSDI</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>3-D very large-scale integration (VLSI) array ; algorithm ; Algorithms ; Array processors ; Circuit faults ; Fault tolerance ; Fault tolerant systems ; Heuristic ; Integrated circuit interconnections ; interconnection length ; Iterative methods ; Logic arrays ; Lower bounds ; Microprocessors ; Overheating ; Parallel processing ; Power consumption ; Reconfiguration ; Reconstruction ; Redundancy ; Switches ; Three-dimensional displays</subject><ispartof>IEEE transactions on computer-aided design of integrated circuits and systems, 2024-04, Vol.43 (4), p.1023-1036</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c246t-1112b16d032b5b97efe42219e781efb17b3c55b34bdba39829c36977d8d56b4a3</cites><orcidid>0000-0003-4935-3993 ; 0000-0002-1325-6975 ; 0000-0001-5325-9997 ; 0000-0003-1861-6842</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10333075$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10333075$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ding, Hao</creatorcontrib><creatorcontrib>He, Yanlong</creatorcontrib><creatorcontrib>Zhai, Zhongyi</creatorcontrib><creatorcontrib>Li, Zhi</creatorcontrib><creatorcontrib>Qian, Junyan</creatorcontrib><creatorcontrib>Zhao, Lingzhong</creatorcontrib><title>Efficient 3-D Processor Array Reconfiguration Algorithms Based on Bucket Effect</title><title>IEEE transactions on computer-aided design of integrated circuits and systems</title><addtitle>TCAD</addtitle><description><![CDATA[With the progressive augmentation of the density of 3-D processor arrays, some processor elements (PEs) often fail due to overload or overheating during massively parallel computing operations. Therefore, it is necessary to take effective fault-tolerant technology to ensure the reliability of the system. This article investigates an efficient reconfiguration method to construct 3-D fault-free logical subarray with more fault-free PEs and less interconnection length (interlength). First, we propose a novel method based on the barrel effect to find the bottleneck plane of 3-D processor arrays. Second, an efficient compensation strategy is proposed to replace faulty PEs on adjacent physical planes with fault-free PEs on the bottleneck planes, which leads to more fault-free PEs that can be used to construct the subarray. Then, we propose a heuristic to construct the subarray and optimize iteration redundancy to accelerate reconstruction. Finally, a heuristic optimization algorithm is proposed to reduce the interlength between PEs, which can reduce the dynamic power consumption and communication costs. In addition, we propose a more accurate method to calculate the lower bound of the interlength to better evaluate the performance of the algorithm. Simulation experiments show that, compared to the state-of-the-arts, on <inline-formula> <tex-math notation="LaTeX">128\times 128\times 128 </tex-math></inline-formula> host array, the utilization rate of fault-free PEs can be improved up to 15.6% and the interlength redundancy can be reduced by 78.2% for random faults. On <inline-formula> <tex-math notation="LaTeX">64\times 64\times 64 </tex-math></inline-formula> host array, the average improvement of the two indicators under clustered faults can reach 93.2% and 69.3%. Moreover, for all cases considered, the proposed new lower bound and reconstruction time can be reduced by an average of 18.47% and 76.13%, respectively.]]></description><subject>3-D very large-scale integration (VLSI) array</subject><subject>algorithm</subject><subject>Algorithms</subject><subject>Array processors</subject><subject>Circuit faults</subject><subject>Fault tolerance</subject><subject>Fault tolerant systems</subject><subject>Heuristic</subject><subject>Integrated circuit interconnections</subject><subject>interconnection length</subject><subject>Iterative methods</subject><subject>Logic arrays</subject><subject>Lower bounds</subject><subject>Microprocessors</subject><subject>Overheating</subject><subject>Parallel processing</subject><subject>Power consumption</subject><subject>Reconfiguration</subject><subject>Reconstruction</subject><subject>Redundancy</subject><subject>Switches</subject><subject>Three-dimensional displays</subject><issn>0278-0070</issn><issn>1937-4151</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkMtOwzAQRS0EEqXwAUgsLLFO8dhJHC_TUh5SpSJU1lbsjEtKGxc7WfTvSdUuWI00c-4d6RByD2wCwNTTalY-TzjjYiKEkKDyCzICJWSSQgaXZMS4LBLGJLsmNzFuGIM042pElnPnGttg21GRPNOP4C3G6AMtQ6gO9BOtb12z7kPVNb6l5XbtQ9N97yKdVhFrOuymvf3Bjg5FaLtbcuWqbcS78xyTr5f5avaWLJav77NykVie5l0CANxAXjPBTWaURIcp56BQFoDOgDTCZpkRqalNJVTBlRW5krIu6iw3aSXG5PHUuw_-t8fY6Y3vQzu81FzliqWqkMVAwYmywccY0Ol9aHZVOGhg-uhNH73pozd99jZkHk6ZBhH_8cOdyUz8Afj9aCk</recordid><startdate>20240401</startdate><enddate>20240401</enddate><creator>Ding, Hao</creator><creator>He, Yanlong</creator><creator>Zhai, Zhongyi</creator><creator>Li, Zhi</creator><creator>Qian, Junyan</creator><creator>Zhao, Lingzhong</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0003-4935-3993</orcidid><orcidid>https://orcid.org/0000-0002-1325-6975</orcidid><orcidid>https://orcid.org/0000-0001-5325-9997</orcidid><orcidid>https://orcid.org/0000-0003-1861-6842</orcidid></search><sort><creationdate>20240401</creationdate><title>Efficient 3-D Processor Array Reconfiguration Algorithms Based on Bucket Effect</title><author>Ding, Hao ; He, Yanlong ; Zhai, Zhongyi ; Li, Zhi ; Qian, Junyan ; Zhao, Lingzhong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c246t-1112b16d032b5b97efe42219e781efb17b3c55b34bdba39829c36977d8d56b4a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>3-D very large-scale integration (VLSI) array</topic><topic>algorithm</topic><topic>Algorithms</topic><topic>Array processors</topic><topic>Circuit faults</topic><topic>Fault tolerance</topic><topic>Fault tolerant systems</topic><topic>Heuristic</topic><topic>Integrated circuit interconnections</topic><topic>interconnection length</topic><topic>Iterative methods</topic><topic>Logic arrays</topic><topic>Lower bounds</topic><topic>Microprocessors</topic><topic>Overheating</topic><topic>Parallel processing</topic><topic>Power consumption</topic><topic>Reconfiguration</topic><topic>Reconstruction</topic><topic>Redundancy</topic><topic>Switches</topic><topic>Three-dimensional displays</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ding, Hao</creatorcontrib><creatorcontrib>He, Yanlong</creatorcontrib><creatorcontrib>Zhai, Zhongyi</creatorcontrib><creatorcontrib>Li, Zhi</creatorcontrib><creatorcontrib>Qian, Junyan</creatorcontrib><creatorcontrib>Zhao, Lingzhong</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on computer-aided design of integrated circuits and systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ding, Hao</au><au>He, Yanlong</au><au>Zhai, Zhongyi</au><au>Li, Zhi</au><au>Qian, Junyan</au><au>Zhao, Lingzhong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient 3-D Processor Array Reconfiguration Algorithms Based on Bucket Effect</atitle><jtitle>IEEE transactions on computer-aided design of integrated circuits and systems</jtitle><stitle>TCAD</stitle><date>2024-04-01</date><risdate>2024</risdate><volume>43</volume><issue>4</issue><spage>1023</spage><epage>1036</epage><pages>1023-1036</pages><issn>0278-0070</issn><eissn>1937-4151</eissn><coden>ITCSDI</coden><abstract><![CDATA[With the progressive augmentation of the density of 3-D processor arrays, some processor elements (PEs) often fail due to overload or overheating during massively parallel computing operations. Therefore, it is necessary to take effective fault-tolerant technology to ensure the reliability of the system. This article investigates an efficient reconfiguration method to construct 3-D fault-free logical subarray with more fault-free PEs and less interconnection length (interlength). First, we propose a novel method based on the barrel effect to find the bottleneck plane of 3-D processor arrays. Second, an efficient compensation strategy is proposed to replace faulty PEs on adjacent physical planes with fault-free PEs on the bottleneck planes, which leads to more fault-free PEs that can be used to construct the subarray. Then, we propose a heuristic to construct the subarray and optimize iteration redundancy to accelerate reconstruction. Finally, a heuristic optimization algorithm is proposed to reduce the interlength between PEs, which can reduce the dynamic power consumption and communication costs. In addition, we propose a more accurate method to calculate the lower bound of the interlength to better evaluate the performance of the algorithm. Simulation experiments show that, compared to the state-of-the-arts, on <inline-formula> <tex-math notation="LaTeX">128\times 128\times 128 </tex-math></inline-formula> host array, the utilization rate of fault-free PEs can be improved up to 15.6% and the interlength redundancy can be reduced by 78.2% for random faults. On <inline-formula> <tex-math notation="LaTeX">64\times 64\times 64 </tex-math></inline-formula> host array, the average improvement of the two indicators under clustered faults can reach 93.2% and 69.3%. Moreover, for all cases considered, the proposed new lower bound and reconstruction time can be reduced by an average of 18.47% and 76.13%, respectively.]]></abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TCAD.2023.3337196</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0003-4935-3993</orcidid><orcidid>https://orcid.org/0000-0002-1325-6975</orcidid><orcidid>https://orcid.org/0000-0001-5325-9997</orcidid><orcidid>https://orcid.org/0000-0003-1861-6842</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0278-0070
ispartof IEEE transactions on computer-aided design of integrated circuits and systems, 2024-04, Vol.43 (4), p.1023-1036
issn 0278-0070
1937-4151
language eng
recordid cdi_proquest_journals_2969049878
source IEEE Electronic Library (IEL)
subjects 3-D very large-scale integration (VLSI) array
algorithm
Algorithms
Array processors
Circuit faults
Fault tolerance
Fault tolerant systems
Heuristic
Integrated circuit interconnections
interconnection length
Iterative methods
Logic arrays
Lower bounds
Microprocessors
Overheating
Parallel processing
Power consumption
Reconfiguration
Reconstruction
Redundancy
Switches
Three-dimensional displays
title Efficient 3-D Processor Array Reconfiguration Algorithms Based on Bucket Effect
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T15%3A26%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%203-D%20Processor%20Array%20Reconfiguration%20Algorithms%20Based%20on%20Bucket%20Effect&rft.jtitle=IEEE%20transactions%20on%20computer-aided%20design%20of%20integrated%20circuits%20and%20systems&rft.au=Ding,%20Hao&rft.date=2024-04-01&rft.volume=43&rft.issue=4&rft.spage=1023&rft.epage=1036&rft.pages=1023-1036&rft.issn=0278-0070&rft.eissn=1937-4151&rft.coden=ITCSDI&rft_id=info:doi/10.1109/TCAD.2023.3337196&rft_dat=%3Cproquest_RIE%3E2969049878%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2969049878&rft_id=info:pmid/&rft_ieee_id=10333075&rfr_iscdi=true