Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software

Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and met...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the IEEE 2016-07, Vol.104 (7), p.1410-1448
Hauptverfasser: Rahimi, Abbas, Benini, Luca, Gupta, Rajesh K.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1448
container_issue 7
container_start_page 1410
container_title Proceedings of the IEEE
container_volume 104
creator Rahimi, Abbas
Benini, Luca
Gupta, Rajesh K.
description Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field.
doi_str_mv 10.1109/JPROC.2016.2518864
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_1825529641</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7426722</ieee_id><sourcerecordid>1825529641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</originalsourceid><addsrcrecordid>eNpdkU9PGzEQxa2qlZoGvgC9WOqll039Z-21e0OrBqiSBhHgajm7Y-oou6a2F5Rvz6ZBHNAcnkZ6b-ZJP4TOKJlRSvSP39c3q3rGCJUzJqhSsvyAJlQIVTAm5Ec0IYSqQjOqP6MvKW0JIVxIPkHdvY3ebvzO5z1e-uwfbPahx77Hf2wfOsgQcb1crfFVn-Eh2gwtXu9Thi79xOd4PcQn2OPg8C00f3v_b4CE5zF0uPaxGXxOOAe8Di4_2wgn6JOzuwSnrzpFd_Nft_VlsVhdXNXni6LhWuWCCdVyXVLOSKkbbSUQypWCStmNclqUTDjKnBuXloNjSlR0nEa2sKGct3yKvh_vPsZwaJRN51MDu53tIQzJUMWEYFqOL6bo2zvrNgyxH9sZWmmldCnVwcWOriaGlCI48xh9Z-PeUGIOBMx_AuZAwLwSGENfjyEPAG-BqmSyYoy_ADLYgbI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1798894681</pqid></control><display><type>article</type><title>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</title><source>IEEE Electronic Library (IEL)</source><creator>Rahimi, Abbas ; Benini, Luca ; Gupta, Rajesh K.</creator><creatorcontrib>Rahimi, Abbas ; Benini, Luca ; Gupta, Rajesh K.</creatorcontrib><description>Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field.</description><identifier>ISSN: 0018-9219</identifier><identifier>EISSN: 1558-2256</identifier><identifier>DOI: 10.1109/JPROC.2016.2518864</identifier><identifier>CODEN: IEEPAD</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Approximate computing ; Architecture ; CMOS integrated circuits ; Computation ; Computer programs ; Design ; Design engineering ; Devices ; Error analysis ; Nanostructure ; Parallel architectures ; Performance evaluation ; Resilient systems ; Software ; Time measurements ; timing errors ; variability</subject><ispartof>Proceedings of the IEEE, 2016-07, Vol.104 (7), p.1410-1448</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</citedby><cites>FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7426722$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7426722$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rahimi, Abbas</creatorcontrib><creatorcontrib>Benini, Luca</creatorcontrib><creatorcontrib>Gupta, Rajesh K.</creatorcontrib><title>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</title><title>Proceedings of the IEEE</title><addtitle>JPROC</addtitle><description>Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field.</description><subject>Approximate computing</subject><subject>Architecture</subject><subject>CMOS integrated circuits</subject><subject>Computation</subject><subject>Computer programs</subject><subject>Design</subject><subject>Design engineering</subject><subject>Devices</subject><subject>Error analysis</subject><subject>Nanostructure</subject><subject>Parallel architectures</subject><subject>Performance evaluation</subject><subject>Resilient systems</subject><subject>Software</subject><subject>Time measurements</subject><subject>timing errors</subject><subject>variability</subject><issn>0018-9219</issn><issn>1558-2256</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkU9PGzEQxa2qlZoGvgC9WOqll039Z-21e0OrBqiSBhHgajm7Y-oou6a2F5Rvz6ZBHNAcnkZ6b-ZJP4TOKJlRSvSP39c3q3rGCJUzJqhSsvyAJlQIVTAm5Ec0IYSqQjOqP6MvKW0JIVxIPkHdvY3ebvzO5z1e-uwfbPahx77Hf2wfOsgQcb1crfFVn-Eh2gwtXu9Thi79xOd4PcQn2OPg8C00f3v_b4CE5zF0uPaxGXxOOAe8Di4_2wgn6JOzuwSnrzpFd_Nft_VlsVhdXNXni6LhWuWCCdVyXVLOSKkbbSUQypWCStmNclqUTDjKnBuXloNjSlR0nEa2sKGct3yKvh_vPsZwaJRN51MDu53tIQzJUMWEYFqOL6bo2zvrNgyxH9sZWmmldCnVwcWOriaGlCI48xh9Z-PeUGIOBMx_AuZAwLwSGENfjyEPAG-BqmSyYoy_ADLYgbI</recordid><startdate>20160701</startdate><enddate>20160701</enddate><creator>Rahimi, Abbas</creator><creator>Benini, Luca</creator><creator>Gupta, Rajesh K.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20160701</creationdate><title>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</title><author>Rahimi, Abbas ; Benini, Luca ; Gupta, Rajesh K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Approximate computing</topic><topic>Architecture</topic><topic>CMOS integrated circuits</topic><topic>Computation</topic><topic>Computer programs</topic><topic>Design</topic><topic>Design engineering</topic><topic>Devices</topic><topic>Error analysis</topic><topic>Nanostructure</topic><topic>Parallel architectures</topic><topic>Performance evaluation</topic><topic>Resilient systems</topic><topic>Software</topic><topic>Time measurements</topic><topic>timing errors</topic><topic>variability</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rahimi, Abbas</creatorcontrib><creatorcontrib>Benini, Luca</creatorcontrib><creatorcontrib>Gupta, Rajesh K.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><jtitle>Proceedings of the IEEE</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rahimi, Abbas</au><au>Benini, Luca</au><au>Gupta, Rajesh K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</atitle><jtitle>Proceedings of the IEEE</jtitle><stitle>JPROC</stitle><date>2016-07-01</date><risdate>2016</risdate><volume>104</volume><issue>7</issue><spage>1410</spage><epage>1448</epage><pages>1410-1448</pages><issn>0018-9219</issn><eissn>1558-2256</eissn><coden>IEEPAD</coden><abstract>Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/JPROC.2016.2518864</doi><tpages>39</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0018-9219
ispartof Proceedings of the IEEE, 2016-07, Vol.104 (7), p.1410-1448
issn 0018-9219
1558-2256
language eng
recordid cdi_proquest_miscellaneous_1825529641
source IEEE Electronic Library (IEL)
subjects Approximate computing
Architecture
CMOS integrated circuits
Computation
Computer programs
Design
Design engineering
Devices
Error analysis
Nanostructure
Parallel architectures
Performance evaluation
Resilient systems
Software
Time measurements
timing errors
variability
title Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T01%3A47%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Variability%20Mitigation%20in%20Nanometer%20CMOS%20Integrated%20Systems:%20A%20Survey%20of%20Techniques%20From%20Circuits%20to%20Software&rft.jtitle=Proceedings%20of%20the%20IEEE&rft.au=Rahimi,%20Abbas&rft.date=2016-07-01&rft.volume=104&rft.issue=7&rft.spage=1410&rft.epage=1448&rft.pages=1410-1448&rft.issn=0018-9219&rft.eissn=1558-2256&rft.coden=IEEPAD&rft_id=info:doi/10.1109/JPROC.2016.2518864&rft_dat=%3Cproquest_RIE%3E1825529641%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1798894681&rft_id=info:pmid/&rft_ieee_id=7426722&rfr_iscdi=true