Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software
Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and met...
Gespeichert in:
Veröffentlicht in: | Proceedings of the IEEE 2016-07, Vol.104 (7), p.1410-1448 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1448 |
---|---|
container_issue | 7 |
container_start_page | 1410 |
container_title | Proceedings of the IEEE |
container_volume | 104 |
creator | Rahimi, Abbas Benini, Luca Gupta, Rajesh K. |
description | Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field. |
doi_str_mv | 10.1109/JPROC.2016.2518864 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_1825529641</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7426722</ieee_id><sourcerecordid>1825529641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</originalsourceid><addsrcrecordid>eNpdkU9PGzEQxa2qlZoGvgC9WOqll039Z-21e0OrBqiSBhHgajm7Y-oou6a2F5Rvz6ZBHNAcnkZ6b-ZJP4TOKJlRSvSP39c3q3rGCJUzJqhSsvyAJlQIVTAm5Ec0IYSqQjOqP6MvKW0JIVxIPkHdvY3ebvzO5z1e-uwfbPahx77Hf2wfOsgQcb1crfFVn-Eh2gwtXu9Thi79xOd4PcQn2OPg8C00f3v_b4CE5zF0uPaxGXxOOAe8Di4_2wgn6JOzuwSnrzpFd_Nft_VlsVhdXNXni6LhWuWCCdVyXVLOSKkbbSUQypWCStmNclqUTDjKnBuXloNjSlR0nEa2sKGct3yKvh_vPsZwaJRN51MDu53tIQzJUMWEYFqOL6bo2zvrNgyxH9sZWmmldCnVwcWOriaGlCI48xh9Z-PeUGIOBMx_AuZAwLwSGENfjyEPAG-BqmSyYoy_ADLYgbI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1798894681</pqid></control><display><type>article</type><title>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</title><source>IEEE Electronic Library (IEL)</source><creator>Rahimi, Abbas ; Benini, Luca ; Gupta, Rajesh K.</creator><creatorcontrib>Rahimi, Abbas ; Benini, Luca ; Gupta, Rajesh K.</creatorcontrib><description>Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field.</description><identifier>ISSN: 0018-9219</identifier><identifier>EISSN: 1558-2256</identifier><identifier>DOI: 10.1109/JPROC.2016.2518864</identifier><identifier>CODEN: IEEPAD</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Approximate computing ; Architecture ; CMOS integrated circuits ; Computation ; Computer programs ; Design ; Design engineering ; Devices ; Error analysis ; Nanostructure ; Parallel architectures ; Performance evaluation ; Resilient systems ; Software ; Time measurements ; timing errors ; variability</subject><ispartof>Proceedings of the IEEE, 2016-07, Vol.104 (7), p.1410-1448</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</citedby><cites>FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7426722$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7426722$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rahimi, Abbas</creatorcontrib><creatorcontrib>Benini, Luca</creatorcontrib><creatorcontrib>Gupta, Rajesh K.</creatorcontrib><title>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</title><title>Proceedings of the IEEE</title><addtitle>JPROC</addtitle><description>Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field.</description><subject>Approximate computing</subject><subject>Architecture</subject><subject>CMOS integrated circuits</subject><subject>Computation</subject><subject>Computer programs</subject><subject>Design</subject><subject>Design engineering</subject><subject>Devices</subject><subject>Error analysis</subject><subject>Nanostructure</subject><subject>Parallel architectures</subject><subject>Performance evaluation</subject><subject>Resilient systems</subject><subject>Software</subject><subject>Time measurements</subject><subject>timing errors</subject><subject>variability</subject><issn>0018-9219</issn><issn>1558-2256</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkU9PGzEQxa2qlZoGvgC9WOqll039Z-21e0OrBqiSBhHgajm7Y-oou6a2F5Rvz6ZBHNAcnkZ6b-ZJP4TOKJlRSvSP39c3q3rGCJUzJqhSsvyAJlQIVTAm5Ec0IYSqQjOqP6MvKW0JIVxIPkHdvY3ebvzO5z1e-uwfbPahx77Hf2wfOsgQcb1crfFVn-Eh2gwtXu9Thi79xOd4PcQn2OPg8C00f3v_b4CE5zF0uPaxGXxOOAe8Di4_2wgn6JOzuwSnrzpFd_Nft_VlsVhdXNXni6LhWuWCCdVyXVLOSKkbbSUQypWCStmNclqUTDjKnBuXloNjSlR0nEa2sKGct3yKvh_vPsZwaJRN51MDu53tIQzJUMWEYFqOL6bo2zvrNgyxH9sZWmmldCnVwcWOriaGlCI48xh9Z-PeUGIOBMx_AuZAwLwSGENfjyEPAG-BqmSyYoy_ADLYgbI</recordid><startdate>20160701</startdate><enddate>20160701</enddate><creator>Rahimi, Abbas</creator><creator>Benini, Luca</creator><creator>Gupta, Rajesh K.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20160701</creationdate><title>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</title><author>Rahimi, Abbas ; Benini, Luca ; Gupta, Rajesh K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c398t-258d394132049c9a6e01388e78ab8f95425f12ffb8fd3ef28571717c6deb133d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Approximate computing</topic><topic>Architecture</topic><topic>CMOS integrated circuits</topic><topic>Computation</topic><topic>Computer programs</topic><topic>Design</topic><topic>Design engineering</topic><topic>Devices</topic><topic>Error analysis</topic><topic>Nanostructure</topic><topic>Parallel architectures</topic><topic>Performance evaluation</topic><topic>Resilient systems</topic><topic>Software</topic><topic>Time measurements</topic><topic>timing errors</topic><topic>variability</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rahimi, Abbas</creatorcontrib><creatorcontrib>Benini, Luca</creatorcontrib><creatorcontrib>Gupta, Rajesh K.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><jtitle>Proceedings of the IEEE</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rahimi, Abbas</au><au>Benini, Luca</au><au>Gupta, Rajesh K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software</atitle><jtitle>Proceedings of the IEEE</jtitle><stitle>JPROC</stitle><date>2016-07-01</date><risdate>2016</risdate><volume>104</volume><issue>7</issue><spage>1410</spage><epage>1448</epage><pages>1410-1448</pages><issn>0018-9219</issn><eissn>1558-2256</eissn><coden>IEEPAD</coden><abstract>Variation in performance and power across manufactured parts and their operating conditions is an accepted reality in modern microelectronic manufacturing processes with geometries in nanometer scales. This article surveys challenges and opportunities in identifying variations, their effects and methods to combat these variations for improved microelectronic devices. We focus on computing devices and their design at various levels to combat variability. First, we provide a review of key concepts with particular emphasis on timing errors caused by various variability sources. We consider methods to predict and prevent, detect and correct, and finally conditions under which such errors can be accepted; we also consider their implications on cost, performance and quality. We provide a comparative evaluation of methods for deployment across various layers of the system from circuits, architecture, to application software. These can be combined in various ways to achieve specific goals related to observability and controllability of the variability effects, providing means to achieve cross-layer or hybrid resilience. We then provide examples of real world resilient single-core and parallel architectures. We find that parallel architectures and parallelism in general provide the best means to combat and exploit variability to design resilient and efficient systems. Using programmable accelerator architectures such as clustered processing elements and GP-GPUs, we show how system designers can coordinate propagation of timing error information and its effects along with new techniques for memoization (i.e., spatial or temporal reuse of computation). This discussion naturally leads to use of these techniques into emerging area of "approximate computing," and how these can be used in building resilient and efficient computing systems. We conclude with an outlook for the emerging field.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/JPROC.2016.2518864</doi><tpages>39</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0018-9219 |
ispartof | Proceedings of the IEEE, 2016-07, Vol.104 (7), p.1410-1448 |
issn | 0018-9219 1558-2256 |
language | eng |
recordid | cdi_proquest_miscellaneous_1825529641 |
source | IEEE Electronic Library (IEL) |
subjects | Approximate computing Architecture CMOS integrated circuits Computation Computer programs Design Design engineering Devices Error analysis Nanostructure Parallel architectures Performance evaluation Resilient systems Software Time measurements timing errors variability |
title | Variability Mitigation in Nanometer CMOS Integrated Systems: A Survey of Techniques From Circuits to Software |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T01%3A47%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Variability%20Mitigation%20in%20Nanometer%20CMOS%20Integrated%20Systems:%20A%20Survey%20of%20Techniques%20From%20Circuits%20to%20Software&rft.jtitle=Proceedings%20of%20the%20IEEE&rft.au=Rahimi,%20Abbas&rft.date=2016-07-01&rft.volume=104&rft.issue=7&rft.spage=1410&rft.epage=1448&rft.pages=1410-1448&rft.issn=0018-9219&rft.eissn=1558-2256&rft.coden=IEEPAD&rft_id=info:doi/10.1109/JPROC.2016.2518864&rft_dat=%3Cproquest_RIE%3E1825529641%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1798894681&rft_id=info:pmid/&rft_ieee_id=7426722&rfr_iscdi=true |