Generic Soft Error Data and Control Flow Error Detection by Instruction Duplication

Transient faults or soft errors are considered one of the most daunting reliability challenges for microprocessors. Software solutions for soft error protection are attractive because they can provide flexible and effective error protection. For instance, nZDC (Didehban and Shrivastava 2016) state-o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on dependable and secure computing 2024-01, Vol.21 (1), p.78-92
Hauptverfasser: Didehban, Moslem, So, Hwisoo, Gali, Prudhvi, Shrivastava, Aviral, Lee, Kyoungwoo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 92
container_issue 1
container_start_page 78
container_title IEEE transactions on dependable and secure computing
container_volume 21
creator Didehban, Moslem
So, Hwisoo
Gali, Prudhvi
Shrivastava, Aviral
Lee, Kyoungwoo
description Transient faults or soft errors are considered one of the most daunting reliability challenges for microprocessors. Software solutions for soft error protection are attractive because they can provide flexible and effective error protection. For instance, nZDC (Didehban and Shrivastava 2016) state-of-the-art instruction duplication error protection scheme achieves a high degree of error detection by verifying the results of memory write operations and utilizes an effective control-flow checking mechanism. However, nZDC control-flow checking mechanism is architecture-dependent and suffers from some vulnerability holes. In this work, we address these issues by substituting nZDC control-flow checking mechanism with a general (ISA-independent) scheme and propose two transformations, coarse-grained scheduling, and asymmetric control-flow signatures, for hard-to-detect control flow errors. Fault injection experiments on different hardware components of synthesizable Verilog description of an OpenRISC-based microprocessor reveal that the proposed transformation shows 85% less silent data corruptions compared to nZDC. In addition, programs protected by the proposed scheme run on average around 37% faster than nZDC-protected programs.
doi_str_mv 10.1109/TDSC.2023.3245842
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_10045828</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10045828</ieee_id><sourcerecordid>2915734958</sourcerecordid><originalsourceid>FETCH-LOGICAL-c294t-ecb1d14829dd3d9bbeaf1327aa78bc209ed6ab6370bc3cb4ed5ec88296aa80e43</originalsourceid><addsrcrecordid>eNpNkEtrwzAQhEVpoenjBxR6EPTsVC_H0rE4jwYCPSQ9C0leg4NrpZJMyb-vjVPoaWeZmV34EHqiZE4pUa-H5b6cM8L4nDORS8Gu0IwqQTNCqLwedC7yLFcFvUV3MR4JYUIqMUP7DXQQGof3vk54FYIPeGmSwaarcOm7FHyL163_-fMggUuN77A9420XU-inddmf2saZUT-gm9q0ER4v8x59rleH8j3bfWy25dsuc0yJlIGztKJCMlVVvFLWgqkpZ4UxhbSOEQXVwtgFL4h13FkBVQ5ODvGFMZKA4PfoZbp7Cv67h5j00fehG15qpmhecKFyOaTolHLBxxig1qfQfJlw1pTokZ0e2emRnb6wGzrPU6cBgH95MthM8l9HEmuW</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2915734958</pqid></control><display><type>article</type><title>Generic Soft Error Data and Control Flow Error Detection by Instruction Duplication</title><source>IEEE Electronic Library (IEL)</source><creator>Didehban, Moslem ; So, Hwisoo ; Gali, Prudhvi ; Shrivastava, Aviral ; Lee, Kyoungwoo</creator><creatorcontrib>Didehban, Moslem ; So, Hwisoo ; Gali, Prudhvi ; Shrivastava, Aviral ; Lee, Kyoungwoo</creatorcontrib><description>Transient faults or soft errors are considered one of the most daunting reliability challenges for microprocessors. Software solutions for soft error protection are attractive because they can provide flexible and effective error protection. For instance, nZDC (Didehban and Shrivastava 2016) state-of-the-art instruction duplication error protection scheme achieves a high degree of error detection by verifying the results of memory write operations and utilizes an effective control-flow checking mechanism. However, nZDC control-flow checking mechanism is architecture-dependent and suffers from some vulnerability holes. In this work, we address these issues by substituting nZDC control-flow checking mechanism with a general (ISA-independent) scheme and propose two transformations, coarse-grained scheduling, and asymmetric control-flow signatures, for hard-to-detect control flow errors. Fault injection experiments on different hardware components of synthesizable Verilog description of an OpenRISC-based microprocessor reveal that the proposed transformation shows 85% less silent data corruptions compared to nZDC. In addition, programs protected by the proposed scheme run on average around 37% faster than nZDC-protected programs.</description><identifier>ISSN: 1545-5971</identifier><identifier>EISSN: 1941-0018</identifier><identifier>DOI: 10.1109/TDSC.2023.3245842</identifier><identifier>CODEN: ITDSCM</identifier><language>eng</language><publisher>Washington: IEEE</publisher><subject>Codes ; Compiler ; Error correction &amp; detection ; Error detection ; Fault detection ; Hardware ; Microprocessors ; Redundancy ; Registers ; reliability ; silent data corruption ; Soft errors ; Software ; Software reliability ; Transient analysis ; transient faults</subject><ispartof>IEEE transactions on dependable and secure computing, 2024-01, Vol.21 (1), p.78-92</ispartof><rights>Copyright IEEE Computer Society 2024</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c294t-ecb1d14829dd3d9bbeaf1327aa78bc209ed6ab6370bc3cb4ed5ec88296aa80e43</citedby><cites>FETCH-LOGICAL-c294t-ecb1d14829dd3d9bbeaf1327aa78bc209ed6ab6370bc3cb4ed5ec88296aa80e43</cites><orcidid>0000-0001-5082-3775 ; 0000-0002-1075-897X ; 0000-0002-3496-6079 ; 0000-0002-9357-4702</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10045828$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10045828$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Didehban, Moslem</creatorcontrib><creatorcontrib>So, Hwisoo</creatorcontrib><creatorcontrib>Gali, Prudhvi</creatorcontrib><creatorcontrib>Shrivastava, Aviral</creatorcontrib><creatorcontrib>Lee, Kyoungwoo</creatorcontrib><title>Generic Soft Error Data and Control Flow Error Detection by Instruction Duplication</title><title>IEEE transactions on dependable and secure computing</title><addtitle>TDSC</addtitle><description>Transient faults or soft errors are considered one of the most daunting reliability challenges for microprocessors. Software solutions for soft error protection are attractive because they can provide flexible and effective error protection. For instance, nZDC (Didehban and Shrivastava 2016) state-of-the-art instruction duplication error protection scheme achieves a high degree of error detection by verifying the results of memory write operations and utilizes an effective control-flow checking mechanism. However, nZDC control-flow checking mechanism is architecture-dependent and suffers from some vulnerability holes. In this work, we address these issues by substituting nZDC control-flow checking mechanism with a general (ISA-independent) scheme and propose two transformations, coarse-grained scheduling, and asymmetric control-flow signatures, for hard-to-detect control flow errors. Fault injection experiments on different hardware components of synthesizable Verilog description of an OpenRISC-based microprocessor reveal that the proposed transformation shows 85% less silent data corruptions compared to nZDC. In addition, programs protected by the proposed scheme run on average around 37% faster than nZDC-protected programs.</description><subject>Codes</subject><subject>Compiler</subject><subject>Error correction &amp; detection</subject><subject>Error detection</subject><subject>Fault detection</subject><subject>Hardware</subject><subject>Microprocessors</subject><subject>Redundancy</subject><subject>Registers</subject><subject>reliability</subject><subject>silent data corruption</subject><subject>Soft errors</subject><subject>Software</subject><subject>Software reliability</subject><subject>Transient analysis</subject><subject>transient faults</subject><issn>1545-5971</issn><issn>1941-0018</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkEtrwzAQhEVpoenjBxR6EPTsVC_H0rE4jwYCPSQ9C0leg4NrpZJMyb-vjVPoaWeZmV34EHqiZE4pUa-H5b6cM8L4nDORS8Gu0IwqQTNCqLwedC7yLFcFvUV3MR4JYUIqMUP7DXQQGof3vk54FYIPeGmSwaarcOm7FHyL163_-fMggUuN77A9420XU-inddmf2saZUT-gm9q0ER4v8x59rleH8j3bfWy25dsuc0yJlIGztKJCMlVVvFLWgqkpZ4UxhbSOEQXVwtgFL4h13FkBVQ5ODvGFMZKA4PfoZbp7Cv67h5j00fehG15qpmhecKFyOaTolHLBxxig1qfQfJlw1pTokZ0e2emRnb6wGzrPU6cBgH95MthM8l9HEmuW</recordid><startdate>202401</startdate><enddate>202401</enddate><creator>Didehban, Moslem</creator><creator>So, Hwisoo</creator><creator>Gali, Prudhvi</creator><creator>Shrivastava, Aviral</creator><creator>Lee, Kyoungwoo</creator><general>IEEE</general><general>IEEE Computer Society</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><orcidid>https://orcid.org/0000-0001-5082-3775</orcidid><orcidid>https://orcid.org/0000-0002-1075-897X</orcidid><orcidid>https://orcid.org/0000-0002-3496-6079</orcidid><orcidid>https://orcid.org/0000-0002-9357-4702</orcidid></search><sort><creationdate>202401</creationdate><title>Generic Soft Error Data and Control Flow Error Detection by Instruction Duplication</title><author>Didehban, Moslem ; So, Hwisoo ; Gali, Prudhvi ; Shrivastava, Aviral ; Lee, Kyoungwoo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c294t-ecb1d14829dd3d9bbeaf1327aa78bc209ed6ab6370bc3cb4ed5ec88296aa80e43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Codes</topic><topic>Compiler</topic><topic>Error correction &amp; detection</topic><topic>Error detection</topic><topic>Fault detection</topic><topic>Hardware</topic><topic>Microprocessors</topic><topic>Redundancy</topic><topic>Registers</topic><topic>reliability</topic><topic>silent data corruption</topic><topic>Soft errors</topic><topic>Software</topic><topic>Software reliability</topic><topic>Transient analysis</topic><topic>transient faults</topic><toplevel>online_resources</toplevel><creatorcontrib>Didehban, Moslem</creatorcontrib><creatorcontrib>So, Hwisoo</creatorcontrib><creatorcontrib>Gali, Prudhvi</creatorcontrib><creatorcontrib>Shrivastava, Aviral</creatorcontrib><creatorcontrib>Lee, Kyoungwoo</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>IEEE transactions on dependable and secure computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Didehban, Moslem</au><au>So, Hwisoo</au><au>Gali, Prudhvi</au><au>Shrivastava, Aviral</au><au>Lee, Kyoungwoo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Generic Soft Error Data and Control Flow Error Detection by Instruction Duplication</atitle><jtitle>IEEE transactions on dependable and secure computing</jtitle><stitle>TDSC</stitle><date>2024-01</date><risdate>2024</risdate><volume>21</volume><issue>1</issue><spage>78</spage><epage>92</epage><pages>78-92</pages><issn>1545-5971</issn><eissn>1941-0018</eissn><coden>ITDSCM</coden><abstract>Transient faults or soft errors are considered one of the most daunting reliability challenges for microprocessors. Software solutions for soft error protection are attractive because they can provide flexible and effective error protection. For instance, nZDC (Didehban and Shrivastava 2016) state-of-the-art instruction duplication error protection scheme achieves a high degree of error detection by verifying the results of memory write operations and utilizes an effective control-flow checking mechanism. However, nZDC control-flow checking mechanism is architecture-dependent and suffers from some vulnerability holes. In this work, we address these issues by substituting nZDC control-flow checking mechanism with a general (ISA-independent) scheme and propose two transformations, coarse-grained scheduling, and asymmetric control-flow signatures, for hard-to-detect control flow errors. Fault injection experiments on different hardware components of synthesizable Verilog description of an OpenRISC-based microprocessor reveal that the proposed transformation shows 85% less silent data corruptions compared to nZDC. In addition, programs protected by the proposed scheme run on average around 37% faster than nZDC-protected programs.</abstract><cop>Washington</cop><pub>IEEE</pub><doi>10.1109/TDSC.2023.3245842</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0001-5082-3775</orcidid><orcidid>https://orcid.org/0000-0002-1075-897X</orcidid><orcidid>https://orcid.org/0000-0002-3496-6079</orcidid><orcidid>https://orcid.org/0000-0002-9357-4702</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1545-5971
ispartof IEEE transactions on dependable and secure computing, 2024-01, Vol.21 (1), p.78-92
issn 1545-5971
1941-0018
language eng
recordid cdi_ieee_primary_10045828
source IEEE Electronic Library (IEL)
subjects Codes
Compiler
Error correction & detection
Error detection
Fault detection
Hardware
Microprocessors
Redundancy
Registers
reliability
silent data corruption
Soft errors
Software
Software reliability
Transient analysis
transient faults
title Generic Soft Error Data and Control Flow Error Detection by Instruction Duplication
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T06%3A16%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Generic%20Soft%20Error%20Data%20and%20Control%20Flow%20Error%20Detection%20by%20Instruction%20Duplication&rft.jtitle=IEEE%20transactions%20on%20dependable%20and%20secure%20computing&rft.au=Didehban,%20Moslem&rft.date=2024-01&rft.volume=21&rft.issue=1&rft.spage=78&rft.epage=92&rft.pages=78-92&rft.issn=1545-5971&rft.eissn=1941-0018&rft.coden=ITDSCM&rft_id=info:doi/10.1109/TDSC.2023.3245842&rft_dat=%3Cproquest_RIE%3E2915734958%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2915734958&rft_id=info:pmid/&rft_ieee_id=10045828&rfr_iscdi=true