Using explicit output comparisons for fault tolerant scheduling (FTS) on modern high-performance processors

Soft errors and errors caused by intermittent faults are a major concern for modern processors. In this paper we provide a drastically different approach for fault tolerant scheduling (FTS) of tasks in such processors. Traditionally in FTS, error detection is performed implicitly and concurrently wi...

Bibliographic details
Main authors: Gao, Yue; Gupta, Sandeep K.; Breuer, Melvin A.
Format: Conference proceeding
Language: eng
Subjects:
Online access: Order full text
container_end_page 932
container_issue
container_start_page 927
container_title
container_volume
creator Gao, Yue
Gupta, Sandeep K.
Breuer, Melvin A.
description Soft errors and errors caused by intermittent faults are a major concern for modern processors. In this paper we provide a drastically different approach for fault tolerant scheduling (FTS) of tasks in such processors. Traditionally in FTS, error detection is performed implicitly and concurrently with task execution, and associated overheads are incurred as increases in software run-time or hardware area. However, such embedded error detection (EED) techniques, e.g., watchdog processor assisted control flow checking, only provide approximately 70% error coverage [1, 2]. We propose the idea of utilizing straightforward explicit output comparison (EOC) which provides nearly 100% error coverage. We construct a framework for utilizing EOC in FTS, identify new challenges and tradeoffs, and develop a new off-line scheduling algorithm for EOC. We show that our EOC based approach provides higher error coverage and an average performance improvement of nearly 10% over EED-based FTS approaches, without increasing resource requirements. In our ongoing research we are identifying a richer set of ways of applying EOC, by itself and in conjunction with EED, to obtain further improvements.
doi_str_mv 10.7873/DATE.2013.195
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6513641</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6513641</ieee_id><sourcerecordid>6513641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c219t-b30f05762afbc9f93456b91df90ccb839c0eac6664a6edfa33dce3c25bb171233</originalsourceid><addsrcrecordid>eNotjz1PwzAURc2XRCkdmVg8wpDglxfb8ViVFpAqMdDOlePYbSCJIzuR4N_TCqY7nHuudAm5A5bKQuLT83yzTDMGmILiZ2SmZIGqAI6SMXZOJsB5kQAwuCA3kAuJnElglyeALAGu4JrMYvw8tgFQZZhNyNc21t2e2u--qU09UD8O_ThQ49tehzr6LlLnA3V6bAY6-MYG3Q00moOtxuZkPqw2H4_Ud7T1lQ0dPdT7Q9LbcLRa3RlL--CNjdGHeEuunG6inf3nlGxXy83iNVm_v7wt5uvEZKCGpETmGJci0640yinMuSgVVE4xY8rjZcOsNkKIXAtbOY1YGYsm42UJEjLEKbn_262ttbs-1K0OPzvBAUUO-As1Dl5o</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Using explicit output comparisons for fault tolerant scheduling (FTS) on modern high-performance processors</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Gao, Yue ; Gupta, Sandeep K. ; Breuer, Melvin A.</creator><creatorcontrib>Gao, Yue ; Gupta, Sandeep K. ; Breuer, Melvin A.</creatorcontrib><description>Soft errors and errors caused by intermittent faults are a major concern for modern processors. In this paper we provide a drastically different approach for fault tolerant scheduling (FTS) of tasks in such processors. Traditionally in FTS, error detection is performed implicitly and concurrently with task execution, and associated overheads are incurred as increases in software run-time or hardware area. However, such embedded error detection (EED) techniques, e.g., watchdog processor assisted control flow checking, only provide approximately 70% error coverage [1, 2]. We propose the idea of utilizing straightforward explicit output comparison (EOC) which provides nearly 100% error coverage. 
We construct a framework for utilizing EOC in FTS, identify new challenges and tradeoffs, and develop a new off-line scheduling algorithm for EOC. We show that our EOC based approach provides higher error coverage and an average performance improvement of nearly 10% over EED-based FTS approaches, without increasing resource requirements. In our ongoing research we are identifying a richer set of ways of applying EOC, by itself and in conjunction with EED, to obtain further improvements.</description><identifier>ISSN: 1530-1591</identifier><identifier>ISBN: 1467350710</identifier><identifier>ISBN: 9781467350716</identifier><identifier>EISSN: 1558-1101</identifier><identifier>EISBN: 9783981537000</identifier><identifier>EISBN: 3981537009</identifier><identifier>DOI: 10.7873/DATE.2013.195</identifier><language>eng</language><publisher>IEEE</publisher><subject>Circuit faults ; Fault tolerance ; Fault tolerant systems ; Hardware ; Processor scheduling ; Program processors ; Schedules</subject><ispartof>2013 Design, Automation &amp; Test in Europe Conference &amp; Exhibition (DATE), 2013, p.927-932</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c219t-b30f05762afbc9f93456b91df90ccb839c0eac6664a6edfa33dce3c25bb171233</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6513641$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6513641$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Gao, Yue</creatorcontrib><creatorcontrib>Gupta, Sandeep K.</creatorcontrib><creatorcontrib>Breuer, Melvin A.</creatorcontrib><title>Using explicit output comparisons for fault tolerant scheduling (FTS) on modern high-performance 
processors</title><title>2013 Design, Automation &amp; Test in Europe Conference &amp; Exhibition (DATE)</title><addtitle>DATE</addtitle><description>Soft errors and errors caused by intermittent faults are a major concern for modern processors. In this paper we provide a drastically different approach for fault tolerant scheduling (FTS) of tasks in such processors. Traditionally in FTS, error detection is performed implicitly and concurrently with task execution, and associated overheads are incurred as increases in software run-time or hardware area. However, such embedded error detection (EED) techniques, e.g., watchdog processor assisted control flow checking, only provide approximately 70% error coverage [1, 2]. We propose the idea of utilizing straightforward explicit output comparison (EOC) which provides nearly 100% error coverage. We construct a framework for utilizing EOC in FTS, identify new challenges and tradeoffs, and develop a new off-line scheduling algorithm for EOC. We show that our EOC based approach provides higher error coverage and an average performance improvement of nearly 10% over EED-based FTS approaches, without increasing resource requirements. 
In our ongoing research we are identifying a richer set of ways of applying EOC, by itself and in conjunction with EED, to obtain further improvements.</description><subject>Circuit faults</subject><subject>Fault tolerance</subject><subject>Fault tolerant systems</subject><subject>Hardware</subject><subject>Processor scheduling</subject><subject>Program processors</subject><subject>Schedules</subject><issn>1530-1591</issn><issn>1558-1101</issn><isbn>1467350710</isbn><isbn>9781467350716</isbn><isbn>9783981537000</isbn><isbn>3981537009</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2013</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotjz1PwzAURc2XRCkdmVg8wpDglxfb8ViVFpAqMdDOlePYbSCJIzuR4N_TCqY7nHuudAm5A5bKQuLT83yzTDMGmILiZ2SmZIGqAI6SMXZOJsB5kQAwuCA3kAuJnElglyeALAGu4JrMYvw8tgFQZZhNyNc21t2e2u--qU09UD8O_ThQ49tehzr6LlLnA3V6bAY6-MYG3Q00moOtxuZkPqw2H4_Ud7T1lQ0dPdT7Q9LbcLRa3RlL--CNjdGHeEuunG6inf3nlGxXy83iNVm_v7wt5uvEZKCGpETmGJci0640yinMuSgVVE4xY8rjZcOsNkKIXAtbOY1YGYsm42UJEjLEKbn_262ttbs-1K0OPzvBAUUO-As1Dl5o</recordid><startdate>201303</startdate><enddate>201303</enddate><creator>Gao, Yue</creator><creator>Gupta, Sandeep K.</creator><creator>Breuer, Melvin A.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201303</creationdate><title>Using explicit output comparisons for fault tolerant scheduling (FTS) on modern high-performance processors</title><author>Gao, Yue ; Gupta, Sandeep K. 
; Breuer, Melvin A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c219t-b30f05762afbc9f93456b91df90ccb839c0eac6664a6edfa33dce3c25bb171233</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Circuit faults</topic><topic>Fault tolerance</topic><topic>Fault tolerant systems</topic><topic>Hardware</topic><topic>Processor scheduling</topic><topic>Program processors</topic><topic>Schedules</topic><toplevel>online_resources</toplevel><creatorcontrib>Gao, Yue</creatorcontrib><creatorcontrib>Gupta, Sandeep K.</creatorcontrib><creatorcontrib>Breuer, Melvin A.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gao, Yue</au><au>Gupta, Sandeep K.</au><au>Breuer, Melvin A.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Using explicit output comparisons for fault tolerant scheduling (FTS) on modern high-performance processors</atitle><btitle>2013 Design, Automation &amp; Test in Europe Conference &amp; Exhibition (DATE)</btitle><stitle>DATE</stitle><date>2013-03</date><risdate>2013</risdate><spage>927</spage><epage>932</epage><pages>927-932</pages><issn>1530-1591</issn><eissn>1558-1101</eissn><isbn>1467350710</isbn><isbn>9781467350716</isbn><eisbn>9783981537000</eisbn><eisbn>3981537009</eisbn><abstract>Soft errors and errors caused by intermittent faults are a major concern for modern processors. 
In this paper we provide a drastically different approach for fault tolerant scheduling (FTS) of tasks in such processors. Traditionally in FTS, error detection is performed implicitly and concurrently with task execution, and associated overheads are incurred as increases in software run-time or hardware area. However, such embedded error detection (EED) techniques, e.g., watchdog processor assisted control flow checking, only provide approximately 70% error coverage [1, 2]. We propose the idea of utilizing straightforward explicit output comparison (EOC) which provides nearly 100% error coverage. We construct a framework for utilizing EOC in FTS, identify new challenges and tradeoffs, and develop a new off-line scheduling algorithm for EOC. We show that our EOC based approach provides higher error coverage and an average performance improvement of nearly 10% over EED-based FTS approaches, without increasing resource requirements. In our ongoing research we are identifying a richer set of ways of applying EOC, by itself and in conjunction with EED, to obtain further improvements.</abstract><pub>IEEE</pub><doi>10.7873/DATE.2013.195</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1530-1591
ispartof 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2013, p.927-932
issn 1530-1591
1558-1101
language eng
recordid cdi_ieee_primary_6513641
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Circuit faults
Fault tolerance
Fault tolerant systems
Hardware
Processor scheduling
Program processors
Schedules
title Using explicit output comparisons for fault tolerant scheduling (FTS) on modern high-performance processors
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T09%3A10%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Using%20explicit%20output%20comparisons%20for%20fault%20tolerant%20scheduling%20(FTS)%20on%20modern%20high-performance%20processors&rft.btitle=2013%20Design,%20Automation%20&%20Test%20in%20Europe%20Conference%20&%20Exhibition%20(DATE)&rft.au=Gao,%20Yue&rft.date=2013-03&rft.spage=927&rft.epage=932&rft.pages=927-932&rft.issn=1530-1591&rft.eissn=1558-1101&rft.isbn=1467350710&rft.isbn_list=9781467350716&rft_id=info:doi/10.7873/DATE.2013.195&rft_dat=%3Cieee_6IE%3E6513641%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9783981537000&rft.eisbn_list=3981537009&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6513641&rfr_iscdi=true
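
The abstract in this record describes explicit output comparison (EOC): rather than relying on checks embedded in a task's execution, the outputs of redundant executions are compared directly. The sketch below is only an illustration of that general idea, not the authors' framework or scheduling algorithm. The names `run_with_eoc`, `make_faulty_sum`, and `UnrecoveredError`, the two-copy comparison, and the retry-on-mismatch recovery policy are all assumptions made for this example.

```python
from typing import Callable, Iterable, Set, TypeVar

T = TypeVar("T")


class UnrecoveredError(RuntimeError):
    """Raised when two copies of a task never agree within the attempt budget."""


def run_with_eoc(task: Callable[[], T], max_attempts: int = 3) -> T:
    """Run `task` twice per attempt and accept the result only if both outputs match.

    Assumption: recovery is plain re-execution of both copies on a mismatch.
    """
    for _ in range(max_attempts):
        first = task()
        second = task()
        if first == second:  # the explicit output comparison
            return first
    raise UnrecoveredError(f"outputs disagreed in all {max_attempts} attempts")


def make_faulty_sum(data: Iterable[int], faulty_calls: Set[int]) -> Callable[[], int]:
    """Hypothetical task: sums `data`, but returns a corrupted value on the
    call numbers listed in `faulty_calls` (1-based), to mimic a soft error."""
    values = list(data)
    calls = 0

    def task() -> int:
        nonlocal calls
        calls += 1
        result = sum(values)
        return result + 1 if calls in faulty_calls else result

    return task


if __name__ == "__main__":
    # The second execution is corrupted, so the first comparison fails
    # and the task pair is re-executed; the second pair agrees.
    print(run_with_eoc(make_faulty_sum([1, 2, 3], faulty_calls={2})))  # prints 6
```

In this sketch an error is detected only when the two outputs differ, and the cost of detection is the second execution plus one comparison per attempt. How such redundant executions and comparisons are placed in a schedule is the subject of the paper itself and is not modeled here.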