Plan B: Design Methodology for Cyber-Physical Systems Robust to Timing Failures

Many Cyber-Physical Systems (CPS) have timing constraints that must be met by the cyber components (software and the network) to ensure safety. It is a tedious job to check if a CPS meets its timing requirement especially when it is distributed and the software and/or the underlying computing platfo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on cyber-physical systems 2022-09, Vol.6 (3), p.1-39, Article 21
Hauptverfasser: Khayatian, Mohammad, Mehrabian, Mohammadreza, Andert, Edward, Grimsley, Reese, Liang, Kyle, Hu, Yi, McCormack, Ian, Joe-Wong, Carlee, Aldrich, Jonathan, Iannucci, Bob, Shrivastava, Aviral
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 39
container_issue 3
container_start_page 1
container_title ACM transactions on cyber-physical systems
container_volume 6
creator Khayatian, Mohammad
Mehrabian, Mohammadreza
Andert, Edward
Grimsley, Reese
Liang, Kyle
Hu, Yi
McCormack, Ian
Joe-Wong, Carlee
Aldrich, Jonathan
Iannucci, Bob
Shrivastava, Aviral
description Many Cyber-Physical Systems (CPS) have timing constraints that must be met by the cyber components (software and the network) to ensure safety. It is a tedious job to check if a CPS meets its timing requirement especially when it is distributed and the software and/or the underlying computing platforms are complex. Furthermore, the system design is brittle since a timing failure can still happen (e.g., network failure, soft error bit flip). In this article, we propose a new design methodology called Plan B where timing constraints of the CPS are monitored at runtime, and a proper backup routine is executed when a timing failure happens to ensure safety. We provide a model on how to express the desired timing behavior using a set of timing constructs in a C/C++ code and how to efficiently monitor them at the runtime. We showcase the effectiveness of our approach by conducting experiments on three case studies: (1) the full software stack for autonomous driving (Apollo), (2) a multi-agent system with 1/10th-scale model robots, and (3) a quadrotor for search and rescue application. We show that the system remains safe and stable even when intentional faults are injected to cause a timing failure. We also demonstrate that the system can achieve graceful degradation when a less extreme timing failure happens.
doi_str_mv 10.1145/3516449
format Article
fullrecord <record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3516449</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3516449</sourcerecordid><originalsourceid>FETCH-LOGICAL-a277t-2829c44e0f7a7da341f65e457beda5cb46bdb7ac33a99d8aa4bae2670bd6b2f63</originalsourceid><addsrcrecordid>eNo9kD1PwzAYhC0EElWp2Jm8MQX8FTthg0ABqagVFIkteh3baVASIzsd8u8paulyd9I9uuEQuqTkhlKR3vKUSiHyEzRhXGVJLnl2eszs6xzNYvwmhFAld6omaLlqoccPd_jRxqbu8ZsdNt741tcjdj7gYtQ2JKvNGJsKWvwxxsF2Eb97vY0DHjxeN13T13gOTbsNNl6gMwdttLODT9Hn_GldvCSL5fNrcb9IgCk1JCxjeSWEJU6BMsAFdTK1IlXaGkgrLaQ2WkHFOeS5yQCEBsukItpIzZzkU3S9362CjzFYV_6EpoMwlpSUf1eUhyt25NWehKo7Qv_lL4G7WRM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Plan B: Design Methodology for Cyber-Physical Systems Robust to Timing Failures</title><source>ACM Digital Library Complete</source><creator>Khayatian, Mohammad ; Mehrabian, Mohammadreza ; Andert, Edward ; Grimsley, Reese ; Liang, Kyle ; Hu, Yi ; McCormack, Ian ; Joe-Wong, Carlee ; Aldrich, Jonathan ; Iannucci, Bob ; Shrivastava, Aviral</creator><creatorcontrib>Khayatian, Mohammad ; Mehrabian, Mohammadreza ; Andert, Edward ; Grimsley, Reese ; Liang, Kyle ; Hu, Yi ; McCormack, Ian ; Joe-Wong, Carlee ; Aldrich, Jonathan ; Iannucci, Bob ; Shrivastava, Aviral</creatorcontrib><description>Many Cyber-Physical Systems (CPS) have timing constraints that must be met by the cyber components (software and the network) to ensure safety. It is a tedious job to check if a CPS meets its timing requirement especially when it is distributed and the software and/or the underlying computing platforms are complex. Furthermore, the system design is brittle since a timing failure can still happen (e.g., network failure, soft error bit flip). In this article, we propose a new design methodology called Plan B where timing constraints of the CPS are monitored at runtime, and a proper backup routine is executed when a timing failure happens to ensure safety. We provide a model on how to express the desired timing behavior using a set of timing constructs in a C/C++ code and how to efficiently monitor them at the runtime. We showcase the effectiveness of our approach by conducting experiments on three case studies: (1) the full software stack for autonomous driving (Apollo), (2) a multi-agent system with 1/10th-scale model robots, and (3) a quadrotor for search and rescue application. We show that the system remains safe and stable even when intentional faults are injected to cause a timing failure. We also demonstrate that the system can achieve graceful degradation when a less extreme timing failure happens.</description><identifier>ISSN: 2378-962X</identifier><identifier>EISSN: 2378-9638</identifier><identifier>DOI: 10.1145/3516449</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computer systems organization ; Embedded and cyber-physical systems</subject><ispartof>ACM transactions on cyber-physical systems, 2022-09, Vol.6 (3), p.1-39, Article 21</ispartof><rights>Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a277t-2829c44e0f7a7da341f65e457beda5cb46bdb7ac33a99d8aa4bae2670bd6b2f63</citedby><cites>FETCH-LOGICAL-a277t-2829c44e0f7a7da341f65e457beda5cb46bdb7ac33a99d8aa4bae2670bd6b2f63</cites><orcidid>0000-0003-4134-5008 ; 0000-0002-5010-2377 ; 0000-0002-9081-9049</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://dl.acm.org/doi/pdf/10.1145/3516449$$EPDF$$P50$$Gacm$$Hfree_for_read</linktopdf><link.rule.ids>314,778,782,2278,27911,27912,40183,75983</link.rule.ids></links><search><creatorcontrib>Khayatian, Mohammad</creatorcontrib><creatorcontrib>Mehrabian, Mohammadreza</creatorcontrib><creatorcontrib>Andert, Edward</creatorcontrib><creatorcontrib>Grimsley, Reese</creatorcontrib><creatorcontrib>Liang, Kyle</creatorcontrib><creatorcontrib>Hu, Yi</creatorcontrib><creatorcontrib>McCormack, Ian</creatorcontrib><creatorcontrib>Joe-Wong, Carlee</creatorcontrib><creatorcontrib>Aldrich, Jonathan</creatorcontrib><creatorcontrib>Iannucci, Bob</creatorcontrib><creatorcontrib>Shrivastava, Aviral</creatorcontrib><title>Plan B: Design Methodology for Cyber-Physical Systems Robust to Timing Failures</title><title>ACM transactions on cyber-physical systems</title><addtitle>ACM TCPS</addtitle><description>Many Cyber-Physical Systems (CPS) have timing constraints that must be met by the cyber components (software and the network) to ensure safety. It is a tedious job to check if a CPS meets its timing requirement especially when it is distributed and the software and/or the underlying computing platforms are complex. Furthermore, the system design is brittle since a timing failure can still happen (e.g., network failure, soft error bit flip). In this article, we propose a new design methodology called Plan B where timing constraints of the CPS are monitored at runtime, and a proper backup routine is executed when a timing failure happens to ensure safety. We provide a model on how to express the desired timing behavior using a set of timing constructs in a C/C++ code and how to efficiently monitor them at the runtime. We showcase the effectiveness of our approach by conducting experiments on three case studies: (1) the full software stack for autonomous driving (Apollo), (2) a multi-agent system with 1/10th-scale model robots, and (3) a quadrotor for search and rescue application. We show that the system remains safe and stable even when intentional faults are injected to cause a timing failure. We also demonstrate that the system can achieve graceful degradation when a less extreme timing failure happens.</description><subject>Computer systems organization</subject><subject>Embedded and cyber-physical systems</subject><issn>2378-962X</issn><issn>2378-9638</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNo9kD1PwzAYhC0EElWp2Jm8MQX8FTthg0ABqagVFIkteh3baVASIzsd8u8paulyd9I9uuEQuqTkhlKR3vKUSiHyEzRhXGVJLnl2eszs6xzNYvwmhFAld6omaLlqoccPd_jRxqbu8ZsdNt741tcjdj7gYtQ2JKvNGJsKWvwxxsF2Eb97vY0DHjxeN13T13gOTbsNNl6gMwdttLODT9Hn_GldvCSL5fNrcb9IgCk1JCxjeSWEJU6BMsAFdTK1IlXaGkgrLaQ2WkHFOeS5yQCEBsukItpIzZzkU3S9362CjzFYV_6EpoMwlpSUf1eUhyt25NWehKo7Qv_lL4G7WRM</recordid><startdate>20220907</startdate><enddate>20220907</enddate><creator>Khayatian, Mohammad</creator><creator>Mehrabian, Mohammadreza</creator><creator>Andert, Edward</creator><creator>Grimsley, Reese</creator><creator>Liang, Kyle</creator><creator>Hu, Yi</creator><creator>McCormack, Ian</creator><creator>Joe-Wong, Carlee</creator><creator>Aldrich, Jonathan</creator><creator>Iannucci, Bob</creator><creator>Shrivastava, Aviral</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-4134-5008</orcidid><orcidid>https://orcid.org/0000-0002-5010-2377</orcidid><orcidid>https://orcid.org/0000-0002-9081-9049</orcidid></search><sort><creationdate>20220907</creationdate><title>Plan B: Design Methodology for Cyber-Physical Systems Robust to Timing Failures</title><author>Khayatian, Mohammad ; Mehrabian, Mohammadreza ; Andert, Edward ; Grimsley, Reese ; Liang, Kyle ; Hu, Yi ; McCormack, Ian ; Joe-Wong, Carlee ; Aldrich, Jonathan ; Iannucci, Bob ; Shrivastava, Aviral</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a277t-2829c44e0f7a7da341f65e457beda5cb46bdb7ac33a99d8aa4bae2670bd6b2f63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer systems organization</topic><topic>Embedded and cyber-physical systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khayatian, Mohammad</creatorcontrib><creatorcontrib>Mehrabian, Mohammadreza</creatorcontrib><creatorcontrib>Andert, Edward</creatorcontrib><creatorcontrib>Grimsley, Reese</creatorcontrib><creatorcontrib>Liang, Kyle</creatorcontrib><creatorcontrib>Hu, Yi</creatorcontrib><creatorcontrib>McCormack, Ian</creatorcontrib><creatorcontrib>Joe-Wong, Carlee</creatorcontrib><creatorcontrib>Aldrich, Jonathan</creatorcontrib><creatorcontrib>Iannucci, Bob</creatorcontrib><creatorcontrib>Shrivastava, Aviral</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on cyber-physical systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khayatian, Mohammad</au><au>Mehrabian, Mohammadreza</au><au>Andert, Edward</au><au>Grimsley, Reese</au><au>Liang, Kyle</au><au>Hu, Yi</au><au>McCormack, Ian</au><au>Joe-Wong, Carlee</au><au>Aldrich, Jonathan</au><au>Iannucci, Bob</au><au>Shrivastava, Aviral</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Plan B: Design Methodology for Cyber-Physical Systems Robust to Timing Failures</atitle><jtitle>ACM transactions on cyber-physical systems</jtitle><stitle>ACM TCPS</stitle><date>2022-09-07</date><risdate>2022</risdate><volume>6</volume><issue>3</issue><spage>1</spage><epage>39</epage><pages>1-39</pages><artnum>21</artnum><issn>2378-962X</issn><eissn>2378-9638</eissn><abstract>Many Cyber-Physical Systems (CPS) have timing constraints that must be met by the cyber components (software and the network) to ensure safety. It is a tedious job to check if a CPS meets its timing requirement especially when it is distributed and the software and/or the underlying computing platforms are complex. Furthermore, the system design is brittle since a timing failure can still happen (e.g., network failure, soft error bit flip). In this article, we propose a new design methodology called Plan B where timing constraints of the CPS are monitored at runtime, and a proper backup routine is executed when a timing failure happens to ensure safety. We provide a model on how to express the desired timing behavior using a set of timing constructs in a C/C++ code and how to efficiently monitor them at the runtime. We showcase the effectiveness of our approach by conducting experiments on three case studies: (1) the full software stack for autonomous driving (Apollo), (2) a multi-agent system with 1/10th-scale model robots, and (3) a quadrotor for search and rescue application. We show that the system remains safe and stable even when intentional faults are injected to cause a timing failure. We also demonstrate that the system can achieve graceful degradation when a less extreme timing failure happens.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3516449</doi><tpages>39</tpages><orcidid>https://orcid.org/0000-0003-4134-5008</orcidid><orcidid>https://orcid.org/0000-0002-5010-2377</orcidid><orcidid>https://orcid.org/0000-0002-9081-9049</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2378-962X
ispartof ACM transactions on cyber-physical systems, 2022-09, Vol.6 (3), p.1-39, Article 21
issn 2378-962X
2378-9638
language eng
recordid cdi_crossref_primary_10_1145_3516449
source ACM Digital Library Complete
subjects Computer systems organization
Embedded and cyber-physical systems
title Plan B: Design Methodology for Cyber-Physical Systems Robust to Timing Failures
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T14%3A00%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Plan%20B:%20Design%20Methodology%20for%20Cyber-Physical%20Systems%20Robust%20to%20Timing%20Failures&rft.jtitle=ACM%20transactions%20on%20cyber-physical%20systems&rft.au=Khayatian,%20Mohammad&rft.date=2022-09-07&rft.volume=6&rft.issue=3&rft.spage=1&rft.epage=39&rft.pages=1-39&rft.artnum=21&rft.issn=2378-962X&rft.eissn=2378-9638&rft_id=info:doi/10.1145/3516449&rft_dat=%3Cacm_cross%3E3516449%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true