CEC: Continuous eventual checkpointing for data stream processing operators

The checkpoint roll-backward methodology is the underlying technology of several fault-tolerance solutions for continuous stream processing systems today, implemented either using the memories of replica nodes or a distributed file system. In this scheme the recovering node loads its most recent che...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Sebepou, Z., Magoutis, K.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Aggregates Checkpointing Complexity theory Continuous Stream Processing Fault tolerance Fault tolerant systems Instruction sets Production
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	156
container_issue
container_start_page	145
container_title
container_volume
creator	Sebepou, Z. Magoutis, K.
description	The checkpoint roll-backward methodology is the underlying technology of several fault-tolerance solutions for continuous stream processing systems today, implemented either using the memories of replica nodes or a distributed file system. In this scheme the recovering node loads its most recent checkpoint and requests log replay to reach a consistent pre-failure state. Challenges with that technique include its complexity (typically implemented via copy-on-write), the associated overhead (exception handling under state updates), and limits to the frequency of checkpointing. The latter limit affects the amount of information that needs to be replayed leading to long recovery times. In this work we introduce continuous eventual checkpointing (CEC), a novel mechanism to provide fault-tolerance guarantees by taking continuous incremental state checkpoints with minimal pausing of operator processing. We achieve this by separating operator state into independent parts and producing frequent independent partial checkpoints of them. Our results show that our method can achieve low overhead fault-tolerance with adjustable checkpoint intensity, trading off recovery time with performance.
doi_str_mv	10.1109/DSN.2011.5958214
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5958214</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5958214</ieee_id><sourcerecordid>5958214</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-b9295449bee9de670598df0accdb59d5631149d7edf5750faf6df37ff8c3498e3</originalsourceid><addsrcrecordid>eNpVkMlOwzAURc0kUZXukdj4BxI8xMNjh0IZRAULYF058TME2jqyEyT-nlZ0w93cxZGOri4h55yVnDO4vHl5KgXjvFSgrODVAZmBsbwSVQVCSn1IJoIrW0gQ5ugfE3BMJlxJVjBr4ZTMcv5k22gNTNsJeazn9RWt42boNmMcM8Vv3AyjW9H2A9uvPnY78k5DTNS7wdE8JHRr2qfYYs47FHtMbogpn5GT4FYZZ_uekrfb-Wt9Xyye7x7q60XRcaOGogEBajuuQQSP2jAF1gfm2tY3CrzSkvMKvEEflFEsuKB9kCYE28oKLMopufjzdoi47FO3dulnub9G_gLhJ1PQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>CEC: Continuous eventual checkpointing for data stream processing operators</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Sebepou, Z. ; Magoutis, K.</creator><creatorcontrib>Sebepou, Z. ; Magoutis, K.</creatorcontrib><description>The checkpoint roll-backward methodology is the underlying technology of several fault-tolerance solutions for continuous stream processing systems today, implemented either using the memories of replica nodes or a distributed file system. In this scheme the recovering node loads its most recent checkpoint and requests log replay to reach a consistent pre-failure state. Challenges with that technique include its complexity (typically implemented via copy-on-write), the associated overhead (exception handling under state updates), and limits to the frequency of checkpointing. The latter limit affects the amount of information that needs to be replayed leading to long recovery times. In this work we introduce continuous eventual checkpointing (CEC), a novel mechanism to provide fault-tolerance guarantees by taking continuous incremental state checkpoints with minimal pausing of operator processing. We achieve this by separating operator state into independent parts and producing frequent independent partial checkpoints of them. Our results show that our method can achieve low overhead fault-tolerance with adjustable checkpoint intensity, trading off recovery time with performance.</description><identifier>ISSN: 1530-0889</identifier><identifier>ISBN: 9781424492329</identifier><identifier>ISBN: 1424492327</identifier><identifier>EISSN: 2158-3927</identifier><identifier>EISBN: 9781424492336</identifier><identifier>EISBN: 1424492319</identifier><identifier>EISBN: 9781424492312</identifier><identifier>EISBN: 1424492335</identifier><identifier>DOI: 10.1109/DSN.2011.5958214</identifier><language>eng</language><publisher>IEEE</publisher><subject>Aggregates ; Checkpointing ; Complexity theory ; Continuous Stream Processing ; Fault tolerance ; Fault tolerant systems ; Instruction sets ; Production</subject><ispartof>2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN), 2011, p.145-156</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5958214$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>310,311,781,785,790,791,2059,27930,54925</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5958214$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Sebepou, Z.</creatorcontrib><creatorcontrib>Magoutis, K.</creatorcontrib><title>CEC: Continuous eventual checkpointing for data stream processing operators</title><title>2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN)</title><addtitle>DSN</addtitle><description>The checkpoint roll-backward methodology is the underlying technology of several fault-tolerance solutions for continuous stream processing systems today, implemented either using the memories of replica nodes or a distributed file system. In this scheme the recovering node loads its most recent checkpoint and requests log replay to reach a consistent pre-failure state. Challenges with that technique include its complexity (typically implemented via copy-on-write), the associated overhead (exception handling under state updates), and limits to the frequency of checkpointing. The latter limit affects the amount of information that needs to be replayed leading to long recovery times. In this work we introduce continuous eventual checkpointing (CEC), a novel mechanism to provide fault-tolerance guarantees by taking continuous incremental state checkpoints with minimal pausing of operator processing. We achieve this by separating operator state into independent parts and producing frequent independent partial checkpoints of them. Our results show that our method can achieve low overhead fault-tolerance with adjustable checkpoint intensity, trading off recovery time with performance.</description><subject>Aggregates</subject><subject>Checkpointing</subject><subject>Complexity theory</subject><subject>Continuous Stream Processing</subject><subject>Fault tolerance</subject><subject>Fault tolerant systems</subject><subject>Instruction sets</subject><subject>Production</subject><issn>1530-0889</issn><issn>2158-3927</issn><isbn>9781424492329</isbn><isbn>1424492327</isbn><isbn>9781424492336</isbn><isbn>1424492319</isbn><isbn>9781424492312</isbn><isbn>1424492335</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpVkMlOwzAURc0kUZXukdj4BxI8xMNjh0IZRAULYF058TME2jqyEyT-nlZ0w93cxZGOri4h55yVnDO4vHl5KgXjvFSgrODVAZmBsbwSVQVCSn1IJoIrW0gQ5ugfE3BMJlxJVjBr4ZTMcv5k22gNTNsJeazn9RWt42boNmMcM8Vv3AyjW9H2A9uvPnY78k5DTNS7wdE8JHRr2qfYYs47FHtMbogpn5GT4FYZZ_uekrfb-Wt9Xyye7x7q60XRcaOGogEBajuuQQSP2jAF1gfm2tY3CrzSkvMKvEEflFEsuKB9kCYE28oKLMopufjzdoi47FO3dulnub9G_gLhJ1PQ</recordid><startdate>201106</startdate><enddate>201106</enddate><creator>Sebepou, Z.</creator><creator>Magoutis, K.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201106</creationdate><title>CEC: Continuous eventual checkpointing for data stream processing operators</title><author>Sebepou, Z. ; Magoutis, K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-b9295449bee9de670598df0accdb59d5631149d7edf5750faf6df37ff8c3498e3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Aggregates</topic><topic>Checkpointing</topic><topic>Complexity theory</topic><topic>Continuous Stream Processing</topic><topic>Fault tolerance</topic><topic>Fault tolerant systems</topic><topic>Instruction sets</topic><topic>Production</topic><toplevel>online_resources</toplevel><creatorcontrib>Sebepou, Z.</creatorcontrib><creatorcontrib>Magoutis, K.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sebepou, Z.</au><au>Magoutis, K.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>CEC: Continuous eventual checkpointing for data stream processing operators</atitle><btitle>2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN)</btitle><stitle>DSN</stitle><date>2011-06</date><risdate>2011</risdate><spage>145</spage><epage>156</epage><pages>145-156</pages><issn>1530-0889</issn><eissn>2158-3927</eissn><isbn>9781424492329</isbn><isbn>1424492327</isbn><eisbn>9781424492336</eisbn><eisbn>1424492319</eisbn><eisbn>9781424492312</eisbn><eisbn>1424492335</eisbn><abstract>The checkpoint roll-backward methodology is the underlying technology of several fault-tolerance solutions for continuous stream processing systems today, implemented either using the memories of replica nodes or a distributed file system. In this scheme the recovering node loads its most recent checkpoint and requests log replay to reach a consistent pre-failure state. Challenges with that technique include its complexity (typically implemented via copy-on-write), the associated overhead (exception handling under state updates), and limits to the frequency of checkpointing. The latter limit affects the amount of information that needs to be replayed leading to long recovery times. In this work we introduce continuous eventual checkpointing (CEC), a novel mechanism to provide fault-tolerance guarantees by taking continuous incremental state checkpoints with minimal pausing of operator processing. We achieve this by separating operator state into independent parts and producing frequent independent partial checkpoints of them. Our results show that our method can achieve low overhead fault-tolerance with adjustable checkpoint intensity, trading off recovery time with performance.</abstract><pub>IEEE</pub><doi>10.1109/DSN.2011.5958214</doi><tpages>12</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1530-0889
ispartof	2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN), 2011, p.145-156
issn	1530-0889 2158-3927
language	eng
recordid	cdi_ieee_primary_5958214
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Aggregates Checkpointing Complexity theory Continuous Stream Processing Fault tolerance Fault tolerant systems Instruction sets Production
title	CEC: Continuous eventual checkpointing for data stream processing operators
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-12T13%3A58%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=CEC:%20Continuous%20eventual%20checkpointing%20for%20data%20stream%20processing%20operators&rft.btitle=2011%20IEEE/IFIP%2041st%20International%20Conference%20on%20Dependable%20Systems%20&%20Networks%20(DSN)&rft.au=Sebepou,%20Z.&rft.date=2011-06&rft.spage=145&rft.epage=156&rft.pages=145-156&rft.issn=1530-0889&rft.eissn=2158-3927&rft.isbn=9781424492329&rft.isbn_list=1424492327&rft_id=info:doi/10.1109/DSN.2011.5958214&rft_dat=%3Cieee_6IE%3E5958214%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9781424492336&rft.eisbn_list=1424492319&rft.eisbn_list=9781424492312&rft.eisbn_list=1424492335&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5958214&rfr_iscdi=true