Automatic Reconfiguration for Large-Scale Reliable Storage Systems

Byzantine-fault-tolerant replication enhances the availability and reliability of Internet services that store critical state and preserve it despite attacks or software errors. However, existing Byzantine-fault-tolerant storage systems either assume a static set of replicas, or have limitations in...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on dependable and secure computing 2012-03, Vol.9 (2), p.145-158
Hauptverfasser: Rodrigues, R., Liskov, B., Chen, K., Liskov, M., Schultz, D.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 158
container_issue 2
container_start_page 145
container_title IEEE transactions on dependable and secure computing
container_volume 9
creator Rodrigues, R.
Liskov, B.
Chen, K.
Liskov, M.
Schultz, D.
description Byzantine-fault-tolerant replication enhances the availability and reliability of Internet services that store critical state and preserve it despite attacks or software errors. However, existing Byzantine-fault-tolerant storage systems either assume a static set of replicas, or have limitations in how they handle reconfigurations (e.g., in terms of the scalability of the solutions or the consistency levels they provide). This can be problematic in long-lived, large-scale systems where system membership is likely to change during the system lifetime. In this paper, we present a complete solution for dynamically changing system membership in a large-scale Byzantine-fault-tolerant system. We present a service that tracks system membership and periodically notifies other system nodes of membership changes. The membership service runs mostly automatically, to avoid human configuration errors; is itself Byzantine-fault-tolerant and reconfigurable; and provides applications with a sequence of consistent views of the system membership. We demonstrate the utility of this membership service by using it in a novel distributed hash table called dBQS that provides atomic semantics even across changes in replica sets. dBQS is interesting in its own right because its storage algorithms extend existing Byzantine quorum protocols to handle changes in the replica set, and because it differs from previous DHTs by providing Byzantine fault tolerance and offering strong semantics. We implemented the membership service and dBQS. Our results show that the approach works well, in practice: the membership service is able to manage a large system and the cost to change the system membership is low.
doi_str_mv 10.1109/TDSC.2010.52
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_5593239</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5593239</ieee_id><sourcerecordid>2636237971</sourcerecordid><originalsourceid>FETCH-LOGICAL-c354t-35529033fb8a56cbf076fd0e3116263f5172f6a2bf5f287d914ae5e20a6ffb3</originalsourceid><addsrcrecordid>eNpd0D1PwzAQBmALgUQpbGwslVgYSLHPuSQeoXxKlZBId8tJz1WqJC52MvTf46qIgenu5Een88vYteBzIbh6WD2XiznwOCKcsIlQqUg4F8Vp7DHFBFUuztlFCFvOIS1UOmFPj-PgOjM09eyLatfbZjP6OLp-Zp2fLY3fUFLWpqX43jamik05OG82se7DQF24ZGfWtIGufuuUla8vq8V7svx8-1g8LpNaYjokEhEUl9JWhcGsrizPM7vmJIXIIJMWRQ42M1BZtFDkayVSQ0jATWZtJafs7rh15933SGHQXRNqalvTkxuDFvHbCgqAPNLbf3TrRt_H26KKcSgUkEZ1f1S1dyF4snrnm874fUT6EKc-xKkPcWqEyG-OvCGiP4qoJEglfwAwwG80</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1001895124</pqid></control><display><type>article</type><title>Automatic Reconfiguration for Large-Scale Reliable Storage Systems</title><source>IEEE Electronic Library (IEL)</source><creator>Rodrigues, R. ; Liskov, B. ; Chen, K. ; Liskov, M. ; Schultz, D.</creator><creatorcontrib>Rodrigues, R. ; Liskov, B. ; Chen, K. ; Liskov, M. ; Schultz, D.</creatorcontrib><description>Byzantine-fault-tolerant replication enhances the availability and reliability of Internet services that store critical state and preserve it despite attacks or software errors. However, existing Byzantine-fault-tolerant storage systems either assume a static set of replicas, or have limitations in how they handle reconfigurations (e.g., in terms of the scalability of the solutions or the consistency levels they provide). This can be problematic in long-lived, large-scale systems where system membership is likely to change during the system lifetime. In this paper, we present a complete solution for dynamically changing system membership in a large-scale Byzantine-fault-tolerant system. We present a service that tracks system membership and periodically notifies other system nodes of membership changes. The membership service runs mostly automatically, to avoid human configuration errors; is itself Byzantine-fault-tolerant and reconfigurable; and provides applications with a sequence of consistent views of the system membership. We demonstrate the utility of this membership service by using it in a novel distributed hash table called dBQS that provides atomic semantics even across changes in replica sets. dBQS is interesting in its own right because its storage algorithms extend existing Byzantine quorum protocols to handle changes in the replica set, and because it differs from previous DHTs by providing Byzantine fault tolerance and offering strong semantics. We implemented the membership service and dBQS. Our results show that the approach works well, in practice: the membership service is able to manage a large system and the cost to change the system membership is low.</description><identifier>ISSN: 1545-5971</identifier><identifier>EISSN: 1941-0018</identifier><identifier>DOI: 10.1109/TDSC.2010.52</identifier><identifier>CODEN: ITDSCM</identifier><language>eng</language><publisher>Washington: IEEE</publisher><subject>Analysis ; Byzantine fault tolerance ; Computer centers ; Computer science ; Design ; Digital Object Identifier ; distributed hash tables ; Distributed processing ; dynamic system membership ; Dynamical systems ; Dynamics ; Errors ; Failure ; Fault detection ; Fault tolerance ; Handles ; Information storage ; Internet ; Mathematical models ; Membership renewal ; membership service ; Protocols ; Public key ; Semantics ; Software ; Storage systems ; Studies</subject><ispartof>IEEE transactions on dependable and secure computing, 2012-03, Vol.9 (2), p.145-158</ispartof><rights>Copyright IEEE Computer Society Jan-Mar 2012</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c354t-35529033fb8a56cbf076fd0e3116263f5172f6a2bf5f287d914ae5e20a6ffb3</citedby><cites>FETCH-LOGICAL-c354t-35529033fb8a56cbf076fd0e3116263f5172f6a2bf5f287d914ae5e20a6ffb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5593239$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5593239$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rodrigues, R.</creatorcontrib><creatorcontrib>Liskov, B.</creatorcontrib><creatorcontrib>Chen, K.</creatorcontrib><creatorcontrib>Liskov, M.</creatorcontrib><creatorcontrib>Schultz, D.</creatorcontrib><title>Automatic Reconfiguration for Large-Scale Reliable Storage Systems</title><title>IEEE transactions on dependable and secure computing</title><addtitle>TDSC</addtitle><description>Byzantine-fault-tolerant replication enhances the availability and reliability of Internet services that store critical state and preserve it despite attacks or software errors. However, existing Byzantine-fault-tolerant storage systems either assume a static set of replicas, or have limitations in how they handle reconfigurations (e.g., in terms of the scalability of the solutions or the consistency levels they provide). This can be problematic in long-lived, large-scale systems where system membership is likely to change during the system lifetime. In this paper, we present a complete solution for dynamically changing system membership in a large-scale Byzantine-fault-tolerant system. We present a service that tracks system membership and periodically notifies other system nodes of membership changes. The membership service runs mostly automatically, to avoid human configuration errors; is itself Byzantine-fault-tolerant and reconfigurable; and provides applications with a sequence of consistent views of the system membership. We demonstrate the utility of this membership service by using it in a novel distributed hash table called dBQS that provides atomic semantics even across changes in replica sets. dBQS is interesting in its own right because its storage algorithms extend existing Byzantine quorum protocols to handle changes in the replica set, and because it differs from previous DHTs by providing Byzantine fault tolerance and offering strong semantics. We implemented the membership service and dBQS. Our results show that the approach works well, in practice: the membership service is able to manage a large system and the cost to change the system membership is low.</description><subject>Analysis</subject><subject>Byzantine fault tolerance</subject><subject>Computer centers</subject><subject>Computer science</subject><subject>Design</subject><subject>Digital Object Identifier</subject><subject>distributed hash tables</subject><subject>Distributed processing</subject><subject>dynamic system membership</subject><subject>Dynamical systems</subject><subject>Dynamics</subject><subject>Errors</subject><subject>Failure</subject><subject>Fault detection</subject><subject>Fault tolerance</subject><subject>Handles</subject><subject>Information storage</subject><subject>Internet</subject><subject>Mathematical models</subject><subject>Membership renewal</subject><subject>membership service</subject><subject>Protocols</subject><subject>Public key</subject><subject>Semantics</subject><subject>Software</subject><subject>Storage systems</subject><subject>Studies</subject><issn>1545-5971</issn><issn>1941-0018</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNpd0D1PwzAQBmALgUQpbGwslVgYSLHPuSQeoXxKlZBId8tJz1WqJC52MvTf46qIgenu5Een88vYteBzIbh6WD2XiznwOCKcsIlQqUg4F8Vp7DHFBFUuztlFCFvOIS1UOmFPj-PgOjM09eyLatfbZjP6OLp-Zp2fLY3fUFLWpqX43jamik05OG82se7DQF24ZGfWtIGufuuUla8vq8V7svx8-1g8LpNaYjokEhEUl9JWhcGsrizPM7vmJIXIIJMWRQ42M1BZtFDkayVSQ0jATWZtJafs7rh15933SGHQXRNqalvTkxuDFvHbCgqAPNLbf3TrRt_H26KKcSgUkEZ1f1S1dyF4snrnm874fUT6EKc-xKkPcWqEyG-OvCGiP4qoJEglfwAwwG80</recordid><startdate>20120301</startdate><enddate>20120301</enddate><creator>Rodrigues, R.</creator><creator>Liskov, B.</creator><creator>Chen, K.</creator><creator>Liskov, M.</creator><creator>Schultz, D.</creator><general>IEEE</general><general>IEEE Computer Society</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L6V</scope><scope>M0C</scope><scope>M0N</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYYUZ</scope><scope>Q9U</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20120301</creationdate><title>Automatic Reconfiguration for Large-Scale Reliable Storage Systems</title><author>Rodrigues, R. ; Liskov, B. ; Chen, K. ; Liskov, M. ; Schultz, D.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c354t-35529033fb8a56cbf076fd0e3116263f5172f6a2bf5f287d914ae5e20a6ffb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Analysis</topic><topic>Byzantine fault tolerance</topic><topic>Computer centers</topic><topic>Computer science</topic><topic>Design</topic><topic>Digital Object Identifier</topic><topic>distributed hash tables</topic><topic>Distributed processing</topic><topic>dynamic system membership</topic><topic>Dynamical systems</topic><topic>Dynamics</topic><topic>Errors</topic><topic>Failure</topic><topic>Fault detection</topic><topic>Fault tolerance</topic><topic>Handles</topic><topic>Information storage</topic><topic>Internet</topic><topic>Mathematical models</topic><topic>Membership renewal</topic><topic>membership service</topic><topic>Protocols</topic><topic>Public key</topic><topic>Semantics</topic><topic>Software</topic><topic>Storage systems</topic><topic>Studies</topic><toplevel>online_resources</toplevel><creatorcontrib>Rodrigues, R.</creatorcontrib><creatorcontrib>Liskov, B.</creatorcontrib><creatorcontrib>Chen, K.</creatorcontrib><creatorcontrib>Liskov, M.</creatorcontrib><creatorcontrib>Schultz, D.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>ABI/INFORM Collection China</collection><collection>ProQuest Central Basic</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on dependable and secure computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rodrigues, R.</au><au>Liskov, B.</au><au>Chen, K.</au><au>Liskov, M.</au><au>Schultz, D.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic Reconfiguration for Large-Scale Reliable Storage Systems</atitle><jtitle>IEEE transactions on dependable and secure computing</jtitle><stitle>TDSC</stitle><date>2012-03-01</date><risdate>2012</risdate><volume>9</volume><issue>2</issue><spage>145</spage><epage>158</epage><pages>145-158</pages><issn>1545-5971</issn><eissn>1941-0018</eissn><coden>ITDSCM</coden><abstract>Byzantine-fault-tolerant replication enhances the availability and reliability of Internet services that store critical state and preserve it despite attacks or software errors. However, existing Byzantine-fault-tolerant storage systems either assume a static set of replicas, or have limitations in how they handle reconfigurations (e.g., in terms of the scalability of the solutions or the consistency levels they provide). This can be problematic in long-lived, large-scale systems where system membership is likely to change during the system lifetime. In this paper, we present a complete solution for dynamically changing system membership in a large-scale Byzantine-fault-tolerant system. We present a service that tracks system membership and periodically notifies other system nodes of membership changes. The membership service runs mostly automatically, to avoid human configuration errors; is itself Byzantine-fault-tolerant and reconfigurable; and provides applications with a sequence of consistent views of the system membership. We demonstrate the utility of this membership service by using it in a novel distributed hash table called dBQS that provides atomic semantics even across changes in replica sets. dBQS is interesting in its own right because its storage algorithms extend existing Byzantine quorum protocols to handle changes in the replica set, and because it differs from previous DHTs by providing Byzantine fault tolerance and offering strong semantics. We implemented the membership service and dBQS. Our results show that the approach works well, in practice: the membership service is able to manage a large system and the cost to change the system membership is low.</abstract><cop>Washington</cop><pub>IEEE</pub><doi>10.1109/TDSC.2010.52</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1545-5971
ispartof IEEE transactions on dependable and secure computing, 2012-03, Vol.9 (2), p.145-158
issn 1545-5971
1941-0018
language eng
recordid cdi_ieee_primary_5593239
source IEEE Electronic Library (IEL)
subjects Analysis
Byzantine fault tolerance
Computer centers
Computer science
Design
Digital Object Identifier
distributed hash tables
Distributed processing
dynamic system membership
Dynamical systems
Dynamics
Errors
Failure
Fault detection
Fault tolerance
Handles
Information storage
Internet
Mathematical models
Membership renewal
membership service
Protocols
Public key
Semantics
Software
Storage systems
Studies
title Automatic Reconfiguration for Large-Scale Reliable Storage Systems
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T20%3A07%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20Reconfiguration%20for%20Large-Scale%20Reliable%20Storage%20Systems&rft.jtitle=IEEE%20transactions%20on%20dependable%20and%20secure%20computing&rft.au=Rodrigues,%20R.&rft.date=2012-03-01&rft.volume=9&rft.issue=2&rft.spage=145&rft.epage=158&rft.pages=145-158&rft.issn=1545-5971&rft.eissn=1941-0018&rft.coden=ITDSCM&rft_id=info:doi/10.1109/TDSC.2010.52&rft_dat=%3Cproquest_RIE%3E2636237971%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1001895124&rft_id=info:pmid/&rft_ieee_id=5593239&rfr_iscdi=true