Differential RAID: Rethinking RAID for SSD reliability

SSDs exhibit very different failure characteristics compared to hard drives. In particular, the bit error rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By balancing writes evenly across the array, RAID schemes...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on storage 2010-07, Vol.6 (2), p.1-22
Hauptverfasser: Balakrishnan, Mahesh, Kadav, Asim, Prabhakaran, Vijayan, Malkhi, Dahlia
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 22
container_issue 2
container_start_page 1
container_title ACM transactions on storage
container_volume 6
creator Balakrishnan, Mahesh
Kadav, Asim
Prabhakaran, Vijayan
Malkhi, Dahlia
description SSDs exhibit very different failure characteristics compared to hard drives. In particular, the bit error rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By balancing writes evenly across the array, RAID schemes can wear out devices at similar times. When a device in the array fails towards the end of its lifetime, the high BER of the remaining devices can result in data loss. We propose Diff-RAID, a parity-based redundancy solution that creates an age differential in an array of SSDs. Diff-RAID distributes parity blocks unevenly across the array, leveraging their higher update rate to age devices at different rates. To maintain this age differential when old devices are replaced by new ones, Diff-RAID reshuffles the parity distribution on each drive replacement. We evaluate Diff-RAID's reliability by using real BER data from 12 flash chips on a simulator and show that it is more reliable than RAID-5, in some cases by multiple orders of magnitude. We also evaluate Diff-RAID's performance using a software implementation on a 5-device array of 80 GB Intel X25-M SSDs and show that it offers a trade-off between throughput and reliability.
doi_str_mv 10.1145/1807060.1807061
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1145_1807060_1807061</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1145_1807060_1807061</sourcerecordid><originalsourceid>FETCH-LOGICAL-c195t-137c20711256a637439ac02aab208517079816efd2c55ee3d207b008e0b11bee3</originalsourceid><addsrcrecordid>eNo9j82LwjAUxMOi4Md68LRX_4Ha9_L6kvYofoMgiHsuaUygi7sriRf_e6sWT7-ZYRgYIb4QpogZp5iDBtWYJ_FD9JGZEoKCOm-tdU8MYvwBICUz7ovxovbeBfd3rc15cphtF5-i6805ulHLofheLY_zTbLbr7fz2S6xWPA1QdJWgkaUrIwinVFhLEhjKgk5owZd5KicP0nL7BydmnIFkDuoEKsmGIr0tWvDf4zB-fIS6l8TbiVC-XhUto9aIt0B2zU6Rg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Differential RAID: Rethinking RAID for SSD reliability</title><source>ACM Digital Library Complete</source><creator>Balakrishnan, Mahesh ; Kadav, Asim ; Prabhakaran, Vijayan ; Malkhi, Dahlia</creator><creatorcontrib>Balakrishnan, Mahesh ; Kadav, Asim ; Prabhakaran, Vijayan ; Malkhi, Dahlia</creatorcontrib><description>SSDs exhibit very different failure characteristics compared to hard drives. In particular, the bit error rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By balancing writes evenly across the array, RAID schemes can wear out devices at similar times. When a device in the array fails towards the end of its lifetime, the high BER of the remaining devices can result in data loss. We propose Diff-RAID, a parity-based redundancy solution that creates an age differential in an array of SSDs. Diff-RAID distributes parity blocks unevenly across the array, leveraging their higher update rate to age devices at different rates. To maintain this age differential when old devices are replaced by new ones, Diff-RAID reshuffles the parity distribution on each drive replacement. We evaluate Diff-RAID's reliability by using real BER data from 12 flash chips on a simulator and show that it is more reliable than RAID-5, in some cases by multiple orders of magnitude. We also evaluate Diff-RAID's performance using a software implementation on a 5-device array of 80 GB Intel X25-M SSDs and show that it offers a trade-off between throughput and reliability.</description><identifier>ISSN: 1553-3077</identifier><identifier>EISSN: 1553-3093</identifier><identifier>DOI: 10.1145/1807060.1807061</identifier><language>eng</language><ispartof>ACM transactions on storage, 2010-07, Vol.6 (2), p.1-22</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c195t-137c20711256a637439ac02aab208517079816efd2c55ee3d207b008e0b11bee3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,778,782,27907,27908</link.rule.ids></links><search><creatorcontrib>Balakrishnan, Mahesh</creatorcontrib><creatorcontrib>Kadav, Asim</creatorcontrib><creatorcontrib>Prabhakaran, Vijayan</creatorcontrib><creatorcontrib>Malkhi, Dahlia</creatorcontrib><title>Differential RAID: Rethinking RAID for SSD reliability</title><title>ACM transactions on storage</title><description>SSDs exhibit very different failure characteristics compared to hard drives. In particular, the bit error rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By balancing writes evenly across the array, RAID schemes can wear out devices at similar times. When a device in the array fails towards the end of its lifetime, the high BER of the remaining devices can result in data loss. We propose Diff-RAID, a parity-based redundancy solution that creates an age differential in an array of SSDs. Diff-RAID distributes parity blocks unevenly across the array, leveraging their higher update rate to age devices at different rates. To maintain this age differential when old devices are replaced by new ones, Diff-RAID reshuffles the parity distribution on each drive replacement. We evaluate Diff-RAID's reliability by using real BER data from 12 flash chips on a simulator and show that it is more reliable than RAID-5, in some cases by multiple orders of magnitude. We also evaluate Diff-RAID's performance using a software implementation on a 5-device array of 80 GB Intel X25-M SSDs and show that it offers a trade-off between throughput and reliability.</description><issn>1553-3077</issn><issn>1553-3093</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNo9j82LwjAUxMOi4Md68LRX_4Ha9_L6kvYofoMgiHsuaUygi7sriRf_e6sWT7-ZYRgYIb4QpogZp5iDBtWYJ_FD9JGZEoKCOm-tdU8MYvwBICUz7ovxovbeBfd3rc15cphtF5-i6805ulHLofheLY_zTbLbr7fz2S6xWPA1QdJWgkaUrIwinVFhLEhjKgk5owZd5KicP0nL7BydmnIFkDuoEKsmGIr0tWvDf4zB-fIS6l8TbiVC-XhUto9aIt0B2zU6Rg</recordid><startdate>201007</startdate><enddate>201007</enddate><creator>Balakrishnan, Mahesh</creator><creator>Kadav, Asim</creator><creator>Prabhakaran, Vijayan</creator><creator>Malkhi, Dahlia</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>201007</creationdate><title>Differential RAID</title><author>Balakrishnan, Mahesh ; Kadav, Asim ; Prabhakaran, Vijayan ; Malkhi, Dahlia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c195t-137c20711256a637439ac02aab208517079816efd2c55ee3d207b008e0b11bee3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Balakrishnan, Mahesh</creatorcontrib><creatorcontrib>Kadav, Asim</creatorcontrib><creatorcontrib>Prabhakaran, Vijayan</creatorcontrib><creatorcontrib>Malkhi, Dahlia</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on storage</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Balakrishnan, Mahesh</au><au>Kadav, Asim</au><au>Prabhakaran, Vijayan</au><au>Malkhi, Dahlia</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Differential RAID: Rethinking RAID for SSD reliability</atitle><jtitle>ACM transactions on storage</jtitle><date>2010-07</date><risdate>2010</risdate><volume>6</volume><issue>2</issue><spage>1</spage><epage>22</epage><pages>1-22</pages><issn>1553-3077</issn><eissn>1553-3093</eissn><abstract>SSDs exhibit very different failure characteristics compared to hard drives. In particular, the bit error rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By balancing writes evenly across the array, RAID schemes can wear out devices at similar times. When a device in the array fails towards the end of its lifetime, the high BER of the remaining devices can result in data loss. We propose Diff-RAID, a parity-based redundancy solution that creates an age differential in an array of SSDs. Diff-RAID distributes parity blocks unevenly across the array, leveraging their higher update rate to age devices at different rates. To maintain this age differential when old devices are replaced by new ones, Diff-RAID reshuffles the parity distribution on each drive replacement. We evaluate Diff-RAID's reliability by using real BER data from 12 flash chips on a simulator and show that it is more reliable than RAID-5, in some cases by multiple orders of magnitude. We also evaluate Diff-RAID's performance using a software implementation on a 5-device array of 80 GB Intel X25-M SSDs and show that it offers a trade-off between throughput and reliability.</abstract><doi>10.1145/1807060.1807061</doi><tpages>22</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1553-3077
ispartof ACM transactions on storage, 2010-07, Vol.6 (2), p.1-22
issn 1553-3077
1553-3093
language eng
recordid cdi_crossref_primary_10_1145_1807060_1807061
source ACM Digital Library Complete
title Differential RAID: Rethinking RAID for SSD reliability
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T12%3A03%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Differential%20RAID:%20Rethinking%20RAID%20for%20SSD%20reliability&rft.jtitle=ACM%20transactions%20on%20storage&rft.au=Balakrishnan,%20Mahesh&rft.date=2010-07&rft.volume=6&rft.issue=2&rft.spage=1&rft.epage=22&rft.pages=1-22&rft.issn=1553-3077&rft.eissn=1553-3093&rft_id=info:doi/10.1145/1807060.1807061&rft_dat=%3Ccrossref%3E10_1145_1807060_1807061%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true