Cluster-Aware Scattered Repair in Erasure-Coded Storage: Design and Analysis

Bibliographic details
Published in: IEEE Transactions on Computers, 2021-11, Vol. 70 (11), pp. 1861-1874
Main authors: Shen, Zhirong; Lin, Shiyao; Shu, Jiwu; Xie, Chengxin; Huang, Zhijie; Fu, Yingxun
Format: Article
Language: English
Description
Abstract: Erasure coding is a storage-efficient means to guarantee data reliability in today's commodity storage systems, yet its repair performance is seriously hindered by substantial repair traffic. Repair in clustered storage systems is even more complicated because cross-cluster bandwidth is scarce. We present ClusterSR, a cluster-aware scattered repair approach. ClusterSR minimizes the cross-cluster repair traffic by carefully choosing the clusters for reading and repairing chunks. It further balances the cross-cluster repair traffic by scheduling the repair of multiple chunks. Large-scale simulation and Alibaba Cloud ECS experiments show that ClusterSR can reduce the cross-cluster repair traffic by 5.6-52.7 percent and improve the repair throughput by 14.4-68.8 percent.
ISSN: 0018-9340, 1557-9956
DOI: 10.1109/TC.2020.3028353