NewBalance: Efficient Data Space Management and Algorithmic Optimization for Large-Scale Storage Systems

Fragmentation usually occurs when data space of original storage nodes has to be reallocated to new added storage nodes during the scale-out evolution of the large-scale storage system. It greatly influences its performance and becomes a challenge to manage the whole space. We present an efficient s...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Chinese Journal of Electronics 2017-05, Vol.26 (3), p.493-501
Hauptverfasser: Xu, Guangping, Lin, Sheng, Shi, Kai, Zhang, Hua
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Fragmentation usually occurs when data space of original storage nodes has to be reallocated to new added storage nodes during the scale-out evolution of the large-scale storage system. It greatly influences its performance and becomes a challenge to manage the whole space. We present an efficient space management frame-work, called NewBalance, to reduce fragmentation with the minimum data movement while keeping the storage system load balance. The space management framework has two phases including the collection phase and the allo- cation phase. For the collection phase, we propose a novel algorithm, called the greedy hi-direction collector, which collects enough space for the new storage nodes. For the allocation phase, we formally represent it as a variant of the bin packing problem and then utilize some bin packing heuristics including the first fitting and the best fitting to allocate collected intervals to new added storage nodes. The experimental results show that the amount of intervals can be reduced by 20%-55% and our algorithmic optimization improves the data lookup performance by at least 10~0 and the scale-out performance by 2X-3X.
ISSN:1022-4653
2075-5597
DOI:10.1049/cje.2017.03.012