NewBalance: Efficient Data Space Management and Algorithmic Optimization for Large-Scale Storage Systems
Fragmentation usually occurs when data space of original storage nodes has to be reallocated to new added storage nodes during the scale-out evolution of the large-scale storage system. It greatly influences its performance and becomes a challenge to manage the whole space. We present an efficient s...
Gespeichert in:
Veröffentlicht in: | Chinese Journal of Electronics 2017-05, Vol.26 (3), p.493-501 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Fragmentation usually occurs when data space of original storage nodes has to be reallocated to new added storage nodes during the scale-out evolution of the large-scale storage system. It greatly influences its performance and becomes a challenge to manage the whole space. We present an efficient space management frame-work, called NewBalance, to reduce fragmentation with the minimum data movement while keeping the storage system load balance. The space management framework has two phases including the collection phase and the allo- cation phase. For the collection phase, we propose a novel algorithm, called the greedy hi-direction collector, which collects enough space for the new storage nodes. For the allocation phase, we formally represent it as a variant of the bin packing problem and then utilize some bin packing heuristics including the first fitting and the best fitting to allocate collected intervals to new added storage nodes. The experimental results show that the amount of intervals can be reduced by 20%-55% and our algorithmic optimization improves the data lookup performance by at least 10~0 and the scale-out performance by 2X-3X. |
---|---|
ISSN: | 1022-4653 2075-5597 |
DOI: | 10.1049/cje.2017.03.012 |