Adaptive request scheduling for the I/O forwarding layer using reinforcement learning

In this paper, we propose an approach to adapt the I/O forwarding layer of HPC systems to applications’ access patterns. I/O optimization techniques can improve performance for the access patterns they were designed to target, but they often decrease performance for others. Furthermore, these techni...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Future generation computer systems 2020-11, Vol.112, p.1156-1169
Hauptverfasser:	Bez, Jean Luca, Zanon Boito, Francieli, Nou, Ramon, Miranda, Alberto, Cortes, Toni, Navaux, Philippe O.A.
Format:	Artikel
Sprache:	eng
Schlagworte:	Auto-tuning Computer Science Distributed, Parallel, and Cluster Computing High performance I/O I/O forwarding I/O scheduling Parallel I/O Reinforcement learning
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper, we propose an approach to adapt the I/O forwarding layer of HPC systems to applications’ access patterns. I/O optimization techniques can improve performance for the access patterns they were designed to target, but they often decrease performance for others. Furthermore, these techniques usually depend on the precise tune of their parameters, which commonly falls back to the users. Instead, we propose to do it dynamically at runtime based on the I/O workload observed by the system. Our approach uses a reinforcement learning technique – contextual bandits – to make the system capable of learning the best parameter value to each observed access pattern during its execution. That eliminates the need of a complicated and time-consuming previous training phase. Our case study is the TWINS scheduling algorithm, where performance improvements depend on the time window parameter, which in turn depends on the workload. We evaluate our proposal and demonstrate it can reach a precision of 88% on the parameter selection in the first hundreds of observations of an access pattern, achieving 99% of the optimal performance. We demonstrate that the system – which is expected to live for years – will be able to adapt to changes and optimize its performance after having observed an access pattern for a few (not necessarily contiguous) minutes. •We propose to adapt the I/O forwarding layer of HPC systems to the current workload.•We use reinforcement learning to make the system capable of learning the best choices.•We aim at avoiding a time-consuming and expensive training step.•We also aim at removing from the users the burden of manually tuning I/O related parameters.•The system can achieve 99% of the optimal performance with automatic parameter selection.
ISSN:	0167-739X 1872-7115
DOI:	10.1016/j.future.2020.05.005