ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems
The past decade has seen rapid growth of distributed stream data processing systems. Under these systems, a stream application is realized as a Directed Acyclic Graph (DAG) of operators, where the level of parallelism of each operator has a substantial impact on its overall performance. However, fin...
Gespeichert in:
Hauptverfasser: | , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The past decade has seen rapid growth of distributed stream data processing
systems. Under these systems, a stream application is realized as a Directed
Acyclic Graph (DAG) of operators, where the level of parallelism of each
operator has a substantial impact on its overall performance. However, finding
optimal levels of parallelism remains challenging. Most existing methods are
heavily coupled with the topological graph of operators, unable to efficiently
tune under-provisioned jobs. They either insufficiently use previous tuning
experience by treating successively tuning independently, or explore the
configuration space aggressively, violating the Service Level Agreements (SLA).
To address the above problems, we propose ContTune, a continuous tuning
system for stream applications. It is equipped with a novel Big-small
algorithm, in which the Big phase decouples the tuning from the topological
graph by decomposing the job tuning problem into sub-problems that can be
solved concurrently. We propose a conservative Bayesian Optimization (CBO)
technique in the Small phase to speed up the tuning process by utilizing the
previous observations. It leverages the state-of-the-art (SOTA) tuning method
as conservative exploration to avoid SLA violations. Experimental results show
that ContTune reduces up to 60.75% number of reconfigurations under synthetic
workloads and up to 57.5% number of reconfigurations under real workloads,
compared to the SOTA method DS2. |
---|---|
DOI: | 10.48550/arxiv.2309.12239 |