A gray-box modeling methodology for runtime prediction of Apache Spark jobs

Apache Spark jobs are often characterized by processing huge data sets and, therefore, require runtimes in the range of minutes to hours. Thus, being able to predict the runtime of such jobs would be useful not only to know when the job will finish, but also for scheduling purposes, to estimate mone...

Bibliographic details
Published in:	Distributed and parallel databases : an international journal 2020-12, Vol.38 (4), p.819-839
Main authors:	Al-Sayeh, Hani; Hagedorn, Stefan; Sattler, Kai-Uwe
Format:	Article
Language:	English
Subjects:	Clusters; Computer Science; Configurations; Data Structures; Database Management; Information Systems Applications (incl. Internet); Mathematical models; Memory Structures; Methodology; Modelling; Operating Systems; Parameters; Special Issue on Self-Managing and Hardware-Optimized Database Systems
Online access:	Full text