Tuning MPI Runtime Parameter Setting for High Performance Computing

The performance of MPI applications on parallel computers can be considerably improved by tuning the runtime parameters provided by modern MPI libraries. However, due to the large and increasing number of tunable parameters, finding a parameter setting which optimizes the execution of several user p...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Pellegrini, S., Prodan, R., Fahringer, T.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The performance of MPI applications on parallel computers can be considerably improved by tuning the runtime parameters provided by modern MPI libraries. However, due to the large and increasing number of tunable parameters, finding a parameter setting which optimizes the execution of several user programs on a chosen target machine is challenging. Existing tools execute input programs multiple times with varying parameter settings until a satisfying performance is reached. Several hundred runs of the input programs are nevertheless needed making this approach appealing only when the cost of the tuning phase can be amortized over many runs of the optimized programs. In this paper, we introduce a novel technique for tuning MPI runtime parameter values to better suit the underlying system architecture. The MPI parameter values are determined by performing the analysis of variance (ANOVA) on experimental data collected by randomly exploring the optimization space of a set of computational kernels commonly employed in High Performance Computing (HPC). We use our new technique to derive optimized values for 27 runtime parameters of the Open MPI library for two different parallel architectures. Results show an average performance improvement up to 20% for codes from the SPEC MPI 2007 benchmark suite with respect to Open MPI's default parameter setting.
DOI:10.1109/ClusterW.2012.15