Portable and architecture independent parallel performance tuning using a call-graph profiling tool

The paper describes a post mortem call graph profiling tool that analyses trace information generated during the execution of BSPlib programs. The purpose of the tool is to expose imbalance in either computation or communication, and to highlight portions of code that are amenable to improvement. Un...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Hill, J.M.D., Jarvis, S.A., Siniolakis, C.J., Vasilev, V.P.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The paper describes a post mortem call graph profiling tool that analyses trace information generated during the execution of BSPlib programs. The purpose of the tool is to expose imbalance in either computation or communication, and to highlight portions of code that are amenable to improvement. Unlike other profiling tools, the profile information guides optimisation in an architecture independent way. From an ease of use perspective, the amount of information displayed when visualising a profile for a parallel program is no more complex than that of a sequential program.
DOI:10.1109/EMPDP.1998.647211