Accurate prediction of the behavior of multithreaded applications in shared caches
► We model the cache behavior of shared caches for multithreaded applications. ► Support for configurable block cyclic distribution of iterations among the threads. ► The predictions are precise and they are obtained very fast. ► The model can help a compiler to choose the best parallelization strat...
Gespeichert in:
Veröffentlicht in: | Parallel computing 2013-01, Vol.39 (1), p.36-57 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | ► We model the cache behavior of shared caches for multithreaded applications. ► Support for configurable block cyclic distribution of iterations among the threads. ► The predictions are precise and they are obtained very fast. ► The model can help a compiler to choose the best parallelization strategy.
Multicores are the norm nowadays and in many of them there are cores that share one or several levels of cache. The theoretical performance gain expected when several cores cooperate in the parallel execution of an application can be reduced in some cases by a cache access bottleneck, as the data accessed by them can interfere in the shared cache levels. In other cases the performance gain can be increased due to a greater reuse of the data loaded in the cache. This paper presents an analytical model that can predict the behavior of shared caches when executing applications parallelized at loop level. To the best of our knowledge, this is the first analytical model that tackles the behavior of multithreaded applications on realistic shared caches without requiring profiling. The experimental results show that the model predictions are precise and very fast and that the model can help a compiler or programmer choose the best parallelization strategy. |
---|---|
ISSN: | 0167-8191 1872-7336 |
DOI: | 10.1016/j.parco.2012.11.003 |