A Graph-based Model for GPU Caching Problems
Modeling data sharing in GPU programs is a challenging task because of the massive parallelism and complex data sharing patterns provided by GPU architectures. Better GPU caching efficiency can be achieved through careful task scheduling among different threads. Traditionally, in the field of parall...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Modeling data sharing in GPU programs is a challenging task because of the
massive parallelism and complex data sharing patterns provided by GPU
architectures. Better GPU caching efficiency can be achieved through careful
task scheduling among different threads. Traditionally, in the field of
parallel computing, graph partition models are used to model data communication
and guide task scheduling. However, we discover that the previous methods are
either inaccurate or expensive when applied to GPU programs. In this paper, we
propose a novel task partition model that is accurate and gives rise to the
development of fast and high quality task/data reorganization algorithms. We
demonstrate the effectiveness of the proposed model by rigorous theoretical
analysis of the algorithm bounds and extensive experimental analysis. The
experimental results show that it achieves significant performance improvement
across a representative set of GPU applications. |
---|---|
DOI: | 10.48550/arxiv.1605.02043 |