Understanding Data Movement in Tightly Coupled Heterogeneous Systems: A Case Study with the Grace Hopper Superchip
Heterogeneous supercomputers have become the standard in HPC. GPUs in particular have dominated the accelerator landscape, offering unprecedented performance in parallel workloads and unlocking new possibilities in fields like AI and climate modeling. With many workloads becoming memory-bound, impro...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Heterogeneous supercomputers have become the standard in HPC. GPUs in
particular have dominated the accelerator landscape, offering unprecedented
performance in parallel workloads and unlocking new possibilities in fields
like AI and climate modeling. With many workloads becoming memory-bound,
improving the communication latency and bandwidth within the system has become
a main driver in the development of new architectures. The Grace Hopper
Superchip (GH200) is a significant step in the direction of tightly coupled
heterogeneous systems, in which all CPUs and GPUs share a unified address space
and support transparent fine grained access to all main memory on the system.
We characterize both intra- and inter-node memory operations on the Quad GH200
nodes of the new Swiss National Supercomputing Centre Alps supercomputer, and
show the importance of careful memory placement on example workloads,
highlighting tradeoffs and opportunities. |
---|---|
DOI: | 10.48550/arxiv.2408.11556 |