PyPOD-GP: Using PyTorch for Accelerated Chip-Level Thermal Simulation of the GPU
The rising demand for high-performance computing (HPC) has made full-chip dynamic thermal simulation in many-core GPUs critical for optimizing performance and extending device lifespans. Proper orthogonal decomposition (POD) with Galerkin projection (GP) has shown to offer high accuracy and massive...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The rising demand for high-performance computing (HPC) has made full-chip
dynamic thermal simulation in many-core GPUs critical for optimizing
performance and extending device lifespans. Proper orthogonal decomposition
(POD) with Galerkin projection (GP) has shown to offer high accuracy and
massive runtime improvements over direct numerical simulation (DNS). However,
previous implementations of POD-GP use MPI-based libraries like PETSc and
FEniCS and face significant runtime bottlenecks. We propose a
$\textbf{Py}$Torch-based $\textbf{POD-GP}$ library (PyPOD-GP), a GPU-optimized
library for chip-level thermal simulation. PyPOD-GP achieves over $23.4\times$
speedup in training and over $10\times$ speedup in inference on a GPU with over
13,000 cores, with just $1.2\%$ error over the device layer. |
---|---|
DOI: | 10.48550/arxiv.2412.06041 |