SPARTA: Spatial Acceleration for Efficient and Scalable Horizontal Diffusion Weather Stencil Computation
Fast and accurate climate simulations and weather predictions are critical for understanding and preparing for the impact of climate change. Real-world weather and climate modeling consist of complex compound stencil kernels that do not perform well on conventional architectures. Horizontal diffusio...
Gespeichert in:
Hauptverfasser: | , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Fast and accurate climate simulations and weather predictions are critical
for understanding and preparing for the impact of climate change. Real-world
weather and climate modeling consist of complex compound stencil kernels that
do not perform well on conventional architectures. Horizontal diffusion is one
such important compound stencil found in many climate and weather prediction
models. Recent works propose using FPGAs as an alternative to traditional CPU
and GPU-based systems to accelerate compound stencil kernels. However, we
observe that compound stencil computations cannot leverage the bit-level
flexibility available on an FPGA because of its complex memory access patterns,
leading to high hardware resource utilization and low peak performance. We
introduce SPARTA, a novel spatial accelerator for horizontal diffusion weather
stencil computation. We exploit the two-dimensional spatial architecture to
efficiently accelerate horizontal diffusion stencil by designing the first
scaled-out spatial accelerator using MLIR (Multi-Level Intermediate
Representation) compiler framework. We evaluate its performance on a real
cutting-edge AMD-Xilinx Versal AI Engine spatial architecture. Our real-system
evaluation results demonstrate that SPARTA outperforms the state-of-the-art
CPU, GPU, and FPGA implementations by 17.1x, 1.2x, and 2.1x, respectively. Our
results reveal that balancing workload across the available processing
resources is crucial in achieving high performance on spatial architectures. We
also implement and evaluate five elementary stencils that are commonly used as
benchmarks for stencil computation research. We freely open-source all our
implementations to aid future research in stencil computation and spatial
computing systems at https://github.com/CMU-SAFARI/SPARTA. |
---|---|
DOI: | 10.48550/arxiv.2303.03509 |