Demystifying Memory Access Patterns of FPGA-Based Graph Processing Accelerators
Recent advances in reprogrammable hardware (e.g., FPGAs) and memory technology (e.g., DDR4, HBM) promise to solve performance problems inherent to graph processing like irregular memory access patterns on traditional hardware (e.g., CPU). While several of these graph accelerators were proposed in re...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Recent advances in reprogrammable hardware (e.g., FPGAs) and memory
technology (e.g., DDR4, HBM) promise to solve performance problems inherent to
graph processing like irregular memory access patterns on traditional hardware
(e.g., CPU). While several of these graph accelerators were proposed in recent
years, it remains difficult to assess their performance and compare them on
common graph workloads and accelerator platforms, due to few open source
implementations and excessive implementation effort.
In this work, we build on a simulation environment for graph processing
accelerators, to make several existing accelerator approaches comparable. This
allows us to study relevant performance dimensions such as partitioning schemes
and memory technology, among others. The evaluation yields insights into the
strengths and weaknesses of current graph processing accelerators along these
dimensions, and features a novel in-depth comparison. |
---|---|
DOI: | 10.48550/arxiv.2104.07776 |