Enabling OpenMP Task Parallelism on Multi-FPGAs
FPGA-based hardware accelerators have received increasing attention mainly due to their ability to accelerate deep pipelined applications, thus resulting in higher computational performance and energy efficiency. Nevertheless, the amount of resources available on even the most powerful FPGA is still...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | FPGA-based hardware accelerators have received increasing attention mainly
due to their ability to accelerate deep pipelined applications, thus resulting
in higher computational performance and energy efficiency. Nevertheless, the
amount of resources available on even the most powerful FPGA is still not
enough to speed up very large modern workloads. To achieve that, FPGAs need to
be interconnected in a Multi-FPGA architecture capable of accelerating a single
application. However, programming such architecture is a challenging endeavor
that still requires additional research. This paper extends the OpenMP
task-based computation offloading model to enable a number of FPGAs to work
together as a single Multi-FPGA architecture. Experimental results for a set of
OpenMP stencil applications running on a Multi-FPGA platform consisting of 6
Xilinx VC709 boards interconnected through fiber-optic links have shown close
to linear speedups as the number of FPGAs and IP-cores per FPGA increase. |
---|---|
DOI: | 10.48550/arxiv.2103.10573 |