Performance of parallel FDTD method for shared- and distributed-memory architectures: Application to bioelectromagnetics

Bibliographic details
Published in:	PLoS ONE 2020-09, Vol.15 (9), p.e0238115
Main authors:	Ruiz-Cabello N., Miguel; Abaļenkovs, Maksims; Diaz Angulo, Luis M; Cobos Sanchez, Clemente; Moglie, Franco; Garcia, Salvador G
Format:	Article
Language:	English
Subjects:	Algorithms; Bandwidths; Biology and Life Sciences; Computer and Information Sciences; Computer applications; Computer architecture; Computer memory; Computer simulation; Design and construction; Distributed memory; Electric properties; Electromagnetic fields; Electromagnetism; Engineering and Technology; Finite difference time domain method; Funding; Magnetic fields; Magnetic properties; Measurement; Methods; Optimization; Parallel processing; Physical Sciences; Physics; Research and Analysis Methods; Reverberation chambers; Time domain analysis; Vector processing (computers)
Online access:	Full text

Description
Summary:	This work provides an in-depth computational performance study of the parallel finite-difference time-domain (FDTD) method. The parallelization is done at several levels, including the shared-memory (OpenMP) and distributed-memory (MPI) paradigms and vectorization, on three different architectures: Intel's Knights Landing, Skylake and ARM's Cavium ThunderX2. The study helps to demonstrate, in a systematic manner, the well-established claim within the computational electromagnetics community that memory bandwidth is the main factor limiting FDTD performance in realistic problems. Consequently, a memory bandwidth threshold can be assessed, depending on the problem size, in order to attain optimal performance. Finally, the results of this study have been used to optimize the workload balancing of the simulation of a bioelectromagnetic problem consisting of the exposure of a human model to a reverberation chamber-like environment.
ISSN:	1932-6203
DOI:	10.1371/journal.pone.0238115
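
Illustrative note: the summary's central claim, that memory bandwidth limits FDTD performance, can be made concrete with a minimal sketch. The code below is not taken from the article. It assumes a one-dimensional Yee grid in normalized units with NumPy, and the names `fdtd_1d` and `bandwidth_bound_updates_per_s` are hypothetical. Each update performs only a few arithmetic operations per value loaded from memory, which is why a bandwidth-based upper bound on throughput is a natural first estimate.

```python
import numpy as np

def fdtd_1d(n_cells, n_steps, courant=0.5):
    """Leapfrog update of a 1D Yee grid in normalized units (illustrative only)."""
    ez = np.zeros(n_cells)        # electric field at integer nodes
    hy = np.zeros(n_cells - 1)    # magnetic field at half-integer nodes
    ez[n_cells // 2] = 1.0        # initial impulse, a hypothetical excitation
    for _ in range(n_steps):
        # H update: one difference of two neighbouring E values per cell
        hy += courant * (ez[1:] - ez[:-1])
        # E update on interior nodes; the end nodes stay at zero (perfect conductor)
        ez[1:-1] += courant * (hy[1:] - hy[:-1])
    return ez, hy

def bandwidth_bound_updates_per_s(bandwidth_bytes_per_s, bytes_per_cell_update):
    """Upper bound on cell updates per second if memory traffic were the only limit."""
    return bandwidth_bytes_per_s / bytes_per_cell_update

# Assumed figures for illustration, not measurements from the article:
# 100 GB/s of sustained bandwidth and 50 bytes moved per cell update.
bound = bandwidth_bound_updates_per_s(100e9, 50)
```

With these assumed figures the bound is 2e9 cell updates per second. Real traffic per cell depends on the precision, the number of field components and the cache behaviour, so the byte count has to be measured or modelled for each architecture.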