A scheduling policy to save 10% of communication time in parallel fast Fourier transform

Summary The fast Fourier transform (FFT) has applications in almost every frequency related study, for example, in image and signal processing, and radio astronomy. It is also used as a Poisson operator inversion kernel in partial differential equations in fluid flows, in density functional theory,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Concurrency and computation 2023-07, Vol.35 (15), p.n/a
Hauptverfasser: Aseeri, Samar A., Gopal Chatterjee, Anando, Verma, Mahendra K., Keyes, David E.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Summary The fast Fourier transform (FFT) has applications in almost every frequency related study, for example, in image and signal processing, and radio astronomy. It is also used as a Poisson operator inversion kernel in partial differential equations in fluid flows, in density functional theory, many‐body theory, and others. The three‐dimensional N3 FFT has large time complexity O(N3log2N). Hence, parallelization is used to compute such FFTs. Popular libraries perform slab division or pencil decomposition of N3 data. None of the existing libraries achieve perfect inverse scaling of time with (T−1≈n) cores because FFT requires all‐to‐all communication and clusters hitherto do not have physical all‐to‐all connections. Dragonfly, one of the popular topologies for the interconnect, supports hierarchical connections among the components. We show that if we align the all‐to‐all communication of FFT with the physical connections of Dragonfly topology we will achieve a better scaling and reduce communication time.
ISSN:1532-0626
1532-0634
DOI:10.1002/cpe.6508