Simulation of reaction diffusion processes over biologically relevant size and time scales using multi-GPU workstations

•Larger and longer simulations of biological RDMEs with multiple GPUs.•Spatial decomposition allows for simulation of large cellular volumes.•Multi-GPU performance allows an increase in particle counts and reactions.•Load balancing optimizes for heterogeneity in lattice sites and GPU hardware. Simul...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Parallel computing 2014-05, Vol.40 (5-6), p.86-99
Hauptverfasser: Hallock, Michael J., Stone, John E., Roberts, Elijah, Fry, Corey, Luthey-Schulten, Zaida
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Larger and longer simulations of biological RDMEs with multiple GPUs.•Spatial decomposition allows for simulation of large cellular volumes.•Multi-GPU performance allows an increase in particle counts and reactions.•Load balancing optimizes for heterogeneity in lattice sites and GPU hardware. Simulation of in vivo cellular processes with the reaction–diffusion master equation (RDME) is a computationally expensive task. Our previous software enabled simulation of inhomogeneous biochemical systems for small bacteria over long time scales using the MPD-RDME method on a single GPU. Simulations of larger eukaryotic systems exceed the on-board memory capacity of individual GPUs, and long time simulations of modest-sized cells such as yeast are impractical on a single GPU. We present a new multi-GPU parallel implementation of the MPD-RDME method based on a spatial decomposition approach that supports dynamic load balancing for workstations containing GPUs of varying performance and memory capacity. We take advantage of high-performance features of CUDA for peer-to-peer GPU memory transfers and evaluate the performance of our algorithms on state-of-the-art GPU devices. We present parallel efficiency and performance results for simulations using multiple GPUs as system size, particle counts, and number of reactions grow. We also demonstrate multi-GPU performance in simulations of the Min protein system in E. coli. Moreover, our multi-GPU decomposition and load balancing approach can be generalized to other lattice-based problems.
ISSN:0167-8191
1872-7336
DOI:10.1016/j.parco.2014.03.009