StePS: A Multi-GPU Cosmological N-body Code for Compactified Simulations. (arXiv:1811.05903v1 [astro-ph.CO])
<a href="http://arxiv.org/find/astro-ph/1/au:+Racz_G/0/1/0/all/0/1">G&#xe1;bor R&#xe1;cz</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Szapudi_I/0/1/0/all/0/1">Istv&#xe1;n Szapudi</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Dobos_L/0/1/0/all/0/1">L&#xe1;szl&#xf3; Dobos</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Csabai_I/0/1/0/all/0/1">Istv&#xe1;n Csabai</a>, <a href="http://arxiv.org/find/astro-ph/1/au:+Szalay_A/0/1/0/all/0/1">Alexander S. Szalay</a>

We present the multi-GPU realization of the StePS (Stereographically
Projected Cosmological Simulations) algorithm with MPI-OpenMP-CUDA hybrid
parallelization, and show what parallelization efficiency can be reached. We
use a new zoom-in cosmological direct N-body simulation method, that can
simulate the infinite universe with unprecedented dynamic range for a given
amount of memory and, in contrast to traditional periodic simulations, its
fundamental geometry and topology match observations. By using a sperical
geometry instead of periodic boundary conditions, and gradually decreasing the
mass resolution with radius, our code is capable of running simulations with a
few gigaparsecs in diameter and with a mass resolution of $sim
10^{9}M_{odot}$ in the center in four days on three compute nodes with four
GTX 1080Ti GPUs each. The code can also be used to run extremely fast
simulations with reasonable resolution for fitting cosmological parameters.
These simulations can be used for prediction needs of large surveys. The StePS
code is publicly available for the research community.

We present the multi-GPU realization of the StePS (Stereographically
Projected Cosmological Simulations) algorithm with MPI-OpenMP-CUDA hybrid
parallelization, and show what parallelization efficiency can be reached. We
use a new zoom-in cosmological direct N-body simulation method, that can
simulate the infinite universe with unprecedented dynamic range for a given
amount of memory and, in contrast to traditional periodic simulations, its
fundamental geometry and topology match observations. By using a sperical
geometry instead of periodic boundary conditions, and gradually decreasing the
mass resolution with radius, our code is capable of running simulations with a
few gigaparsecs in diameter and with a mass resolution of $sim
10^{9}M_{odot}$ in the center in four days on three compute nodes with four
GTX 1080Ti GPUs each. The code can also be used to run extremely fast
simulations with reasonable resolution for fitting cosmological parameters.
These simulations can be used for prediction needs of large surveys. The StePS
code is publicly available for the research community.

http://arxiv.org/icons/sfx.gif