JUWELS Booster -- A Supercomputer for Large-Scale AI Research
Stefan Kesselheim
A. Herten
K. Krajsek
J. Ebert
J. Jitsev
Mehdi Cherti
M. Langguth
Bing Gong
S. Stadtler
A. Mozaffari
Gabriele Cavallaro
Rocco Sedona
A. Schug
A. Strube
Roshni Kamath
Martin G. Schultz
M. Riedel
T. Lippert

Abstract
In this article, we present JUWELS Booster, a recently commissioned high-performance computing system at the J\"ulich Supercomputing Center. With its system architecture, most importantly its large number of powerful Graphics Processing Units (GPUs) and its fast interconnect via InfiniBand, it is an ideal machine for large-scale Artificial Intelligence (AI) research and applications. We detail its system architecture, parallel, distributed model training, and benchmarks indicating its outstanding performance. We exemplify its potential for research application by presenting large-scale AI research highlights from various scientific fields that require such a facility.
View on arXivComments on this paper