1901.10183
Cited By
A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning
29 January 2019
Tal Ben-Nun
Maciej Besta
Simon Huber
A. Ziogas
D. Peter
Torsten Hoefler
ELM
ALM
Papers citing
"A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning"
17 / 17 papers shown
Title
Low-Depth Spatial Tree Algorithms
Yves Baumann
Tal Ben-Nun
Maciej Besta
Lukas Gianinazzi
Torsten Hoefler
Piotr Luczynski
42
0
0
19 Apr 2024
Parallel and Distributed Graph Neural Networks: An In-Depth Concurrency Analysis
Maciej Besta
Torsten Hoefler
GNN
34
56
0
19 May 2022
The spatial computer: A model for energy-efficient parallel computation
Lukas Gianinazzi
Tal Ben-Nun
Maciej Besta
Saleh Ashkboos
Yves Baumann
Piotr Luczynski
Torsten Hoefler
18
5
0
10 May 2022
Scientific Machine Learning Benchmarks
Jeyan Thiyagalingam
Mallikarjun Shankar
Geoffrey C. Fox
Tony (Anthony) John Grenville Hey
14
108
0
25 Oct 2021
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems
S. Farrell
M. Emani
J. Balma
L. Drescher
Aleksandr Drozd
...
Akihiro Tabuchi
V. Vishwanath
M. Wahib
Masafumi Yamazaki
Junqi Yin
VLM
32
35
0
21 Oct 2021
Quantifying and Improving Performance of Distributed Deep Learning with Cloud Storage
Nicholas Krichevsky
M. S. Louis
Tian Guo
25
9
0
13 Aug 2021
Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs
G. Fursin
14
7
0
02 Nov 2020
High-Performance Parallel Graph Coloring with Strong Guarantees on Work, Depth, and Quality
Maciej Besta
Armon Carigiet
Zur Vonarburg-Shmaria
Kacper Janda
Lukas Gianinazzi
Torsten Hoefler
20
26
0
26 Aug 2020
AIPerf: Automated machine learning as an AI-HPC benchmark
Zhixiang Ren
Yongheng Liu
Tianhui Shi
Lei Xie
Yue Zhou
Jidong Zhai
Youhui Zhang
Yunquan Zhang
Wenguang Chen
19
22
0
17 Aug 2020
The Collective Knowledge project: making ML models more portable and reproducible with open APIs, reusable best practices and MLOps
G. Fursin
VLM
9
11
0
12 Jun 2020
Practice of Streaming Processing of Dynamic Graphs: Concepts, Models, and Systems
Maciej Besta
Marc Fischer
Vasiliki Kalavri
Michael Kapralov
Torsten Hoefler
GNN
26
55
0
29 Dec 2019
Communication-Efficient Jaccard Similarity for High-Performance Distributed Genome Comparisons
Maciej Besta
Raghavendra Kanakagiri
Harun Mustafa
Mikhail Karasikov
Gunnar Rätsch
Torsten Hoefler
Edgar Solomonik
18
65
0
11 Nov 2019
Machine Learning and Big Scientific Data
Tony (Anthony) John Grenville Hey
K. Butler
Sam Jackson
Jeyarajan Thiyagalingam
AI4CE
28
74
0
12 Oct 2019
MLPerf Training Benchmark
Peter Mattson
Christine Cheng
Cody Coleman
Greg Diamos
Paulius Micikevicius
...
Carole-Jean Wu
Lingjie Xu
Masafumi Yamazaki
C. Young
Matei A. Zaharia
31
305
0
02 Oct 2019
XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs
Cheng Li
Abdul Dakkak
Jinjun Xiong
Wei Wei
Lingjie Xu
Wen-mei W. Hwu
11
16
0
19 Aug 2019
Graph Processing on FPGAs: Taxonomy, Survey, Challenges
Maciej Besta
Dimitri Stanojevic
Johannes de Fine Licht
Tal Ben-Nun
Torsten Hoefler
GNN
AI4CE
27
52
0
25 Feb 2019
TensorLayer: A Versatile Library for Efficient Deep Learning Development
Hao Dong
A. Supratak
Luo Mai
Fangde Liu
A. Oehmichen
Simiao Yu
Yike Guo
53
114
0
26 Jul 2017