Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1608.06581
Cited By
Fathom: Reference Workloads for Modern Deep Learning Methods
IEEE International Symposium on Workload Characterization (IISWC), 2016
23 August 2016
Robert Adolf
Saketh Rama
Brandon Reagen
Gu-Yeon Wei
David Brooks
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Fathom: Reference Workloads for Modern Deep Learning Methods"
50 / 60 papers shown
An MLCommons Scientific Benchmarks Ontology
B. Hawks
G. V. Laszewski
Matthew D. Sinclair
Marco Colombo
Shivaram Venkataraman
Rutwik Jain
Yiwei Jiang
Nhan Tran
Geoffrey C. Fox
96
1
0
06 Nov 2025
On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers
Zhengxian Lu
Fangyu Wang
Zhiwei Xu
Fei Yang
Tao Li
221
4
0
02 Jul 2024
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
Noah Lewis
J. L. Bez
Suren Byna
495
4
0
16 Apr 2024
GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System
Yidong Gong
Pradeep Kumar
GNN
197
4
0
05 Apr 2024
Beyond Inference: Performance Analysis of DNN Server Overheads for Computer Vision
Ahmed F. AbouElhamayed
Susanne Balle
Deshanand Singh
Mohamed S. Abdelfattah
3DH
109
0
0
02 Mar 2024
TorchBench: Benchmarking PyTorch with High API Surface Coverage
Yueming Hao
Xu Zhao
Bin Bao
David Berard
William Constable
Adnan Aziz
Xu Liu
342
12
0
27 Apr 2023
Hierarchical Training of Deep Neural Networks Using Early Exiting
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yamin Sepehri
P. Pad
A. C. Yüzügüler
P. Frossard
L. A. Dunbar
243
11
0
04 Mar 2023
Experimenting with Emerging RISC-V Systems for Decentralised Machine Learning
ACM International Conference on Computing Frontiers (CF), 2023
Gianluca Mittone
Nicolò Tonci
Robert Birke
Iacopo Colonnelli
Doriana Medić
...
Francesco Beneventi
Mirko Polato
Massimo Torquati
Luca Benini
Marco Aldinucci
202
16
0
15 Feb 2023
Computation vs. Communication Scaling for Future Transformers on Future Hardware
Suchita Pati
Shaizeen Aga
Mahzabeen Islam
Nuwan Jayasena
Matthew D. Sinclair
267
14
0
06 Feb 2023
MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs
Huaizheng Zhang
Yuanming Li
Wencong Xiao
Yizheng Huang
Xing Di
Jianxiong Yin
Simon See
Yong Luo
C. Lau
Yang You
VLM
161
6
0
01 Jan 2023
Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks
International Symposium on Computer Architecture (ISCA), 2022
Mingyu Liang
Wenyin Fu
Louis Feng
Zhongyi Lin
P. Panakanti
Shengbao Zheng
Srinivas Sridharan
Christina Delimitrou
189
16
0
16 Dec 2022
An Overview of the Data-Loader Landscape: Comparative Performance Analysis
BigData Congress [Services Society] (BSS), 2022
Iason Ofeidis
Diego Kiedanski
Leandros Tassiulas
191
7
0
27 Sep 2022
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud
IEEE Micro (IEEE Micro), 2022
Geraldo F. Oliveira
Juan Gómez Luna
Saugata Ghose
Amirali Boroumand
O. Mutlu
221
31
0
19 Sep 2022
Snowmass 2021 Computational Frontier CompF4 Topical Group Report: Storage and Processing Resource Access
W. Bhimij
D. Carder
Eli Dart
Javier M. Duarte
I. Fisk
...
N. Tran
P. Gemmeren
G. Watts
B. Weaver
F. Würthwein
228
0
0
19 Sep 2022
Metadata Representations for Queryable ML Model Zoos
Ziyu Li
Rihan Hai
A. Bozzon
Asterios Katsifodimos
60
4
0
19 Jul 2022
FastML Science Benchmarks: Accelerating Real-Time Scientific Edge Machine Learning
Javier Mauricio Duarte
Nhan Tran
B. Hawks
C. Herwig
J. Muhizi
Yasmine Omri
Vijay Janapa Reddi
281
19
0
16 Jul 2022
Benchmarking of DL Libraries and Models on Mobile Devices
The Web Conference (WWW), 2022
Qiyang Zhang
Xiang Li
Xiangying Che
Xiao Ma
Ao Zhou
Mengwei Xu
Shangguang Wang
Xuhui Liu
Xuanzhe Liu
205
56
0
14 Feb 2022
Benchmarking Resource Usage for Efficient Distributed Deep Learning
IEEE Conference on High Performance Extreme Computing (HPEC), 2022
Nathan C. Frey
Baolin Li
Joseph McDonald
Dan Zhao
Michael Jones
David Bestor
Devesh Tiwari
V. Gadepally
S. Samsi
187
13
0
28 Jan 2022
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems
Jordan Dudley
M. Emani
J. Balma
L. Drescher
Aleksandr Drozd
...
Akihiro Tabuchi
V. Vishwanath
Mohamed Wahib
Masafumi Yamazaki
Junqi Yin
VLM
240
50
0
21 Oct 2021
Demystifying BERT: Implications for Accelerator Design
Suchita Pati
Shaizeen Aga
Nuwan Jayasena
Matthew D. Sinclair
LLMAG
197
16
0
14 Apr 2021
A Case for 3D Integrated System Design for Neuromorphic Computing & AI Applications
Social Science Research Network (SSRN), 2021
Eren Kurshan
Xue Yang
Mingoo Seok
Yuan Xie
163
3
0
02 Mar 2021
A Runtime-Based Computational Performance Predictor for Deep Neural Network Training
USENIX Annual Technical Conference (USENIX ATC), 2021
Geoffrey X. Yu
Yubo Gao
P. Golikov
Gennady Pekhimenko
3DH
162
83
0
31 Jan 2021
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System
Huaizheng Zhang
Yizheng Huang
Yonggang Wen
Jianxiong Yin
K. Guan
177
3
0
04 Nov 2020
Cross-Stack Workload Characterization of Deep Recommendation Systems
IEEE International Symposium on Workload Characterization (IISWC), 2020
Samuel Hsia
Udit Gupta
Mark Wilkening
Carole-Jean Wu
Gu-Yeon Wei
David Brooks
BDL
GNN
HAI
239
38
0
10 Oct 2020
Bosch Deep Learning Hardware Benchmark
Armin Runge
Thomas Wenzel
Dimitrios Bariamis
B. Staffler
Lucas Drumond
Michael Pfeiffer
117
0
0
24 Aug 2020
AIPerf: Automated machine learning as an AI-HPC benchmark
Zhixiang Ren
Yongheng Liu
Tianhui Shi
Lei Xie
Yue Zhou
Jidong Zhai
Youhui Zhang
Yunquan Zhang
Wenguang Chen
319
29
0
17 Aug 2020
SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks
Suchita Pati
Shaizeen Aga
Matthew D. Sinclair
Nuwan Jayasena
134
10
0
20 Jul 2020
Comparison and Benchmarking of AI Models and Frameworks on Mobile Devices
Chunjie Luo
Xiwen He
Jianfeng Zhan
Lei Wang
Wanling Gao
Jiahui Dai
ELM
190
68
0
07 May 2020
AIBench Scenario: Scenario-distilling AI Benchmarking
International Conference on Parallel Architectures and Compilation Techniques (PACT), 2020
Wanling Gao
Fei Tang
Jianfeng Zhan
Xu Wen
Lei Wang
Zheng Cao
Chuanxin Lan
Chunjie Luo
Xiaoli Liu
Zihan Jiang
258
14
0
06 May 2020
AIBench Training: Balanced Industry-Standard AI Training Benchmarking
Fei Tang
Wanling Gao
Jianfeng Zhan
Chuanxin Lan
Xu Wen
...
Yatao Li
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
187
3
0
30 Apr 2020
Energy Predictive Models for Convolutional Neural Networks on Mobile Platforms
Crefeda Faviola Rodrigues
Graham D. Riley
M. Luján
HAI
104
4
0
10 Apr 2020
AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite
Wanling Gao
Fei Tang
Jianfeng Zhan
Chuanxin Lan
Chunjie Luo
...
Gang Lu
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
149
1
0
17 Feb 2020
DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference
International Symposium on Computer Architecture (ISCA), 2020
Udit Gupta
Samuel Hsia
V. Saraph
Xiaodong Wang
Brandon Reagen
Gu-Yeon Wei
Hsien-Hsin S. Lee
David Brooks
Carole-Jean Wu
GNN
218
200
0
08 Jan 2020
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing
International Symposium on Computer Architecture (ISCA), 2019
Liu Ke
Udit Gupta
Carole-Jean Wu
B. Cho
Mark Hempstead
...
Dheevatsa Mudigere
Maxim Naumov
Martin D. Schatz
M. Smelyanskiy
Xiaodong Wang
189
252
0
30 Dec 2019
DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs (Extended)
International Conference on Performance Engineering (ICPE), 2019
Cheng-rong Li
Abdul Dakkak
Jinjun Xiong
Wen-mei W. Hwu
222
0
0
18 Nov 2019
MLPerf Inference Benchmark
International Symposium on Computer Architecture (ISCA), 2019
Vijayarāghava Reḍḍī
C. Cheng
David Kanter
Pete H Mattson
Guenther Schmuelling
...
Bing Yu
George Y. Yuan
Aaron Zhong
P. Zhang
Yuchen Zhou
314
593
0
06 Nov 2019
On-Device Machine Learning: An Algorithms and Learning Theory Perspective
Sauptik Dhar
Junyao Guo
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
454
171
0
02 Nov 2019
Characterizing Deep Learning Training Workloads on Alibaba-PAI
IEEE International Symposium on Workload Characterization (IISWC), 2019
Mengdi Wang
Chen Meng
Guoping Long
Chuan Wu
Jun Yang
Jialin Li
Yangqing Jia
147
62
0
14 Oct 2019
MLPerf Training Benchmark
Conference on Machine Learning and Systems (MLSys), 2019
Arya D. McCarthy
Christine Cheng
Cody Coleman
Greg Diamos
Paulius Micikevicius
...
Carole-Jean Wu
Lingjie Xu
Masafumi Yamazaki
C. Young
Matei A. Zaharia
522
348
0
02 Oct 2019
AI Matrix: A Deep Learning Benchmark for Alibaba Data Centers
Wei Zhang
Wei Wei
Lingjie Xu
Lingling Jin
Cheng Li
ELM
99
23
0
23 Sep 2019
Demystifying the MLPerf Benchmark Suite
Snehil Verma
Qinzhe Wu
Bagus Hanindhito
Gunjan Jha
E. John
R. Radhakrishnan
L. John
VLM
119
8
0
24 Aug 2019
XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs
Cheng-rong Li
Abdul Dakkak
Jinjun Xiong
Wei Wei
Lingjie Xu
Wen-mei W. Hwu
133
16
0
19 Aug 2019
AIBench: An Industry Standard Internet Service AI Benchmark Suite
Wanling Gao
Fei Tang
Lei Wang
Jianfeng Zhan
Chunxin Lan
...
Yatao Li
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
166
48
0
13 Aug 2019
HPC AI500: A Benchmark Suite for HPC AI Systems
BenchCouncil International Symposium (ISB), 2018
Zihan Jiang
Wanling Gao
Lei Wang
Xingwang Xiong
Yuchen Zhang
...
Yunquan Zhang
Shengzhong Feng
KenLi Li
Weijia Xu
Jianfeng Zhan
ELM
173
42
0
27 Jul 2019
A Workload and Programming Ease Driven Perspective of Processing-in-Memory
Saugata Ghose
Amirali Boroumand
Jeremie S. Kim
Juan Gómez Luna
O. Mutlu
109
11
0
26 Jul 2019
Benchmarking TPU, GPU, and CPU Platforms for Deep Learning
Y. Wang
Gu-Yeon Wei
David Brooks
ELM
VLM
292
305
0
24 Jul 2019
Performance Analysis and Characterization of Training Deep Learning Models on Mobile Devices
Jie Liu
Jiawen Liu
Wan Du
Dong Li
HAI
155
5
0
10 Jun 2019
The Architectural Implications of Facebook's DNN-based Personalized Recommendation
International Symposium on High-Performance Computer Architecture (HPCA), 2019
Udit Gupta
Carole-Jean Wu
Xiaodong Wang
Maxim Naumov
Brandon Reagen
...
Andrey Malevich
Dheevatsa Mudigere
M. Smelyanskiy
Liang Xiong
Xuan Zhang
GNN
331
316
0
06 Jun 2019
Performance Analysis of Deep Learning Workloads on Leading-edge Systems
International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), 2019
Zhongjing Jiang
Shinjae Yoo
A. Hoisie
ELM
103
23
0
21 May 2019
DeepOBS: A Deep Learning Optimizer Benchmark Suite
Frank Schneider
Lukas Balles
Philipp Hennig
ODL
367
76
0
13 Mar 2019
1
2
Next
Page 1 of 2