ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.06905
  4. Cited By
TBD: Benchmarking and Analyzing Deep Neural Network Training

TBD: Benchmarking and Analyzing Deep Neural Network Training

16 March 2018
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
ArXivPDFHTML

Papers citing "TBD: Benchmarking and Analyzing Deep Neural Network Training"

32 / 32 papers shown
Title
The Framework Tax: Disparities Between Inference Efficiency in NLP
  Research and Deployment
The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment
Jared Fernandez
Jacob Kahn
Clara Na
Yonatan Bisk
Emma Strubell
FedML
25
10
0
13 Feb 2023
Computation vs. Communication Scaling for Future Transformers on Future
  Hardware
Computation vs. Communication Scaling for Future Transformers on Future Hardware
Suchita Pati
Shaizeen Aga
Mahzabeen Islam
Nuwan Jayasena
Matthew D. Sinclair
20
9
0
06 Feb 2023
SAIH: A Scalable Evaluation Methodology for Understanding AI Performance
  Trend on HPC Systems
SAIH: A Scalable Evaluation Methodology for Understanding AI Performance Trend on HPC Systems
Jiangsu Du
Dongsheng Li
Yingpeng Wen
Jiazhi Jiang
Dan Huang
Xia Liao
Yutong Lu
14
0
0
07 Dec 2022
An Overview of the Data-Loader Landscape: Comparative Performance
  Analysis
An Overview of the Data-Loader Landscape: Comparative Performance Analysis
Iason Ofeidis
Diego Kiedanski
Leandros Tassiulas
13
7
0
27 Sep 2022
DataPerf: Benchmarks for Data-Centric AI Development
DataPerf: Benchmarks for Data-Centric AI Development
Mark Mazumder
Colby R. Banbury
Xiaozhe Yao
Bojan Karlavs
W. G. Rojas
...
Carole-Jean Wu
Cody Coleman
Andrew Y. Ng
Peter Mattson
Vijay Janapa Reddi
VLM
33
101
0
20 Jul 2022
SplitPlace: Intelligent Placement of Split Neural Nets in Mobile Edge
  Environments
SplitPlace: Intelligent Placement of Split Neural Nets in Mobile Edge Environments
Shreshth Tuli
8
1
0
10 Oct 2021
Demystifying BERT: Implications for Accelerator Design
Demystifying BERT: Implications for Accelerator Design
Suchita Pati
Shaizeen Aga
Nuwan Jayasena
Matthew D. Sinclair
LLMAG
22
17
0
14 Apr 2021
On the Utility of Gradient Compression in Distributed Training Systems
On the Utility of Gradient Compression in Distributed Training Systems
Saurabh Agarwal
Hongyi Wang
Shivaram Venkataraman
Dimitris Papailiopoulos
23
46
0
28 Feb 2021
Srifty: Swift and Thrifty Distributed Training on the Cloud
Srifty: Swift and Thrifty Distributed Training on the Cloud
Liangchen Luo
Peter West
Arvind Krishnamurthy
Luis Ceze
22
11
0
29 Nov 2020
How much progress have we made in neural network training? A New
  Evaluation Protocol for Benchmarking Optimizers
How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Yuanhao Xiong
Xuanqing Liu
Li-Cheng Lan
Yang You
Si Si
Cho-Jui Hsieh
OOD
13
1
0
19 Oct 2020
AIPerf: Automated machine learning as an AI-HPC benchmark
AIPerf: Automated machine learning as an AI-HPC benchmark
Zhixiang Ren
Yongheng Liu
Tianhui Shi
Lei Xie
Yue Zhou
Jidong Zhai
Youhui Zhang
Yunquan Zhang
Wenguang Chen
19
22
0
17 Aug 2020
AIBench Scenario: Scenario-distilling AI Benchmarking
AIBench Scenario: Scenario-distilling AI Benchmarking
Wanling Gao
Fei Tang
Jianfeng Zhan
Xu Wen
Lei Wang
Zheng Cao
Chuanxin Lan
Chunjie Luo
Xiaoli Liu
Zihan Jiang
21
14
0
06 May 2020
AIBench Training: Balanced Industry-Standard AI Training Benchmarking
AIBench Training: Balanced Industry-Standard AI Training Benchmarking
Fei Tang
Wanling Gao
Jianfeng Zhan
Chuanxin Lan
Xu Wen
...
Yatao Li
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
22
3
0
30 Apr 2020
AIBench: An Agile Domain-specific Benchmarking Methodology and an AI
  Benchmark Suite
AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite
Wanling Gao
Fei Tang
Jianfeng Zhan
Chuanxin Lan
Chunjie Luo
...
Gang Lu
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
17
1
0
17 Feb 2020
The Design and Implementation of a Scalable DL Benchmarking Platform
The Design and Implementation of a Scalable DL Benchmarking Platform
Cheng-rong Li
Abdul Dakkak
Jinjun Xiong
Wen-mei W. Hwu
ALM
ELM
14
4
0
19 Nov 2019
On-Device Machine Learning: An Algorithms and Learning Theory
  Perspective
On-Device Machine Learning: An Algorithms and Learning Theory Perspective
Sauptik Dhar
Junyao Guo
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
17
141
0
02 Nov 2019
Progressive Compressed Records: Taking a Byte out of Deep Learning Data
Progressive Compressed Records: Taking a Byte out of Deep Learning Data
Michael Kuchnik
George Amvrosiadis
Virginia Smith
11
9
0
01 Nov 2019
Demystifying the MLPerf Benchmark Suite
Demystifying the MLPerf Benchmark Suite
Snehil Verma
Qinzhe Wu
Bagus Hanindhito
Gunjan Jha
E. John
R. Radhakrishnan
L. John
VLM
19
8
0
24 Aug 2019
AIBench: An Industry Standard Internet Service AI Benchmark Suite
AIBench: An Industry Standard Internet Service AI Benchmark Suite
Wanling Gao
Fei Tang
Lei Wang
Jianfeng Zhan
Chunxin Lan
...
Yatao Li
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
17
45
0
13 Aug 2019
HPC AI500: A Benchmark Suite for HPC AI Systems
HPC AI500: A Benchmark Suite for HPC AI Systems
Zihan Jiang
Wanling Gao
Lei Wang
Xingwang Xiong
Yuchen Zhang
...
Yunquan Zhang
Shengzhong Feng
KenLi Li
Weijia Xu
Jianfeng Zhan
ELM
11
40
0
27 Jul 2019
Priority-based Parameter Propagation for Distributed DNN Training
Priority-based Parameter Propagation for Distributed DNN Training
Anand Jayarajan
Jinliang Wei
Garth A. Gibson
Alexandra Fedorova
Gennady Pekhimenko
AI4CE
11
178
0
10 May 2019
DeepOBS: A Deep Learning Optimizer Benchmark Suite
DeepOBS: A Deep Learning Optimizer Benchmark Suite
Frank Schneider
Lukas Balles
Philipp Hennig
ODL
20
71
0
13 Mar 2019
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep
  Learning
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning
Youngeun Kwon
Minsoo Rhu
16
56
0
18 Feb 2019
A Modular Benchmarking Infrastructure for High-Performance and
  Reproducible Deep Learning
A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning
Tal Ben-Nun
Maciej Besta
Simon Huber
A. Ziogas
D. Peter
Torsten Hoefler
ELM
ALM
12
77
0
29 Jan 2019
Tango: A Deep Neural Network Benchmark Suite for Various Accelerators
Tango: A Deep Neural Network Benchmark Suite for Various Accelerators
A. Karki
Chethan Palangotu Keshava
Spoorthi Mysore Shivakumar
Joshua Skow
Goutam Madhukeshwar Hegde
Hyeran Jeon
13
43
0
14 Jan 2019
Frustrated with Replicating Claims of a Shared Model? A Solution
Frustrated with Replicating Claims of a Shared Model? A Solution
Abdul Dakkak
Cheng-rong Li
Jinjun Xiong
Wen-mei W. Hwu
11
7
0
24 Nov 2018
A Comparative Measurement Study of Deep Learning as a Service Framework
A Comparative Measurement Study of Deep Learning as a Service Framework
Yanzhao Wu
Ling Liu
C. Pu
Wenqi Cao
Semih Sahin
Wenqi Wei
Qi Zhang
14
45
0
29 Oct 2018
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance
  Benchmark
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark
Cody Coleman
Daniel Kang
Deepak Narayanan
Luigi Nardi
Tian Zhao
Jian Zhang
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
11
117
0
04 Jun 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN
  Training
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
19
44
0
22 May 2018
Parameter Hub: a Rack-Scale Parameter Server for Distributed Deep Neural
  Network Training
Parameter Hub: a Rack-Scale Parameter Server for Distributed Deep Neural Network Training
Liang Luo
Jacob Nelson
Luis Ceze
Amar Phanishayee
Arvind Krishnamurthy
64
120
0
21 May 2018
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
216
7,923
0
17 Aug 2015
1