Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1608.06581
Cited By

Fathom: Reference Workloads for Modern Deep Learning Methods

Fathom: Reference Workloads for Modern Deep Learning Methods

IEEE International Symposium on Workload Characterization (IISWC), 2016

23 August 2016

David Brooks

ArXiv (abs)PDF HTML

Papers citing "Fathom: Reference Workloads for Modern Deep Learning Methods"

50 / 60 papers shown

An MLCommons Scientific Benchmarks Ontology

An MLCommons Scientific Benchmarks Ontology

G. V. Laszewski

Matthew D. Sinclair

Shivaram Venkataraman

Geoffrey C. Fox

96

1

0

06 Nov 2025

On the Performance and Memory Footprint of Distributed Training: An
Empirical Study on Transformers

On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers

Tao Li

221

4

0

02 Jul 2024

I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey

I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey

495

4

0

16 Apr 2024

GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System

GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System

197

4

0

05 Apr 2024

Beyond Inference: Performance Analysis of DNN Server Overheads for
Computer Vision

Beyond Inference: Performance Analysis of DNN Server Overheads for Computer Vision

Ahmed F. AbouElhamayed

Deshanand Singh

Mohamed S. Abdelfattah

109

0

0

02 Mar 2024

TorchBench: Benchmarking PyTorch with High API Surface Coverage

TorchBench: Benchmarking PyTorch with High API Surface Coverage

William Constable

342

12

0

27 Apr 2023

Hierarchical Training of Deep Neural Networks Using Early Exiting

Hierarchical Training of Deep Neural Networks Using Early ExitingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

A. C. Yüzügüler

243

11

0

04 Mar 2023

Experimenting with Emerging RISC-V Systems for Decentralised Machine
Learning

Experimenting with Emerging RISC-V Systems for Decentralised Machine LearningACM International Conference on Computing Frontiers (CF), 2023

Gianluca Mittone

Iacopo Colonnelli

...

Francesco Beneventi

Massimo Torquati

Luca Benini

Marco Aldinucci

202

16

0

15 Feb 2023

Computation vs. Communication Scaling for Future Transformers on Future
Hardware

Computation vs. Communication Scaling for Future Transformers on Future Hardware

Suchita Pati

Shaizeen Aga

Mahzabeen Islam

Nuwan Jayasena

Matthew D. Sinclair

267

14

0

06 Feb 2023

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and
Inference Workloads on Multi-Instance GPUs

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs

Huaizheng Zhang

Yang You

161

6

0

01 Jan 2023

Mystique: Enabling Accurate and Scalable Generation of Production AI
Benchmarks

Mystique: Enabling Accurate and Scalable Generation of Production AI BenchmarksInternational Symposium on Computer Architecture (ISCA), 2022

Louis Feng

Srinivas Sridharan

Christina Delimitrou

189

16

0

16 Dec 2022

An Overview of the Data-Loader Landscape: Comparative Performance
Analysis

An Overview of the Data-Loader Landscape: Comparative Performance AnalysisBigData Congress [Services Society] (BSS), 2022

Diego Kiedanski

Leandros Tassiulas

191

7

0

27 Sep 2022

Accelerating Neural Network Inference with Processing-in-DRAM: From the
Edge to the Cloud

Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the CloudIEEE Micro (IEEE Micro), 2022

Geraldo F. Oliveira

Juan Gómez Luna

Amirali Boroumand

221

31

0

19 Sep 2022

Snowmass 2021 Computational Frontier CompF4 Topical Group Report:
Storage and Processing Resource Access

Snowmass 2021 Computational Frontier CompF4 Topical Group Report: Storage and Processing Resource Access

Javier M. Duarte

...

228

0

0

19 Sep 2022

Metadata Representations for Queryable ML Model Zoos

Metadata Representations for Queryable ML Model Zoos

Ziyu Li

Rihan Hai

Asterios Katsifodimos

60

4

0

19 Jul 2022

FastML Science Benchmarks: Accelerating Real-Time Scientific Edge
Machine Learning

FastML Science Benchmarks: Accelerating Real-Time Scientific Edge Machine Learning

Javier Mauricio Duarte

Vijay Janapa Reddi

281

19

0

16 Jul 2022

Benchmarking of DL Libraries and Models on Mobile Devices

Benchmarking of DL Libraries and Models on Mobile DevicesThe Web Conference (WWW), 2022

Mengwei Xu

Shangguang Wang

205

56

0

14 Feb 2022

Benchmarking Resource Usage for Efficient Distributed Deep Learning

Benchmarking Resource Usage for Efficient Distributed Deep LearningIEEE Conference on High Performance Extreme Computing (HPEC), 2022

Baolin Li

Joseph McDonald

Michael Jones

Devesh Tiwari

187

13

0

28 Jan 2022

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning
on HPC Systems

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

Aleksandr Drozd

...

Akihiro Tabuchi

Masafumi Yamazaki

240

50

0

21 Oct 2021

Demystifying BERT: Implications for Accelerator Design

Demystifying BERT: Implications for Accelerator Design

Suchita Pati

Shaizeen Aga

Nuwan Jayasena

Matthew D. Sinclair

197

16

0

14 Apr 2021

A Case for 3D Integrated System Design for Neuromorphic Computing & AI
Applications

A Case for 3D Integrated System Design for Neuromorphic Computing & AI ApplicationsSocial Science Research Network (SSRN), 2021

163

3

0

02 Mar 2021

A Runtime-Based Computational Performance Predictor for Deep Neural
Network Training

A Runtime-Based Computational Performance Predictor for Deep Neural Network TrainingUSENIX Annual Technical Conference (USENIX ATC), 2021

Gennady Pekhimenko

162

83

0

31 Jan 2021

InferBench: Understanding Deep Learning Inference Serving with an
Automatic Benchmarking System

InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System

Huaizheng Zhang

177

3

0

04 Nov 2020

Cross-Stack Workload Characterization of Deep Recommendation Systems

Cross-Stack Workload Characterization of Deep Recommendation SystemsIEEE International Symposium on Workload Characterization (IISWC), 2020

David Brooks

239

38

0

10 Oct 2020

Bosch Deep Learning Hardware Benchmark

Bosch Deep Learning Hardware Benchmark

Dimitrios Bariamis

Michael Pfeiffer

117

0

0

24 Aug 2020

AIPerf: Automated machine learning as an AI-HPC benchmark

AIPerf: Automated machine learning as an AI-HPC benchmark

319

29

0

17 Aug 2020

SeqPoint: Identifying Representative Iterations of Sequence-based Neural
Networks

SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks

Suchita Pati

Shaizeen Aga

Matthew D. Sinclair

Nuwan Jayasena

134

10

0

20 Jul 2020

Comparison and Benchmarking of AI Models and Frameworks on Mobile
Devices

Comparison and Benchmarking of AI Models and Frameworks on Mobile Devices

Lei Wang

190

68

0

07 May 2020

AIBench Scenario: Scenario-distilling AI Benchmarking

AIBench Scenario: Scenario-distilling AI BenchmarkingInternational Conference on Parallel Architectures and Compilation Techniques (PACT), 2020

Lei Wang

258

14

0

06 May 2020

AIBench Training: Balanced Industry-Standard AI Training Benchmarking

AIBench Training: Balanced Industry-Standard AI Training Benchmarking

...

187

3

0

30 Apr 2020

Energy Predictive Models for Convolutional Neural Networks on Mobile
Platforms

Energy Predictive Models for Convolutional Neural Networks on Mobile Platforms

Crefeda Faviola Rodrigues

Graham D. Riley

104

4

0

10 Apr 2020

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI
Benchmark Suite

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite

...

149

1

0

17 Feb 2020

DeepRecSys: A System for Optimizing End-To-End At-scale Neural
Recommendation Inference

DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation InferenceInternational Symposium on Computer Architecture (ISCA), 2020

Hsien-Hsin S. Lee

David Brooks

218

200

0

08 Jan 2020

RecNMP: Accelerating Personalized Recommendation with Near-Memory
Processing

RecNMP: Accelerating Personalized Recommendation with Near-Memory ProcessingInternational Symposium on Computer Architecture (ISCA), 2019

...

Dheevatsa Mudigere

Martin D. Schatz

189

252

0

30 Dec 2019

DLBricks: Composable Benchmark Generation to Reduce Deep Learning
Benchmarking Effort on CPUs (Extended)

DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs (Extended)International Conference on Performance Engineering (ICPE), 2019

Jinjun Xiong

222

0

0

18 Nov 2019

MLPerf Inference Benchmark

MLPerf Inference BenchmarkInternational Symposium on Computer Architecture (ISCA), 2019

Vijayarāghava Reḍḍī

Guenther Schmuelling

...

314

593

0

06 Nov 2019

On-Device Machine Learning: An Algorithms and Learning Theory
Perspective

On-Device Machine Learning: An Algorithms and Learning Theory Perspective

454

171

0

02 Nov 2019

Characterizing Deep Learning Training Workloads on Alibaba-PAI

Characterizing Deep Learning Training Workloads on Alibaba-PAIIEEE International Symposium on Workload Characterization (IISWC), 2019

Mengdi Wang

147

62

0

14 Oct 2019

MLPerf Training Benchmark

MLPerf Training BenchmarkConference on Machine Learning and Systems (MLSys), 2019

Arya D. McCarthy

Christine Cheng

Cody Coleman

Paulius Micikevicius

...

Masafumi Yamazaki

Matei A. Zaharia

522

348

0

02 Oct 2019

AI Matrix: A Deep Learning Benchmark for Alibaba Data Centers

AI Matrix: A Deep Learning Benchmark for Alibaba Data Centers

Wei Zhang

99

23

0

23 Sep 2019

Demystifying the MLPerf Benchmark Suite

Demystifying the MLPerf Benchmark Suite

Bagus Hanindhito

R. Radhakrishnan

119

8

0

24 Aug 2019

XSP: Across-Stack Profiling and Analysis of Machine Learning Models on
GPUs

XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs

Jinjun Xiong

133

16

0

19 Aug 2019

AIBench: An Industry Standard Internet Service AI Benchmark Suite

AIBench: An Industry Standard Internet Service AI Benchmark Suite

Lei Wang

...

166

48

0

13 Aug 2019

HPC AI500: A Benchmark Suite for HPC AI Systems

HPC AI500: A Benchmark Suite for HPC AI SystemsBenchCouncil International Symposium (ISB), 2018

Lei Wang

...

Shengzhong Feng

173

42

0

27 Jul 2019

A Workload and Programming Ease Driven Perspective of
Processing-in-Memory

A Workload and Programming Ease Driven Perspective of Processing-in-Memory

Amirali Boroumand

Juan Gómez Luna

109

11

0

26 Jul 2019

Benchmarking TPU, GPU, and CPU Platforms for Deep Learning

Benchmarking TPU, GPU, and CPU Platforms for Deep Learning

David Brooks

292

305

0

24 Jul 2019

Performance Analysis and Characterization of Training Deep Learning
Models on Mobile Devices

Performance Analysis and Characterization of Training Deep Learning Models on Mobile Devices

155

5

0

10 Jun 2019

The Architectural Implications of Facebook's DNN-based Personalized
Recommendation

The Architectural Implications of Facebook's DNN-based Personalized RecommendationInternational Symposium on High-Performance Computer Architecture (HPCA), 2019

...

Andrey Malevich

Dheevatsa Mudigere

331

316

0

06 Jun 2019

Performance Analysis of Deep Learning Workloads on Leading-edge Systems

Performance Analysis of Deep Learning Workloads on Leading-edge SystemsInternational Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), 2019

Zhongjing Jiang

103

23

0

21 May 2019

DeepOBS: A Deep Learning Optimizer Benchmark Suite

DeepOBS: A Deep Learning Optimizer Benchmark Suite

Frank Schneider

367

76

0

13 Mar 2019

Page 1 of 2