2009.04598
Cited By
Time-Based Roofline for Deep Learning Performance Analysis
9 September 2020
Yunsong Wang, Charlene Yang, S. Farrell, Yan Zhang, Thorsten Kurth, Samuel Williams
Papers citing "Time-Based Roofline for Deep Learning Performance Analysis"
4 / 4 papers shown
Analyzing I/O Performance of a Hierarchical HPC Storage System for Distributed Deep Learning
Takaaki Fukai, Kento Sato, Takahiro Hirofuchi
22  2  0
04 Jan 2023

TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s
Felix Chern, Blake A. Hechtman, Andy Davis, Ruiqi Guo, David Majnemer, Surinder Kumar
94  22  0
28 Jun 2022

E^2TAD: An Energy-Efficient Tracking-based Action Detector
Xin Hu, Zhenyu Wu, Haoyuan Miao, Siqi Fan, Taiyu Long, ..., Pengcheng Pi, Yi Wu, Zhou Ren, Zhangyang Wang, G. Hua
21  2  0
09 Apr 2022

8 Steps to 3.7 TFLOP/s on NVIDIA V100 GPU: Roofline Analysis and Other Tricks
Charlene Yang
10  10  0
26 Aug 2020