ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.04598
  4. Cited By
Time-Based Roofline for Deep Learning Performance Analysis

Time-Based Roofline for Deep Learning Performance Analysis

9 September 2020
Yunsong Wang
Charlene Yang
S. Farrell
Yan Zhang
Thorsten Kurth
Samuel Williams
ArXivPDFHTML

Papers citing "Time-Based Roofline for Deep Learning Performance Analysis"

4 / 4 papers shown
Title
Analyzing I/O Performance of a Hierarchical HPC Storage System for
  Distributed Deep Learning
Analyzing I/O Performance of a Hierarchical HPC Storage System for Distributed Deep Learning
Takaaki Fukai
Kento Sato
Takahiro Hirofuchi
22
2
0
04 Jan 2023
TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s
TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s
Felix Chern
Blake A. Hechtman
Andy Davis
Ruiqi Guo
David Majnemer
Surinder Kumar
94
22
0
28 Jun 2022
E^2TAD: An Energy-Efficient Tracking-based Action Detector
E^2TAD: An Energy-Efficient Tracking-based Action Detector
Xin Hu
Zhenyu Wu
Haoyuan Miao
Siqi Fan
Taiyu Long
...
Pengcheng Pi
Yi Wu
Zhou Ren
Zhangyang Wang
G. Hua
19
2
0
09 Apr 2022
8 Steps to 3.7 TFLOP/s on NVIDIA V100 GPU: Roofline Analysis and Other
  Tricks
8 Steps to 3.7 TFLOP/s on NVIDIA V100 GPU: Roofline Analysis and Other Tricks
Charlene Yang
8
10
0
26 Aug 2020
1