ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.03901
  4. Cited By
Characterizing and Demystifying the Implicit Convolution Algorithm on
  Commercial Matrix-Multiplication Accelerators

Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators

8 October 2021
Yangjie Zhou
Mengtian Yang
Cong Guo
Jingwen Leng
Yun Liang
Quan Chen
M. Guo
Yuhao Zhu
ArXivPDFHTML

Papers citing "Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators"

8 / 8 papers shown
Title
Potamoi: Accelerating Neural Rendering via a Unified Streaming
  Architecture
Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture
Yu Feng
Weikai Lin
Zihan Liu
Jingwen Leng
Minyi Guo
Han Zhao
Xiaofeng Hou
Jieru Zhao
Yuhao Zhu
26
3
0
13 Aug 2024
Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural
  Rendering by Radiance Warping and Memory Optimizations
Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations
Yu Feng
Zihan Liu
Jingwen Leng
Minyi Guo
Yuhao Zhu
33
8
0
18 Apr 2024
Accelerating Sparse DNNs Based on Tiled GEMM
Accelerating Sparse DNNs Based on Tiled GEMM
Cong Guo
Fengchen Xue
Jingwen Leng
Yuxian Qiu
Yue Guan
Weihao Cui
Quan Chen
Minyi Guo
11
9
0
16 Feb 2024
Performance Analysis of DNN Inference/Training with Convolution and
  non-Convolution Operations
Performance Analysis of DNN Inference/Training with Convolution and non-Convolution Operations
H. Esmaeilzadeh
Soroush Ghodrati
A. Kahng
Sean Kinzer
Susmita Dey Manasi
S. Sapatnekar
Zhiang Wang
11
2
0
29 Jun 2023
DistSim: A performance model of large-scale hybrid distributed DNN
  training
DistSim: A performance model of large-scale hybrid distributed DNN training
Guandong Lu
Run Chen
Yakai Wang
Yangjie Zhou
Rui Zhang
...
Yanming Miao
Zhifang Cai
Li-Wei Li
Jingwen Leng
Minyi Guo
14
10
0
14 Jun 2023
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels
  on GPUs
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs
Yangjie Zhou
Yaoxu Song
Jingwen Leng
Zihan Liu
Weihao Cui
Zhendong Zhang
Cong Guo
Quan Chen
Li-Wei Li
Minyi Guo
GNN
17
1
0
27 May 2023
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural
  Network Quantization
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
Cong Guo
Chen Zhang
Jingwen Leng
Zihan Liu
Fan Yang
Yun-Bo Liu
Minyi Guo
Yuhao Zhu
MQ
4
52
0
30 Aug 2022
VELTAIR: Towards High-Performance Multi-tenant Deep Learning Services
  via Adaptive Compilation and Scheduling
VELTAIR: Towards High-Performance Multi-tenant Deep Learning Services via Adaptive Compilation and Scheduling
Zihan Liu
Jingwen Leng
Zhihui Zhang
Quan Chen
Chao Li
M. Guo
11
46
0
17 Jan 2022
1