Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.08326
Cited By
v1
v2 (latest)
Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration
Design Automation Conference (DAC), 2020
18 February 2020
Cong Guo
Yangjie Zhou
Jingwen Leng
Yuhao Zhu
Zidong Du
Quan Chen
Chao Li
Bin Yao
Minyi Guo
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration"
10 / 10 papers shown
Accelerating Sparse DNNs Based on Tiled GEMM
Cong Guo
Fengchen Xue
Jingwen Leng
Yuxian Qiu
Yue Guan
Weihao Cui
Quan Chen
Minyi Guo
199
18
0
16 Feb 2024
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs
ACM International Conference on Computing Frontiers (CF), 2023
Yangjie Zhou
Yaoxu Song
Jingwen Leng
Zihan Liu
Weihao Cui
Zhendong Zhang
Cong Guo
Quan Chen
Li-Wei Li
Minyi Guo
GNN
268
5
0
27 May 2023
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
W. Zhao
Qi Sun
Yang Bai
Wenbo Li
Haisheng Zheng
Bei Yu
Martin D. F. Wong
SupR
148
12
0
16 Mar 2023
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
Micro (MICRO), 2022
Cong Guo
Chen Zhang
Jingwen Leng
Zihan Liu
Fan Yang
Yun-Bo Liu
Minyi Guo
Yuhao Zhu
MQ
261
104
0
30 Aug 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation
International Conference on Learning Representations (ICLR), 2022
Cong Guo
Yuxian Qiu
Jingwen Leng
Xiaotian Gao
Chen Zhang
Yunxin Liu
Fan Yang
Yuhao Zhu
Minyi Guo
MQ
289
89
0
14 Feb 2022
VELTAIR: Towards High-Performance Multi-tenant Deep Learning Services via Adaptive Compilation and Scheduling
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
Zihan Liu
Jingwen Leng
Zhihui Zhang
Quan Chen
Chao Li
Minyi Guo
191
56
0
17 Jan 2022
Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators
IEEE International Symposium on Workload Characterization (IISWC), 2021
Yangjie Zhou
Mengtian Yang
Cong Guo
Jingwen Leng
Yun Liang
Quan Chen
Minyi Guo
Yuhao Zhu
152
47
0
08 Oct 2021
Dual-side Sparse Tensor Core
International Symposium on Computer Architecture (ISCA), 2021
Yang-Feng Wang
Chen Zhang
Zhiqiang Xie
Cong Guo
Yunxin Liu
Jingwen Leng
277
94
0
20 May 2021
DLFusion: An Auto-Tuning Compiler for Layer Fusion on Deep Neural Network Accelerator
Zihan Liu
Jingwen Leng
Quan Chen
Chao Li
Wenli Zheng
Li-Wei Li
Minyi Guo
155
8
0
11 Nov 2020
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020
Cong Guo
B. Hsueh
Jingwen Leng
Yuxian Qiu
Yue Guan
Zehuan Wang
Xiaoying Jia
Xipeng Li
Minyi Guo
Yuhao Zhu
194
92
0
29 Aug 2020
1
Page 1 of 1