ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08326
  4. Cited By
Balancing Efficiency and Flexibility for DNN Acceleration via Temporal
  GPU-Systolic Array Integration
v1v2 (latest)

Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration

Design Automation Conference (DAC), 2020
18 February 2020
Cong Guo
Yangjie Zhou
Jingwen Leng
Yuhao Zhu
Zidong Du
Quan Chen
Chao Li
Bin Yao
Minyi Guo
ArXiv (abs)PDFHTML

Papers citing "Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration"

10 / 10 papers shown
Accelerating Sparse DNNs Based on Tiled GEMM
Accelerating Sparse DNNs Based on Tiled GEMM
Cong Guo
Fengchen Xue
Jingwen Leng
Yuxian Qiu
Yue Guan
Weihao Cui
Quan Chen
Minyi Guo
199
18
0
16 Feb 2024
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels
  on GPUs
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUsACM International Conference on Computing Frontiers (CF), 2023
Yangjie Zhou
Yaoxu Song
Jingwen Leng
Zihan Liu
Weihao Cui
Zhendong Zhang
Cong Guo
Quan Chen
Li-Wei Li
Minyi Guo
GNN
268
5
0
27 May 2023
A High-Performance Accelerator for Super-Resolution Processing on
  Embedded GPU
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
W. Zhao
Qi Sun
Yang Bai
Wenbo Li
Haisheng Zheng
Bei Yu
Martin D. F. Wong
SupR
148
12
0
16 Mar 2023
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural
  Network Quantization
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network QuantizationMicro (MICRO), 2022
Cong Guo
Chen Zhang
Jingwen Leng
Zihan Liu
Fan Yang
Yun-Bo Liu
Minyi Guo
Yuhao Zhu
MQ
261
104
0
30 Aug 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian
  Approximation
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian ApproximationInternational Conference on Learning Representations (ICLR), 2022
Cong Guo
Yuxian Qiu
Jingwen Leng
Xiaotian Gao
Chen Zhang
Yunxin Liu
Fan Yang
Yuhao Zhu
Minyi Guo
MQ
289
89
0
14 Feb 2022
VELTAIR: Towards High-Performance Multi-tenant Deep Learning Services
  via Adaptive Compilation and Scheduling
VELTAIR: Towards High-Performance Multi-tenant Deep Learning Services via Adaptive Compilation and SchedulingInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
Zihan Liu
Jingwen Leng
Zhihui Zhang
Quan Chen
Chao Li
Minyi Guo
191
56
0
17 Jan 2022
Characterizing and Demystifying the Implicit Convolution Algorithm on
  Commercial Matrix-Multiplication Accelerators
Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication AcceleratorsIEEE International Symposium on Workload Characterization (IISWC), 2021
Yangjie Zhou
Mengtian Yang
Cong Guo
Jingwen Leng
Yun Liang
Quan Chen
Minyi Guo
Yuhao Zhu
152
47
0
08 Oct 2021
Dual-side Sparse Tensor Core
Dual-side Sparse Tensor CoreInternational Symposium on Computer Architecture (ISCA), 2021
Yang-Feng Wang
Chen Zhang
Zhiqiang Xie
Cong Guo
Yunxin Liu
Jingwen Leng
277
94
0
20 May 2021
DLFusion: An Auto-Tuning Compiler for Layer Fusion on Deep Neural
  Network Accelerator
DLFusion: An Auto-Tuning Compiler for Layer Fusion on Deep Neural Network Accelerator
Zihan Liu
Jingwen Leng
Quan Chen
Chao Li
Wenli Zheng
Li-Wei Li
Minyi Guo
155
8
0
11 Nov 2020
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise
  Sparsity
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise SparsityInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020
Cong Guo
B. Hsueh
Jingwen Leng
Yuxian Qiu
Yue Guan
Zehuan Wang
Xiaoying Jia
Xipeng Li
Minyi Guo
Yuhao Zhu
194
92
0
29 Aug 2020
1
Page 1 of 1