ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.03395
  4. Cited By
Low-memory GEMM-based convolution algorithms for deep neural networks

Low-memory GEMM-based convolution algorithms for deep neural networks

8 September 2017
Andrew Anderson
Aravind Vasudevan
Cormac Keane
David Gregg
ArXiv (abs)PDFHTML

Papers citing "Low-memory GEMM-based convolution algorithms for deep neural networks"

13 / 13 papers shown
Accelerating Machine Learning Primitives on Commodity Hardware
Accelerating Machine Learning Primitives on Commodity Hardware
R. Snytsar
160
0
0
08 Oct 2023
Photonic Accelerators for Image Segmentation in Autonomous Driving and
  Defect Detection
Photonic Accelerators for Image Segmentation in Autonomous Driving and Defect DetectionIEEE Conference on High Performance Extreme Computing (HPEC), 2023
Lakshmi Nair
David Widemann
Brad Turcott
Nick Moore
Alexandra Wleklinski
D. Bunandar
Ioannis Papavasileiou
Shihu Wang
Eric Logan
306
0
0
28 Sep 2023
Im2win: Memory Efficient Convolution On SIMD Architectures
Im2win: Memory Efficient Convolution On SIMD ArchitecturesIEEE Conference on High Performance Extreme Computing (HPEC), 2022
Shuai-bing Lu
Jun Chu
Xuantong Liu
155
7
0
25 Jun 2023
Sliding Window Sum Algorithms for Deep Neural Networks
Sliding Window Sum Algorithms for Deep Neural Networks
R. Snytsar
TPMAI4TS
143
3
0
25 May 2023
Characterizing and Demystifying the Implicit Convolution Algorithm on
  Commercial Matrix-Multiplication Accelerators
Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication AcceleratorsIEEE International Symposium on Workload Characterization (IISWC), 2021
Yangjie Zhou
Mengtian Yang
Cong Guo
Jingwen Leng
Yun Liang
Quan Chen
Minyi Guo
Yuhao Zhu
160
48
0
08 Oct 2021
Efficient and Generic 1D Dilated Convolution Layer for Deep Learning
Efficient and Generic 1D Dilated Convolution Layer for Deep Learning
Narendra Chaudhary
Sanchit Misra
Dhiraj D. Kalamkar
A. Heinecke
E. Georganas
Barukh Ziv
Menachem Adelman
Bharat Kaul
163
10
0
16 Apr 2021
Optimising the Performance of Convolutional Neural Networks across
  Computing Systems using Transfer Learning
Optimising the Performance of Convolutional Neural Networks across Computing Systems using Transfer Learning
Rik Mulder
Valentin Radu
Christophe Dubach
168
2
0
20 Oct 2020
Automated Design Space Exploration for optimised Deployment of DNN on
  Arm Cortex-A CPUs
Automated Design Space Exploration for optimised Deployment of DNN on Arm Cortex-A CPUs
Miguel de Prado
Andrew Mundy
Rabia Saeed
Maurizo Denna
Nuria Pazos
Luca Benini
234
11
0
09 Jun 2020
TASO: Time and Space Optimization for Memory-Constrained DNN Inference
TASO: Time and Space Optimization for Memory-Constrained DNN Inference
Yuan Wen
Andrew Anderson
Valentin Radu
Michael F. P. O'Boyle
David Gregg
224
11
0
21 May 2020
The Indirect Convolution Algorithm
The Indirect Convolution Algorithm
Marat Dukhan
203
46
0
03 Jul 2019
High-Performance Deep Learning via a Single Building Block
High-Performance Deep Learning via a Single Building Block
E. Georganas
K. Banerjee
Dhiraj D. Kalamkar
Sasikanth Avancha
Anand Venkat
Michael J. Anderson
G. Henry
Hans Pabst
A. Heinecke
208
13
0
15 Jun 2019
Anatomy Of High-Performance Deep Learning Convolutions On SIMD
  Architectures
Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures
E. Georganas
Sasikanth Avancha
K. Banerjee
Dhiraj D. Kalamkar
G. Henry
Hans Pabst
A. Heinecke
BDL
217
112
0
16 Aug 2018
Optimal DNN Primitive Selection with Partitioned Boolean Quadratic
  Programming
Optimal DNN Primitive Selection with Partitioned Boolean Quadratic Programming
Andrew Anderson
David Gregg
184
36
0
03 Oct 2017
1
Page 1 of 1