cuDNN: Efficient Primitives for Deep Learning

3 October 2014
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer

Papers citing "cuDNN: Efficient Primitives for Deep Learning"

50 / 236 papers shown
Restructuring Batch Normalization to Accelerate CNN Training
Wonkyung Jung
Daejin Jung
Byeongho Kim
Sunjung Lee
Wonjong Rhee
Jung Ho Ahn
24
62
0
04 Jul 2018
Efficient ConvNets for Analog Arrays
Malte J. Rasch
Tayfun Gokmen
Mattia Rigotti
W. Haensch
28
11
0
03 Jul 2018
Multimodal feature fusion for CNN-based gait recognition: an empirical comparison
F. M. Castro
M. Marín-Jiménez
Nicolás Guil Mata
N. P. D. L. Blanca
CVBM
29
60
0
19 Jun 2018
Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
Haichuan Yang
Yuhao Zhu
Ji Liu
CVBM
19
36
0
12 Jun 2018
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark
Cody Coleman
Daniel Kang
Deepak Narayanan
Luigi Nardi
Tian Zhao
Jian Zhang
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
13
117
0
04 Jun 2018
BindsNET: A machine learning-oriented spiking neural networks library in Python
Hananel Hazan
D. J. Saunders
Hassaan Khan
Darpan T. Sanghavi
H. Siegelmann
R. Kozma
AI4CE
30
229
0
04 Jun 2018
Automatic Large-Scale Data Acquisition via Crowdsourcing for Crosswalk Classification: A Deep Learning Approach
Rodrigo Berriel
Franco Schmidt Rossi
Alberto F. de Souza
Thiago Oliveira-Santos
30
50
0
30 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
30
147
0
26 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
27
44
0
22 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
29
26
0
21 May 2018
Decorrelated Batch Normalization
Lei Huang
Dawei Yang
B. Lang
Jia Deng
16
190
0
23 Apr 2018
Context-aware Synthesis for Video Frame Interpolation
Simon Niklaus
Feng Liu
48
406
0
29 Mar 2018
Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions
Zheng Qin
Zhaoning Zhang
Dongsheng Li
Yiming Zhang
Yuxing Peng
25
28
0
27 Mar 2018
Flex-Convolution (Million-Scale Point-Cloud Learning Beyond Grid-Worlds)
F. Groh
P. Wieschollek
Hendrik P. A. Lensch
3DPC
16
107
0
20 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
25
80
0
16 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
45
1,304
0
12 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
MQ
24
19
0
05 Mar 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Xuhao Chen
13
25
0
28 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
703
0
26 Feb 2018
Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks
Zhihao Jia
Sina Lin
C. Qi
A. Aiken
37
117
0
14 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
18
1,574
0
05 Feb 2018
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services
Amir Erfan Eshratifar
M. Abrishami
Massoud Pedram
FedML
34
248
0
25 Jan 2018
SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks
Linnan Wang
Jinmian Ye
Yiyang Zhao
Wei Wu
Ang Li
Shuaiwen Leon Song
Zenglin Xu
Tim Kraska
3DH
46
264
0
13 Jan 2018
Neural networks catching up with finite differences in solving partial differential equations in higher dimensions
V. Avrutskiy
21
21
0
14 Dec 2017
200x Low-dose PET Reconstruction using Deep Learning
Junshen Xu
Enhao Gong
John M. Pauly
Greg Zaharchuk
MedIm
22
131
0
12 Dec 2017
Using Rule-Based Labels for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction
Garrett B. Goh
Charles Siegel
Abhinav Vishnu
Nathan Oken Hodas
21
90
0
07 Dec 2017
Deep Learning for Real-Time Crime Forecasting and its Ternarization
Bao Wang
Penghang Yin
Andrea L. Bertozzi
P. Brantingham
Stanley J. Osher
Jack Xin
AI4TS
38
82
0
23 Nov 2017
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks
Franyell Silfa
Gem Dot
J. Arnau
Antonio González
33
39
0
20 Nov 2017
MegDet: A Large Mini-Batch Object Detector
Chao Peng
Tete Xiao
Zeming Li
Yuning Jiang
Xiangyu Zhang
Kai Jia
Gang Yu
Jian Sun
ObjD
17
318
0
20 Nov 2017
Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs
S. Shi
Qiang-qiang Wang
Xiaowen Chu
37
110
0
16 Nov 2017
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Laurent Charlin
C. Pal
Yoshua Bengio
30
15
0
07 Nov 2017
Feedforward and Recurrent Neural Networks Backward Propagation and Hessian in Matrix Form
Maxim Naumov
23
9
0
16 Sep 2017
Distributed Training Large-Scale Deep Architectures
Shang-Xuan Zou
Chun-Yen Chen
Jui-Lin Wu
Chun-Nan Chou
Chia-Chin Tsao
Kuan-Chieh Tung
Ting-Wei Lin
Cheng-Lung Sung
Edward Y. Chang
26
22
0
10 Aug 2017
Structure-Preserving Image Super-resolution via Contextualized Multi-task Learning
Yukai Shi
Keze Wang
Chongyu Chen
Li Xu
Liang Lin
SupR
23
57
0
26 Jul 2017
Memory-Efficient Implementation of DenseNets
Geoff Pleiss
Danlu Chen
Gao Huang
Tongcheng Li
L. van der Maaten
Kilian Q. Weinberger
36
159
0
21 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
101
2,506
0
19 Jul 2017
Learning Local Receptive Fields and their Weight Sharing Scheme on Graphs
Jean-Charles Vialatte
Vincent Gripon
G. Coppin
19
5
0
08 Jun 2017
Brain Intelligence: Go Beyond Artificial Intelligence
Huimin Lu
Yujie Li
Min Chen
Hyoungseop Kim
S. Serikawa
26
949
0
04 Jun 2017
Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs
Xiaoming Chen
Jianxu Chen
Danny Chen
X. S. Hu
19
10
0
29 May 2017
Gabor Filter Assisted Energy Efficient Fast Learning Convolutional Neural Networks
Syed Shakib Sarwar
Priyadarshini Panda
Kaushik Roy
CVBM
11
100
0
12 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
27
176
0
03 May 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
S. Shi
Xiaowen Chu
20
43
0
25 Apr 2017
CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data
Lukas Cavigelli
Philippe Degen
Luca Benini
BDL
25
51
0
14 Apr 2017
Parallel Multi Channel Convolution using General Matrix Multiplication
Aravind Vasudevan
Andrew Anderson
David Gregg
16
139
0
06 Apr 2017
Active Convolution: Learning the Shape of Convolution for Image Classification
Yunho Jeon
Junmo Kim
26
171
0
27 Mar 2017
Deep Embedding Forest: Forest-based Serving with Deep Embedding Features
Jiehan Zhu
Ying Shan
JC Mao
Dong Yu
Holakou Rahmanian
Yi Zhang
14
52
0
15 Mar 2017
Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification
Jan Deriu
Aurelien Lucchi
V. D. Luca
Aliaksei Severyn
Simon Müller
Mark Cieliebak
Thomas Hofmann
Martin Jaggi
11
133
0
07 Mar 2017
Chain-NN: An Energy-Efficient 1D Chain Architecture for Accelerating Deep Convolutional Neural Networks
Shihao Wang
Dajiang Zhou
Xushen Han
T. Yoshimura
3DV
11
51
0
04 Mar 2017
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
39
37
0
02 Feb 2017
Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Wenjie Qu
Mohammad Pezeshki
Philemon Brakel
Saizheng Zhang
Yoshua Bengio
Aaron Courville
27
366
0
10 Jan 2017