1410.0759
Cited By
cuDNN: Efficient Primitives for Deep Learning
3 October 2014
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
Papers citing
"cuDNN: Efficient Primitives for Deep Learning"
50 / 236 papers shown
Title
Restructuring Batch Normalization to Accelerate CNN Training
Wonkyung Jung
Daejin Jung
Byeongho Kim
Sunjung Lee
Wonjong Rhee
Jung Ho Ahn
24
62
0
04 Jul 2018
Efficient ConvNets for Analog Arrays
Malte J. Rasch
Tayfun Gokmen
Mattia Rigotti
W. Haensch
28
11
0
03 Jul 2018
Multimodal feature fusion for CNN-based gait recognition: an empirical comparison
F. M. Castro
M. Marín-Jiménez
Nicolás Guil Mata
N. P. D. L. Blanca
CVBM
29
60
0
19 Jun 2018
Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
Haichuan Yang
Yuhao Zhu
Ji Liu
CVBM
19
36
0
12 Jun 2018
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark
Cody Coleman
Daniel Kang
Deepak Narayanan
Luigi Nardi
Tian Zhao
Jian Zhang
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
13
117
0
04 Jun 2018
BindsNET: A machine learning-oriented spiking neural networks library in Python
Hananel Hazan
D. J. Saunders
Hassaan Khan
Darpan T. Sanghavi
H. Siegelmann
R. Kozma
AI4CE
30
229
0
04 Jun 2018
Automatic Large-Scale Data Acquisition via Crowdsourcing for Crosswalk Classification: A Deep Learning Approach
Rodrigo Berriel
Franco Schmidt Rossi
Alberto F. de Souza
Thiago Oliveira-Santos
30
50
0
30 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
30
147
0
26 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
27
44
0
22 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
29
26
0
21 May 2018
Decorrelated Batch Normalization
Lei Huang
Dawei Yang
B. Lang
Jia Deng
16
190
0
23 Apr 2018
Context-aware Synthesis for Video Frame Interpolation
Simon Niklaus
Feng Liu
48
406
0
29 Mar 2018
Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions
Zheng Qin
Zhaoning Zhang
Dongsheng Li
Yiming Zhang
Yuxing Peng
25
28
0
27 Mar 2018
Flex-Convolution (Million-Scale Point-Cloud Learning Beyond Grid-Worlds)
F. Groh
P. Wieschollek
Hendrik P. A. Lensch
3DPC
16
107
0
20 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
25
80
0
16 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
45
1,304
0
12 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
MQ
24
19
0
05 Mar 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Xuhao Chen
13
25
0
28 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
703
0
26 Feb 2018
Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks
Zhihao Jia
Sina Lin
C. Qi
A. Aiken
37
117
0
14 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
18
1,574
0
05 Feb 2018
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services
Amir Erfan Eshratifar
M. Abrishami
Massoud Pedram
FedML
34
248
0
25 Jan 2018
SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks
Linnan Wang
Jinmian Ye
Yiyang Zhao
Wei Wu
Ang Li
Shuaiwen Leon Song
Zenglin Xu
Tim Kraska
3DH
46
264
0
13 Jan 2018
Neural networks catching up with finite differences in solving partial differential equations in higher dimensions
V. Avrutskiy
21
21
0
14 Dec 2017
200x Low-dose PET Reconstruction using Deep Learning
Junshen Xu
Enhao Gong
John M. Pauly
Greg Zaharchuk
MedIm
22
131
0
12 Dec 2017
Using Rule-Based Labels for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction
Garrett B. Goh
Charles Siegel
Abhinav Vishnu
Nathan Oken Hodas
21
90
0
07 Dec 2017
Deep Learning for Real-Time Crime Forecasting and its Ternarization
Bao Wang
Penghang Yin
Andrea L. Bertozzi
P. Brantingham
Stanley J. Osher
Jack Xin
AI4TS
38
82
0
23 Nov 2017
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks
Franyell Silfa
Gem Dot
J. Arnau
Antonio González
33
39
0
20 Nov 2017
MegDet: A Large Mini-Batch Object Detector
Chao Peng
Tete Xiao
Zeming Li
Yuning Jiang
Xiangyu Zhang
Kai Jia
Gang Yu
Jian Sun
ObjD
17
318
0
20 Nov 2017
Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs
S. Shi
Qiang-qiang Wang
Xiaowen Chu
37
110
0
16 Nov 2017
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Laurent Charlin
C. Pal
Yoshua Bengio
30
15
0
07 Nov 2017
Feedforward and Recurrent Neural Networks Backward Propagation and Hessian in Matrix Form
Maxim Naumov
23
9
0
16 Sep 2017
Distributed Training Large-Scale Deep Architectures
Shang-Xuan Zou
Chun-Yen Chen
Jui-Lin Wu
Chun-Nan Chou
Chia-Chin Tsao
Kuan-Chieh Tung
Ting-Wei Lin
Cheng-Lung Sung
Edward Y. Chang
26
22
0
10 Aug 2017
Structure-Preserving Image Super-resolution via Contextualized Multi-task Learning
Yukai Shi
Keze Wang
Chongyu Chen
Li Xu
Liang Lin
SupR
23
57
0
26 Jul 2017
Memory-Efficient Implementation of DenseNets
Geoff Pleiss
Danlu Chen
Gao Huang
Tongcheng Li
L. van der Maaten
Kilian Q. Weinberger
36
159
0
21 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
101
2,506
0
19 Jul 2017
Learning Local Receptive Fields and their Weight Sharing Scheme on Graphs
Jean-Charles Vialatte
Vincent Gripon
G. Coppin
19
5
0
08 Jun 2017
Brain Intelligence: Go Beyond Artificial Intelligence
Huimin Lu
Yujie Li
Min Chen
Hyoungseop Kim
S. Serikawa
26
949
0
04 Jun 2017
Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs
Xiaoming Chen
Jianxu Chen
Danny Chen
X. S. Hu
19
10
0
29 May 2017
Gabor Filter Assisted Energy Efficient Fast Learning Convolutional Neural Networks
Syed Shakib Sarwar
Priyadarshini Panda
Kaushik Roy
CVBM
11
100
0
12 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
27
176
0
03 May 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
S. Shi
Xiaowen Chu
20
43
0
25 Apr 2017
CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data
Lukas Cavigelli
Philippe Degen
Luca Benini
BDL
25
51
0
14 Apr 2017
Parallel Multi Channel Convolution using General Matrix Multiplication
Aravind Vasudevan
Andrew Anderson
David Gregg
16
139
0
06 Apr 2017
Active Convolution: Learning the Shape of Convolution for Image Classification
Yunho Jeon
Junmo Kim
26
171
0
27 Mar 2017
Deep Embedding Forest: Forest-based Serving with Deep Embedding Features
Jiehan Zhu
Ying Shan
JC Mao
Dong Yu
Holakou Rahmanian
Yi Zhang
14
52
0
15 Mar 2017
Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification
Jan Deriu
Aurelien Lucchi
V. D. Luca
Aliaksei Severyn
Simon Müller
Mark Cieliebak
Thomas Hofmann
Martin Jaggi
11
133
0
07 Mar 2017
Chain-NN: An Energy-Efficient 1D Chain Architecture for Accelerating Deep Convolutional Neural Networks
Shihao Wang
Dajiang Zhou
Xushen Han
T. Yoshimura
3DV
11
51
0
04 Mar 2017
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
39
37
0
02 Feb 2017
Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Wenjie Qu
Mohammad Pezeshki
Philemon Brakel
Saizheng Zhang
Yoshua Bengio
Aaron Courville
27
366
0
10 Jan 2017