Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1410.0759
Cited By
cuDNN: Efficient Primitives for Deep Learning
3 October 2014
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
Re-assign community
ArXiv
PDF
HTML
Papers citing
"cuDNN: Efficient Primitives for Deep Learning"
36 / 236 papers shown
Title
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
26
18
0
20 Dec 2016
SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving
Bichen Wu
Alvin Wan
F. Iandola
Peter H. Jin
Kurt Keutzer
44
512
0
04 Dec 2016
CAS-CNN: A Deep Convolutional Neural Network for Image Compression Artifact Suppression
Lukas Cavigelli
P. Hager
Luca Benini
14
195
0
22 Nov 2016
Factorized Bilinear Models for Image Recognition
Yanghao Li
Naiyan Wang
Jiaying Liu
Xiaodi Hou
19
96
0
17 Nov 2016
How to scale distributed deep learning?
Peter H. Jin
Qiaochu Yuan
F. Iandola
Kurt Keutzer
3DH
27
136
0
14 Nov 2016
Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
R. Dicecco
Griffin Lacey
Jasmina Vasiljevic
P. Chow
Graham W. Taylor
S. Areibi
21
92
0
30 Sep 2016
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
Vadim Kantorov
Maxime Oquab
Minsu Cho
Ivan Laptev
WSOL
25
305
0
14 Sep 2016
Benchmarking State-of-the-Art Deep Learning Software Tools
S. Shi
Qiang-qiang Wang
Pengfei Xu
Xiaowen Chu
BDL
16
327
0
25 Aug 2016
Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure
Min Wang
Baoyuan Liu
H. Foroosh
27
51
0
15 Aug 2016
Learning Structured Sparsity in Deep Neural Networks
W. Wen
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
47
2,323
0
12 Aug 2016
Accelerating Eulerian Fluid Simulation With Convolutional Networks
Jonathan Tompson
Kristofer Schlachter
Pablo Sprechmann
Ken Perlin
58
528
0
13 Jul 2016
Omnivore: An Optimizer for Multi-device Deep Learning on CPUs and GPUs
Stefan Hadjis
Ce Zhang
Ioannis Mitliagkas
Dan Iter
Christopher Ré
20
65
0
14 Jun 2016
Structured Convolution Matrices for Energy-efficient Deep learning
R. Appuswamy
T. Nayak
John V. Arthur
S. K. Esser
P. Merolla
J. McKinstry
T. Melano
M. Flickner
D. Modha
38
11
0
08 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
235
2,059
0
07 Jun 2016
Boda-RTC: Productive Generation of Portable, Efficient Code for Convolutional Neural Networks on Mobile Computing Platforms
Matthew W. Moskewicz
F. Iandola
Kurt Keutzer
9
8
0
01 Jun 2016
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Z. Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
101
18,262
0
27 May 2016
An Analysis of Deep Neural Network Models for Practical Applications
A. Canziani
Adam Paszke
Eugenio Culurciello
19
1,164
0
24 May 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
Philipp Gysel
29
127
0
20 May 2016
Theano: A Python framework for fast computation of mathematical expressions
The Theano Development Team
Rami Al-Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermüller
...
Kelvin Xu
Lijun Xue
Li Yao
Saizheng Zhang
Ying Zhang
37
2,335
0
09 May 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
54
10,168
0
27 Mar 2016
TTC: A high-performance Compiler for Tensor Transpositions
P. Springer
J. Hammond
Paolo Bientinesi
17
17
0
07 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation
Daisuke Miyashita
Edward H. Lee
B. Murmann
MQ
24
425
0
03 Mar 2016
Automatic learning of gait signatures for people identification
F. M. Castro
M. Marín-Jiménez
Nicolás Guil Mata
N. P. D. L. Blanca
CVBM
19
96
0
03 Mar 2016
DeepSpark: A Spark-Based Distributed Deep Learning Framework for Commodity Clusters
Hanjoo Kim
Jaehong Park
Jaehee Jang
Sungroh Yoon
BDL
32
37
0
26 Feb 2016
Deep Learning on FPGAs: Past, Present, and Future
Griffin Lacey
Graham W. Taylor
S. Areibi
GNN
18
180
0
13 Feb 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
45
2,955
0
08 Dec 2015
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
21
302
0
31 Oct 2015
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Jure Zbontar
Yann LeCun
3DV
11
1,384
0
20 Oct 2015
Semantic Image Segmentation via Deep Parsing Network
Ziwei Liu
Xiaoxiao Li
Ping Luo
Chen Change Loy
Xiaoou Tang
6
659
0
09 Sep 2015
Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical Volumetric Image Segmentation
Marijn F. Stollenga
Wonmin Byeon
Marcus Liwicki
Jürgen Schmidhuber
23
294
0
24 Jun 2015
Fast ConvNets Using Group-wise Brain Damage
V. Lebedev
Victor Lempitsky
AAML
44
447
0
08 Jun 2015
Caffe con Troll: Shallow Ideas to Speed Up Deep Learning
Stefan Hadjis
Firas Abuzaid
Ce Zhang
Christopher Ré
BDL
18
71
0
16 Apr 2015
Learning to Compare Image Patches via Convolutional Neural Networks
Sergey Zagoruyko
N. Komodakis
SSL
19
1,434
0
14 Apr 2015
Automatic differentiation in machine learning: a survey
A. G. Baydin
Barak A. Pearlmutter
Alexey Radul
J. Siskind
PINN
AI4CE
ODL
75
2,750
0
20 Feb 2015
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Nicolas Vasilache
Jeff Johnson
Michaël Mathieu
Soumith Chintala
Serkan Piantino
Yann LeCun
21
346
0
24 Dec 2014
Deep Speech: Scaling up end-to-end speech recognition
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
...
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
71
2,109
0
17 Dec 2014
Previous
1
2
3
4
5