ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1410.0759
  4. Cited By
cuDNN: Efficient Primitives for Deep Learning

cuDNN: Efficient Primitives for Deep Learning

3 October 2014
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
ArXivPDFHTML

Papers citing "cuDNN: Efficient Primitives for Deep Learning"

36 / 236 papers shown
Title
Exploring the Design Space of Deep Convolutional Neural Networks at
  Large Scale
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
26
18
0
20 Dec 2016
SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural
  Networks for Real-Time Object Detection for Autonomous Driving
SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving
Bichen Wu
Alvin Wan
F. Iandola
Peter H. Jin
Kurt Keutzer
44
512
0
04 Dec 2016
CAS-CNN: A Deep Convolutional Neural Network for Image Compression
  Artifact Suppression
CAS-CNN: A Deep Convolutional Neural Network for Image Compression Artifact Suppression
Lukas Cavigelli
P. Hager
Luca Benini
14
195
0
22 Nov 2016
Factorized Bilinear Models for Image Recognition
Factorized Bilinear Models for Image Recognition
Yanghao Li
Naiyan Wang
Jiaying Liu
Xiaodi Hou
19
96
0
17 Nov 2016
How to scale distributed deep learning?
How to scale distributed deep learning?
Peter H. Jin
Qiaochu Yuan
F. Iandola
Kurt Keutzer
3DH
27
136
0
14 Nov 2016
Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
R. Dicecco
Griffin Lacey
Jasmina Vasiljevic
P. Chow
Graham W. Taylor
S. Areibi
21
92
0
30 Sep 2016
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised
  Localization
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
Vadim Kantorov
Maxime Oquab
Minsu Cho
Ivan Laptev
WSOL
25
305
0
14 Sep 2016
Benchmarking State-of-the-Art Deep Learning Software Tools
Benchmarking State-of-the-Art Deep Learning Software Tools
S. Shi
Qiang-qiang Wang
Pengfei Xu
Xiaowen Chu
BDL
16
327
0
25 Aug 2016
Design of Efficient Convolutional Layers using Single Intra-channel
  Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure
Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure
Min Wang
Baoyuan Liu
H. Foroosh
27
51
0
15 Aug 2016
Learning Structured Sparsity in Deep Neural Networks
Learning Structured Sparsity in Deep Neural Networks
W. Wen
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
47
2,323
0
12 Aug 2016
Accelerating Eulerian Fluid Simulation With Convolutional Networks
Accelerating Eulerian Fluid Simulation With Convolutional Networks
Jonathan Tompson
Kristofer Schlachter
Pablo Sprechmann
Ken Perlin
58
528
0
13 Jul 2016
Omnivore: An Optimizer for Multi-device Deep Learning on CPUs and GPUs
Omnivore: An Optimizer for Multi-device Deep Learning on CPUs and GPUs
Stefan Hadjis
Ce Zhang
Ioannis Mitliagkas
Dan Iter
Christopher Ré
20
65
0
14 Jun 2016
Structured Convolution Matrices for Energy-efficient Deep learning
Structured Convolution Matrices for Energy-efficient Deep learning
R. Appuswamy
T. Nayak
John V. Arthur
S. K. Esser
P. Merolla
J. McKinstry
T. Melano
M. Flickner
D. Modha
38
11
0
08 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic
  Segmentation
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
235
2,059
0
07 Jun 2016
Boda-RTC: Productive Generation of Portable, Efficient Code for
  Convolutional Neural Networks on Mobile Computing Platforms
Boda-RTC: Productive Generation of Portable, Efficient Code for Convolutional Neural Networks on Mobile Computing Platforms
Matthew W. Moskewicz
F. Iandola
Kurt Keutzer
9
8
0
01 Jun 2016
TensorFlow: A system for large-scale machine learning
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Z. Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
101
18,262
0
27 May 2016
An Analysis of Deep Neural Network Models for Practical Applications
An Analysis of Deep Neural Network Models for Practical Applications
A. Canziani
Adam Paszke
Eugenio Culurciello
19
1,164
0
24 May 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural
  Networks
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
Philipp Gysel
29
127
0
20 May 2016
Theano: A Python framework for fast computation of mathematical
  expressions
Theano: A Python framework for fast computation of mathematical expressions
The Theano Development Team
Rami Al-Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermüller
...
Kelvin Xu
Lijun Xue
Li Yao
Saizheng Zhang
Ying Zhang
37
2,335
0
09 May 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
54
10,168
0
27 Mar 2016
TTC: A high-performance Compiler for Tensor Transpositions
TTC: A high-performance Compiler for Tensor Transpositions
P. Springer
J. Hammond
Paolo Bientinesi
17
17
0
07 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation
Convolutional Neural Networks using Logarithmic Data Representation
Daisuke Miyashita
Edward H. Lee
B. Murmann
MQ
24
425
0
03 Mar 2016
Automatic learning of gait signatures for people identification
Automatic learning of gait signatures for people identification
F. M. Castro
M. Marín-Jiménez
Nicolás Guil Mata
N. P. D. L. Blanca
CVBM
19
96
0
03 Mar 2016
DeepSpark: A Spark-Based Distributed Deep Learning Framework for
  Commodity Clusters
DeepSpark: A Spark-Based Distributed Deep Learning Framework for Commodity Clusters
Hanjoo Kim
Jaehong Park
Jaehee Jang
Sungroh Yoon
BDL
32
37
0
26 Feb 2016
Deep Learning on FPGAs: Past, Present, and Future
Deep Learning on FPGAs: Past, Present, and Future
Griffin Lacey
Graham W. Taylor
S. Areibi
GNN
18
180
0
13 Feb 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
45
2,955
0
08 Dec 2015
FireCaffe: near-linear acceleration of deep neural network training on
  compute clusters
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
21
302
0
31 Oct 2015
Stereo Matching by Training a Convolutional Neural Network to Compare
  Image Patches
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Jure Zbontar
Yann LeCun
3DV
11
1,384
0
20 Oct 2015
Semantic Image Segmentation via Deep Parsing Network
Semantic Image Segmentation via Deep Parsing Network
Ziwei Liu
Xiaoxiao Li
Ping Luo
Chen Change Loy
Xiaoou Tang
6
659
0
09 Sep 2015
Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical
  Volumetric Image Segmentation
Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical Volumetric Image Segmentation
Marijn F. Stollenga
Wonmin Byeon
Marcus Liwicki
Jürgen Schmidhuber
23
294
0
24 Jun 2015
Fast ConvNets Using Group-wise Brain Damage
Fast ConvNets Using Group-wise Brain Damage
V. Lebedev
Victor Lempitsky
AAML
44
447
0
08 Jun 2015
Caffe con Troll: Shallow Ideas to Speed Up Deep Learning
Caffe con Troll: Shallow Ideas to Speed Up Deep Learning
Stefan Hadjis
Firas Abuzaid
Ce Zhang
Christopher Ré
BDL
18
71
0
16 Apr 2015
Learning to Compare Image Patches via Convolutional Neural Networks
Learning to Compare Image Patches via Convolutional Neural Networks
Sergey Zagoruyko
N. Komodakis
SSL
19
1,434
0
14 Apr 2015
Automatic differentiation in machine learning: a survey
Automatic differentiation in machine learning: a survey
A. G. Baydin
Barak A. Pearlmutter
Alexey Radul
J. Siskind
PINN
AI4CE
ODL
75
2,750
0
20 Feb 2015
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Nicolas Vasilache
Jeff Johnson
Michaël Mathieu
Soumith Chintala
Serkan Piantino
Yann LeCun
21
346
0
24 Dec 2014
Deep Speech: Scaling up end-to-end speech recognition
Deep Speech: Scaling up end-to-end speech recognition
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
...
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
71
2,109
0
17 Dec 2014
Previous
12345