ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.06228
  4. Cited By
Training Very Deep Networks

Training Very Deep Networks

22 July 2015
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
ArXivPDFHTML

Papers citing "Training Very Deep Networks"

50 / 559 papers shown
Title
CrescendoNet: A Simple Deep Convolutional Neural Network with Ensemble
  Behavior
CrescendoNet: A Simple Deep Convolutional Neural Network with Ensemble Behavior
Xiang Zhang
Nishant Vishwamitra
Hongxin Hu
Feng Luo
17
2
0
30 Oct 2017
Label Embedding Network: Learning Label Representation for Soft Training
  of Deep Networks
Label Embedding Network: Learning Label Representation for Soft Training of Deep Networks
Xu Sun
Bingzhen Wei
Xuancheng Ren
Shuming Ma
32
40
0
28 Oct 2017
Malware Detection by Eating a Whole EXE
Malware Detection by Eating a Whole EXE
Edward Raff
Jon Barker
Jared Sylvester
Robert Brandon
Bryan Catanzaro
Charles K. Nicholas
38
538
0
25 Oct 2017
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional
  Networks with Guided Attention
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention
Hideyuki Tachibana
Katsuya Uenoyama
Shunsuke Aihara
33
265
0
24 Oct 2017
Deep Triphone Embedding Improves Phoneme Recognition
Deep Triphone Embedding Improves Phoneme Recognition
Mohit Yadav
V. Tyagi
17
2
0
22 Oct 2017
Attentive Convolution: Equipping CNNs with RNN-style Attention
  Mechanisms
Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms
Wenpeng Yin
Hinrich Schütze
33
41
0
02 Oct 2017
Deep Competitive Pathway Networks
Deep Competitive Pathway Networks
Jia-Ren Chang
Yonghao Chen
14
0
0
29 Sep 2017
Slim-DP: A Light Communication Data Parallelism for DNN
Slim-DP: A Light Communication Data Parallelism for DNN
Shizhao Sun
Wei-neng Chen
Jiang Bian
Xiaoguang Liu
Tie-Yan Liu
14
0
0
27 Sep 2017
EDEN: Evolutionary Deep Networks for Efficient Machine Learning
EDEN: Evolutionary Deep Networks for Efficient Machine Learning
Emmanuel Dufourq
Bruce A. Bassett
25
71
0
26 Sep 2017
EraseReLU: A Simple Way to Ease the Training of Deep Convolution Neural
  Networks
EraseReLU: A Simple Way to Ease the Training of Deep Convolution Neural Networks
Xuanyi Dong
Guoliang Kang
Kun Zhan
Yi Yang
16
16
0
22 Sep 2017
Language Modeling with Highway LSTM
Language Modeling with Highway LSTM
Gakuto Kurata
Bhuvana Ramabhadran
G. Saon
A. Sethy
AI4TS
21
38
0
19 Sep 2017
Simple Recurrent Units for Highly Parallelizable Recurrence
Simple Recurrent Units for Highly Parallelizable Recurrence
Tao Lei
Yu Zhang
Sida I. Wang
Huijing Dai
Yoav Artzi
LRM
50
271
0
08 Sep 2017
Squeeze-and-Excitation Networks
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
93
26,062
0
05 Sep 2017
Patterns versus Characters in Subword-aware Neural Language Modeling
Patterns versus Characters in Subword-aware Neural Language Modeling
Rustem Takhanov
Z. Assylbekov
19
2
0
02 Sep 2017
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories,
  Tools and Challenges for the Community
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community
John E. Ball
Derek T. Anderson
Chee Seng Chan
27
521
0
01 Sep 2017
DGM: A deep learning algorithm for solving partial differential
  equations
DGM: A deep learning algorithm for solving partial differential equations
Justin A. Sirignano
K. Spiliopoulos
AI4CE
14
2,026
0
24 Aug 2017
SMASH: One-Shot Model Architecture Search through HyperNetworks
SMASH: One-Shot Model Architecture Search through HyperNetworks
Andrew Brock
Theodore Lim
J. Ritchie
Nick Weston
25
761
0
17 Aug 2017
Graph Classification via Deep Learning with Virtual Nodes
Graph Classification via Deep Learning with Virtual Nodes
Trang Pham
T. Tran
K. Dam
Svetha Venkatesh
GNN
28
45
0
14 Aug 2017
Learning to Plan Chemical Syntheses
Learning to Plan Chemical Syntheses
Marwin H. S. Segler
Mike Preuss
M. Waller
41
1,356
0
14 Aug 2017
Early Improving Recurrent Elastic Highway Network
Early Improving Recurrent Elastic Highway Network
Hyunsin Park
Chang D. Yoo
24
5
0
14 Aug 2017
Radical-level Ideograph Encoder for RNN-based Sentiment Analysis of
  Chinese and Japanese
Radical-level Ideograph Encoder for RNN-based Sentiment Analysis of Chinese and Japanese
Y. Ke
M. Hagiwara
16
14
0
10 Aug 2017
Recent Trends in Deep Learning Based Natural Language Processing
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
35
2,824
0
09 Aug 2017
3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks
3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks
Chuhang Zou
Ersin Yumer
Jimei Yang
Duygu Ceylan
Derek Hoiem
3DV
19
207
0
04 Aug 2017
Convolution with Logarithmic Filter Groups for Efficient Shallow CNN
Convolution with Logarithmic Filter Groups for Efficient Shallow CNN
Taegyoung Lee
Wissam J. Baddar
S. T. Kim
Yong Man Ro
14
13
0
31 Jul 2017
Relative Depth Order Estimation Using Multi-scale Densely Connected
  Convolutional Networks
Relative Depth Order Estimation Using Multi-scale Densely Connected Convolutional Networks
Ruoxi Deng
Tianqi Zhao
Chunhua Shen
S. Liu
3DV
3DPC
36
4
0
25 Jul 2017
Syllable-aware Neural Language Models: A Failure to Beat Character-aware
  Ones
Syllable-aware Neural Language Models: A Failure to Beat Character-aware Ones
Z. Assylbekov
Rustem Takhanov
Bagdat Myrzakhmetov
Jonathan North Washington
38
17
0
20 Jul 2017
Orthogonal and Idempotent Transformations for Learning Deep Neural
  Networks
Orthogonal and Idempotent Transformations for Learning Deep Neural Networks
Jingdong Wang
Yajie Xing
Kexin Zhang
Cha Zhang
190
2
0
19 Jul 2017
Vision-based Real Estate Price Estimation
Vision-based Real Estate Price Estimation
Omid Poursaeed
Tomas Matera
Serge J. Belongie
11
117
0
18 Jul 2017
Efficient Architecture Search by Network Transformation
Efficient Architecture Search by Network Transformation
Han Cai
Tianyao Chen
Weinan Zhang
Yong Yu
Jun Wang
OOD
3DV
34
67
0
16 Jul 2017
Automatic Speech Recognition with Very Large Conversational Finnish and
  Estonian Vocabularies
Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies
Seppo Enarvi
Peter Smit
Sami Virpioja
M. Kurimo
26
37
0
13 Jul 2017
Interleaved Group Convolutions for Deep Neural Networks
Interleaved Group Convolutions for Deep Neural Networks
Ting Zhang
Guo-Jun Qi
Bin Xiao
Jingdong Wang
36
81
0
10 Jul 2017
A Deep Network with Visual Text Composition Behavior
A Deep Network with Visual Text Composition Behavior
Hongyu Guo
CoGe
24
3
0
05 Jul 2017
Multi-scale Multi-band DenseNets for Audio Source Separation
Multi-scale Multi-band DenseNets for Audio Source Separation
Naoya Takahashi
Yuki Mitsufuji
10
151
0
29 Jun 2017
Toward Computation and Memory Efficient Neural Network Acoustic Models
  with Binary Weights and Activations
Toward Computation and Memory Efficient Neural Network Acoustic Models with Binary Weights and Activations
Liang Lu
MQ
21
4
0
28 Jun 2017
When Neurons Fail
When Neurons Fail
El-Mahdi El-Mhamdi
R. Guerraoui
19
36
0
27 Jun 2017
GM-Net: Learning Features with More Efficiency
GM-Net: Learning Features with More Efficiency
Yujia Chen
Ce Li
24
6
0
21 Jun 2017
Advanced Steel Microstructural Classification by Deep Learning Methods
Advanced Steel Microstructural Classification by Deep Learning Methods
Seyedmajid Azimi
D. Britz
M. Engstler
Mario Fritz
F. Mücklich
13
359
0
20 Jun 2017
A Fully Trainable Network with RNN-based Pooling
A Fully Trainable Network with RNN-based Pooling
Shuai Li
W. Li
Chris Cook
Ce Zhu
Yanbo Gao
18
19
0
16 Jun 2017
Self-Normalizing Neural Networks
Self-Normalizing Neural Networks
Günter Klambauer
Thomas Unterthiner
Andreas Mayr
Sepp Hochreiter
88
2,487
0
08 Jun 2017
Learning Deep Representations for Scene Labeling with Semantic Context
  Guided Supervision
Learning Deep Representations for Scene Labeling with Semantic Context Guided Supervision
Zhe Wang
Hongsheng Li
Wanli Ouyang
Xiaogang Wang
SSL
20
2
0
08 Jun 2017
Non-Markovian Control with Gated End-to-End Memory Policy Networks
Non-Markovian Control with Gated End-to-End Memory Policy Networks
J. Perez
T. Silander
OffRL
13
6
0
31 May 2017
Deep Complex Networks
Deep Complex Networks
C. Trabelsi
O. Bilaniuk
Ying Zhang
Dmitriy Serdyuk
Sandeep Subramanian
J. F. Santos
Soroush Mehri
Negar Rostamzadeh
Yoshua Bengio
C. Pal
39
824
0
27 May 2017
Deriving Neural Architectures from Sequence and Graph Kernels
Deriving Neural Architectures from Sequence and Graph Kernels
Tao Lei
Wengong Jin
Regina Barzilay
Tommi Jaakkola
GNN
45
137
0
25 May 2017
An overview and comparative analysis of Recurrent Neural Networks for
  Short Term Load Forecasting
An overview and comparative analysis of Recurrent Neural Networks for Short Term Load Forecasting
F. Bianchi
E. Maiorino
Michael C. Kampffmeyer
A. Rizzi
Robert Jenssen
AI4TS
22
218
0
11 May 2017
Deep Neural Machine Translation with Linear Associative Unit
Deep Neural Machine Translation with Linear Associative Unit
Mingxuan Wang
Zhengdong Lu
Jie Zhou
Qun Liu
25
54
0
02 May 2017
Inception Recurrent Convolutional Neural Network for Object Recognition
Inception Recurrent Convolutional Neural Network for Object Recognition
Md. Zahangir Alom
Mahmudul Hasan
C. Yakopcic
T. Taha
39
86
0
25 Apr 2017
Residual Attention Network for Image Classification
Residual Attention Network for Image Classification
Fei Wang
Mengqing Jiang
Chao Qian
Shuo Yang
Cheng Li
Honggang Zhang
Xiaogang Wang
Xiaoou Tang
66
3,288
0
23 Apr 2017
Character-Word LSTM Language Models
Character-Word LSTM Language Models
Lyan Verwimp
J. Pelemans
Hugo Van hamme
P. Wambacq
30
53
0
10 Apr 2017
A Good Practice Towards Top Performance of Face Recognition: Transferred
  Deep Feature Fusion
A Good Practice Towards Top Performance of Face Recognition: Transferred Deep Feature Fusion
Lin Xiong
J. Karlekar
Jian-jun Zhao
Yi Cheng
Yan Xu
Jiashi Feng
Sugiri Pranata
Shengmei Shen
CVBM
36
32
0
03 Apr 2017
DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling
DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling
Lachlan Tychsen-Smith
L. Petersson
ObjD
27
113
0
30 Mar 2017
Previous
123...10111289
Next