Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations
International Conference on Learning Representations (ICLR), 2017
3 June 2016
David M. Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Aaron Courville, C. Pal

Papers citing "Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations"

50 of 180 papers shown
Stroke-based sketched symbol reconstruction and segmentation
Kurmanbek Kaiyrbekov, M. Sezgin
10 Jan 2019

Learning latent representations for style control and transfer in end-to-end speech synthesis
Ya-Jie Zhang, Shifeng Pan, Lei He, Zhenhua Ling
11 Dec 2018

Drop-Activation: Implicit Parameter Reduction and Harmonic Regularization
Senwei Liang, Y. Khoo, Haizhao Yang
14 Nov 2018

Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series
Qiang Zhang, Kyle Birkeland, Yaodong Yang, Yixiao Liu
08 Nov 2018

Cycle-consistency training for end-to-end speech recognition
Takaaki Hori, Ramón Fernández Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux
02 Nov 2018

DropBlock: A regularization method for convolutional networks
Golnaz Ghiasi, Tsung-Yi Lin, Quoc V. Le
30 Oct 2018
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language
Yusuke Yasuda, Xin Wang, Shinji Takaki, Junichi Yamagishi
29 Oct 2018

Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Jing-Xuan Zhang, Zhenhua Ling, Li-Juan Liu, Yuan Jiang, Lirong Dai
16 Oct 2018

Dropout as a Structured Shrinkage Prior
Eric T. Nalisnick, José Miguel Hernández-Lobato, Padhraic Smyth
09 Oct 2018

h-detach: Modifying the LSTM Gradient Towards Better Optimization
Devansh Arpit, Bhargav Kanuparthi, Giancarlo Kerg, Nan Rosemary Ke, Alexia Jolicoeur-Martineau, Yoshua Bengio
06 Oct 2018

Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis
Yu-An Chung, Yuxuan Wang, Wei-Ning Hsu, Yu Zhang, RJ Skerry-Ryan
30 Aug 2018

Dropout with Tabu Strategy for Regularizing Deep Neural Networks
Zongjie Ma, A. Sattar, Jun Zhou, Qingliang Chen, Kaile Su
29 Aug 2018
Neural Architecture Optimization
Renqian Luo, Fei Tian, Tao Qin, Enhong Chen, Tie-Yan Liu
22 Aug 2018

Improved Language Modeling by Decoding the Past
Siddhartha Brahma
14 Aug 2018

Confidence penalty, annealing Gaussian noise and zoneout for biLSTM-CRF networks for named entity recognition
Antonio Jimeno Yepes
13 Aug 2018

Character-Level Language Modeling with Deeper Self-Attention
Rami Al-Rfou, Dokook Choe, Noah Constant, Mandy Guo, Llion Jones
09 Aug 2018

Back-Translation-Style Data Augmentation for End-to-End ASR
Tomoki Hayashi, Shinji Watanabe, Yu Zhang, Tomoki Toda, Takaaki Hori, Ramón Fernández Astudillo, K. Takeda
28 Jul 2018

Effectiveness of Scaled Exponentially-Regularized Linear Units (SERLUs)
Guoqiang Zhang, Hao Li
26 Jul 2018

Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar, Jibon Naher
21 Jul 2018

Recurrent DNNs and its Ensembles on the TIMIT Phone Recognition Task
Jan Vaněk, Josef Michálek, J. Psutka
19 Jun 2018
Towards Binary-Valued Gates for Robust LSTM Training
Zhuohan Li, Di He, Fei Tian, Wei-neng Chen, Tao Qin, Liwei Wang, Tie-Yan Liu
08 Jun 2018

Efficient Full-Matrix Adaptive Regularization
Naman Agarwal, Brian Bullins, Xinyi Chen, Elad Hazan, Karan Singh, Cyril Zhang, Yi Zhang
08 Jun 2018

Grow and Prune Compact, Fast, and Accurate LSTMs
Xiaoliang Dai, Hongxu Yin, N. Jha
30 May 2018

Highway State Gating for Recurrent Highway Networks: improving information flow through time
Ron Shoham, Haim Permuter
23 May 2018

Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad, Laurent Besacier, Jakob Verbeek
14 May 2018

Noisin: Unbiased Regularization for Recurrent Neural Networks
Adji Bousso Dieng, Rajesh Ranganath, Jaan Altosaar, David M. Blei
03 May 2018

The unreasonable effectiveness of the forget gate
J. Westhuizen, Joan Lasenby
13 Apr 2018

Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Xinpeng Chen, Lin Ma, Wenhao Jiang, Jian Yao, Wen Liu
30 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang, Daisy Stanton, Yu Zhang, RJ Skerry-Ryan, Eric Battenberg, Joel Shor, Y. Xiao, Fei Ren, Ye Jia, Rif A. Saurous
23 Mar 2018

An Analysis of Neural Language Modeling at Multiple Scales
Stephen Merity, N. Keskar, R. Socher
22 Mar 2018

Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN
Shuai Li, W. Li, Chris Cook, Ce Zhu, Yanbo Gao
13 Mar 2018

Flipout: Efficient Pseudo-Independent Weight Perturbations on Mini-Batches
International Conference on Learning Representations (ICLR), 2018
Yeming Wen, Paul Vicol, Jimmy Ba, Dustin Tran, Roger C. Grosse
12 Mar 2018

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai, J. Zico Kolter, V. Koltun
04 Mar 2018

Nested LSTMs
Joel Ruben Antony Moniz, David M. Krueger
31 Jan 2018

Scalable and accurate deep learning for electronic health records
A. Rajkomar, Eyal Oren, Kai Chen, Andrew M. Dai, Nissan Hajaj, ..., A. Butte, M. Howell, Claire Cui, Greg S. Corrado, Jeffrey Dean
24 Jan 2018
Recent Advances in Recurrent Neural Networks
Hojjat Salehinejad, Sharan Sankar, Joseph Barfett, E. Colak, S. Valaee
29 Dec 2017

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen, Ruoming Pang, Ron J. Weiss, M. Schuster, Navdeep Jaitly, ..., Yuxuan Wang, RJ Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu
16 Dec 2017

Variational Bi-LSTMs
Samira Shabanian, Devansh Arpit, Adam Trischler, Yoshua Bengio
15 Nov 2017

Fine-tuning Tree-LSTM for phrase-level sentiment classification on a Polish dependency treebank. Submission to PolEval task 2
Tomasz Korbak, Paulina Zak
03 Nov 2017

Neural Language Modeling by Jointly Learning Syntax and Lexicon
Songlin Yang, Zhouhan Lin, Chin-Wei Huang, Aaron Courville
02 Nov 2017

Fraternal Dropout
Konrad Zolna, Devansh Arpit, Dendi Suhubdy, Yoshua Bengio
31 Oct 2017

Rotational Unit of Memory
International Conference on Learning Representations (ICLR), 2017
Rumen Dangovski, L. Jing, Marin Soljacic
26 Oct 2017

Dilated Recurrent Neural Networks
Shiyu Chang, Yang Zhang, Wei Han, Mo Yu, Xiaoxiao Guo, Wei Tan, Xiaodong Cui, Michael Witbrock, M. Hasegawa-Johnson, Thomas S. Huang
05 Oct 2017
Shifting Mean Activation Towards Zero with Bipolar Activation Functions
L. Eidnes, Arild Nøkland
12 Sep 2017

Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Victor Campos, Brendan Jou, Xavier Giró-i-Nieto, Jordi Torres, Shih-Fu Chang
22 Aug 2017

Twin Networks: Matching the Future for Sequence Generation
Dmitriy Serdyuk, Nan Rosemary Ke, Alessandro Sordoni, Adam Trischler, C. Pal, Yoshua Bengio
22 Aug 2017

Regularizing and Optimizing LSTM Language Models
International Conference on Learning Representations (ICLR), 2017
Stephen Merity, N. Keskar, R. Socher
07 Aug 2017

Revisiting Activation Regularization for Language RNNs
Stephen Merity, Bryan McCann, R. Socher
03 Aug 2017

Bayesian Sparsification of Recurrent Neural Networks
E. Lobacheva, Nadezhda Chirkova, Dmitry Vetrov
31 Jul 2017

Dual Rectified Linear Units (DReLUs): A Replacement for Tanh Activation Functions in Quasi-Recurrent Neural Networks
Fréderic Godin, Jonas Degrave, J. Dambre, W. D. Neve
25 Jul 2017
Page 3 of 4