Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations

International Conference on Learning Representations (ICLR), 2016
3 June 2016
David M. Krueger
Tegan Maharaj
János Kramár
Mohammad Pezeshki
Nicolas Ballas
Nan Rosemary Ke
Anirudh Goyal
Yoshua Bengio
Aaron Courville
C. Pal
arXiv:1606.01305

Papers citing "Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations"

50 / 180 papers shown
Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting
Kangjie Chen
Yingji Zhong
Zhihao Li
Jiaqi Lin
Youyu Chen
Minghan Qin
Haoqian Wang
3DGS
189
0
0
18 Aug 2025
Y-Drop: A Conductance based Dropout for fully connected layers
Efthymios Georgiou
Georgios Paraskevopoulos
Alexandros Potamianos
188
0
0
11 Sep 2024
Advancing Spiking Neural Networks towards Multiscale Spatiotemporal Interaction Learning
Yimeng Shan
Malu Zhang
Rui-jie Zhu
Xuerui Qiu
Nhan Duy Truong
Haicheng Qu
289
11
0
22 May 2024
Exploiting Symmetric Temporally Sparse BPTT for Efficient RNN Training
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xi Chen
Chang Gao
Zuowen Wang
Longbiao Cheng
Sheng Zhou
Shih-Chii Liu
T. Delbruck
199
4
0
14 Dec 2023
RigLSTM: Recurrent Independent Grid LSTM for Generalizable Sequence Learning
Ziyu Wang
Wenhao Jiang
Zixuan Zhang
Wei Tang
Junchi Yan
213
0
0
03 Nov 2023
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
International Conference on Learning Representations (ICLR), 2023
Jeonghye Kim
Suyoung Lee
Woojun Kim
Young-Jin Sung
OffRL
300
26
0
04 Oct 2023
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
312
5
0
02 Oct 2023
Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Mohammad Zeineldeen
Albert Zeyer
Ralf Schluter
Hermann Ney
AuLLM
306
9
0
15 Sep 2023
A Comprehensive Overview of Large Language Models
ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Lin Wang
OffRL
854
1,173
0
12 Jul 2023
SeqAug: Sequential Feature Resampling as a modality agnostic augmentation method
Efthymios Georgiou
Alexandros Potamianos
164
2
0
03 May 2023
DropDim: A Regularization Method for Transformer Networks
IEEE Signal Processing Letters (IEEE SPL), 2023
Hao Zhang
Dan Qu
Kejia Shao
Xu Yang
190
14
0
20 Apr 2023
Optimum Output Long Short-Term Memory Cell for High-Frequency Trading Forecasting
Adamantios Ntakaris
Moncef Gabbouj
Juho Kanniainen
AI4TS
209
2
0
17 Apr 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
285
243
0
03 Mar 2023
A Review of the Role of Causality in Developing Trustworthy AI Systems
Niloy Ganguly
Dren Fazlija
Maryam Badar
M. Fisichella
Sandipan Sikdar
...
Koustav Rudra
Manolis Koubarakis
Gourab K. Patro
W. Z. E. Amri
Wolfgang Nejdl
CML
321
26
0
14 Feb 2023
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Cheng Wang
Carolin (Haas) Lawrence
Mathias Niepert
215
3
0
10 Dec 2022
Efficient Transformers with Dynamic Token Pooling
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Piotr Nawrot
J. Chorowski
Adrian Łańcucki
Edoardo Ponti
238
69
0
17 Nov 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
International Journal of Computer Vision (IJCV), 2022
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
289
0
0
20 Jul 2022
ZoDIAC: Zoneout Dropout Injection Attention Calculation
Zanyar Zohourianshahzadi
Terrance Boult
Jugal Kalita
245
0
0
28 Jun 2022
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shanghua Gao
Zhong-Yu Li
Qi Han
Ming-Ming Cheng
Liang Wang
299
41
0
14 Jun 2022
Efficient recurrent architectures through activity sparsity and sparse back-propagation through time
International Conference on Learning Representations (ICLR), 2022
Anand Subramoney
Khaleelulla Khan Nazeer
Mark Schöne
Christian Mayr
David Kappel
327
29
0
13 Jun 2022
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Neural Information Processing Systems (NeurIPS), 2022
Aniket Didolkar
Kshitij Gupta
Anirudh Goyal
Nitesh B. Gundavarapu
Alex Lamb
Nan Rosemary Ke
Yoshua Bengio
AI4CE
450
21
0
30 May 2022
A Survey on Dropout Methods and Experimental Verification in Recommendation
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Yongqian Li
Weizhi Ma
C. L. Philip Chen
Hao Fei
Yiqun Liu
Shaoping Ma
Yue Yang
300
16
0
05 Apr 2022
Look Backward and Forward: Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation
Xuan Zhang
Libin Shen
Disheng Pan
Liangguo Wang
Yanjun Miao
187
1
0
10 Mar 2022
Improving End-to-End Models for Set Prediction in Spoken Language Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
H. Kuo
Zoltán Tüske
Samuel Thomas
Brian Kingsbury
G. Saon
128
0
0
28 Jan 2022
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention
Artem Gorodetskii
Ivan Ozhiganov
182
4
0
25 Jan 2022
Sparse-Dyn: Sparse Dynamic Graph Multi-representation Learning via Event-based Sparse Temporal Attention Network
International Journal of Intelligent Systems (IJIS), 2022
Yan Pang
Chao Liu
329
11
0
04 Jan 2022
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Alexandra Vioni
Myrsini Christidou
Nikolaos Ellinas
G. Vamvoukakis
Panos Kakoulidis
Taehoon Kim
June Sig Sung
Hyoungmin Park
Aimilios Chalamandaris
Pirros Tsiakoulis
150
12
0
19 Nov 2021
Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
K. Markopoulos
Nikolaos Ellinas
Alexandra Vioni
Myrsini Christidou
Panos Kakoulidis
...
Georgia Maniati
June Sig Sung
Hyoungmin Park
Pirros Tsiakoulis
Aimilios Chalamandaris
143
2
0
17 Nov 2021
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Nikolaos Ellinas
G. Vamvoukakis
K. Markopoulos
Aimilios Chalamandaris
Georgia Maniati
Panos Kakoulidis
S. Raptis
June Sig Sung
Hyoungmin Park
Pirros Tsiakoulis
194
39
0
17 Nov 2021
Meta-Forecasting by combining Global Deep Representations with Local Adaptation
Riccardo Grazzi
Valentin Flunkert
David Salinas
Tim Januschowski
Matthias Seeger
Cédric Archambeau
AI4TS
AI4CE
193
7
0
05 Nov 2021
Preventing posterior collapse in variational autoencoders for text generation via decoder regularization
Alban Petit
Caio Corro
DRL
214
3
0
28 Oct 2021
Long Expressive Memory for Sequence Modeling
International Conference on Learning Representations (ICLR), 2021
T. Konstantin Rusch
Siddhartha Mishra
N. Benjamin Erichson
Michael W. Mahoney
AI4TS
443
57
0
10 Oct 2021
ChiNet: Deep Recurrent Convolutional Learning for Multimodal Spacecraft Pose Estimation
IEEE Transactions on Aerospace and Electronic Systems (T-AES), 2021
Duarte Rondao
Nabil Aouf
M.A. Richardson
3DPC
170
20
0
23 Aug 2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation
International Conference on Machine Learning (ICML), 2021
Ye Jia
Michelle Tadmor Ramanovich
Tal Remez
Roi Pomerantz
416
93
0
19 Jul 2021
Discrete-Valued Neural Communication
Dianbo Liu
Alex Lamb
Kenji Kawaguchi
Anirudh Goyal
Chen Sun
Michael C. Mozer
Yoshua Bengio
260
53
0
06 Jul 2021
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training
Neural Information Processing Systems (NeurIPS), 2021
Anup Sarma
Sonali Singh
Huaipan Jiang
Rui Zhang
M. Kandemir
Chita R. Das
89
1
0
22 Jun 2021
Recurrent Neural Network from Adder's Perspective: Carry-lookahead RNN
Neural Networks (NN), 2021
Haowei Jiang
Fei-wei Qin
Jin Cao
Yong Peng
Yanli Shao
LRM
ODL
143
51
0
22 Jun 2021
Adaptive Low-Rank Regularization with Damping Sequences to Restrict Lazy Weights in Deep Networks
Mohammad Mahdi Bejani
M. Ghatee
AI4CE
68
0
0
17 Jun 2021
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Interspeech (Interspeech), 2021
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
Najim Dehak
William Chan
DiffM
213
97
0
17 Jun 2021
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
183
53
0
03 May 2021
Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models
Interspeech (Interspeech), 2021
Mohammad Zeineldeen
Aleksandr Glushko
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
AuLLM
180
44
0
12 Apr 2021
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures
Automatic Speech Recognition & Understanding (ASRU), 2021
Nick Rossenbach
Mohammad Zeineldeen
Benedikt Hilmes
Ralf Schluter
Hermann Ney
170
12
0
12 Apr 2021
UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Zhen Wu
Lijun Wu
Qi Meng
Ziheng Lu
Shufang Xie
Tao Qin
Xinyu Dai
Tie-Yan Liu
201
25
0
11 Apr 2021
Librispeech Transducer Model with Internal Language Model Prior Correction
Interspeech (Interspeech), 2021
Albert Zeyer
André Merboldt
Wilfried Michel
Ralf Schluter
Hermann Ney
134
34
0
07 Apr 2021
Noise Injection-based Regularization for Point Cloud Processing
Xiao Zang
Yi Xie
Siyu Liao
Jie Chen
Bo Yuan
3DPC
122
3
0
28 Mar 2021
LocalDrop: A Hybrid Regularization for Deep Neural Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Ziqing Lu
Chang Xu
Bo Du
Takashi Ishida
Guang Dai
Masashi Sugiyama
177
17
0
01 Mar 2021
Zero Training Overhead Portfolios for Learning to Solve Combinatorial Problems
Yiwei Bai
Wenting Zhao
Daniel Schwalbe-Koda
218
1
0
05 Feb 2021
Distilling Large Language Models into Tiny and Effective Students using pQRNN
P. Kaliamoorthi
Aditya Siddhant
Edward Li
Melvin Johnson
MQ
135
18
0
21 Jan 2021
Sequential Deep Learning for Credit Risk Monitoring with Tabular Financial Data
Jillian M. Clements
Di Xu
N. Yousefi
Dmitry Efimov
174
51
0
30 Dec 2020
Regularizing Recurrent Neural Networks via Sequence Mixup
Armin Karamzade
Amir Najafi
S. Motahari
118
0
0
27 Nov 2020