Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1606.01305
Cited By
v1
v2
v3
v4 (latest)
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations
International Conference on Learning Representations (ICLR), 2016
3 June 2016
David M. Krueger
Tegan Maharaj
János Kramár
Mohammad Pezeshki
Nicolas Ballas
Nan Rosemary Ke
Anirudh Goyal
Yoshua Bengio
Aaron Courville
C. Pal
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations"
50 / 180 papers shown
Title
Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting
Kangjie Chen
Yingji Zhong
Zhihao Li
Jiaqi Lin
Youyu Chen
Minghan Qin
Haoqian Wang
3DGS
181
0
0
18 Aug 2025
Y-Drop: A Conductance based Dropout for fully connected layers
Efthymios Georgiou
Georgios Paraskevopoulos
Alexandros Potamianos
175
0
0
11 Sep 2024
Advancing Spiking Neural Networks towards Multiscale Spatiotemporal Interaction Learning
Yimeng Shan
Malu Zhang
Rui-jie Zhu
Xuerui Qiu
Nhan Duy Truong
Haicheng Qu
270
11
0
22 May 2024
Exploiting Symmetric Temporally Sparse BPTT for Efficient RNN Training
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xi Chen
Chang Gao
Zuowen Wang
Longbiao Cheng
Sheng Zhou
Shih-Chii Liu
T. Delbruck
175
4
0
14 Dec 2023
RigLSTM: Recurrent Independent Grid LSTM for Generalizable Sequence Learning
Ziyu Wang
Wenhao Jiang
Zixuan Zhang
Wei Tang
Junchi Yan
209
0
0
03 Nov 2023
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
International Conference on Learning Representations (ICLR), 2023
Jeonghye Kim
Suyoung Lee
Woojun Kim
Young-Jin Sung
OffRL
272
26
0
04 Oct 2023
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
297
5
0
02 Oct 2023
Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Mohammad Zeineldeen
Albert Zeyer
Ralf Schluter
Hermann Ney
AuLLM
297
9
0
15 Sep 2023
A Comprehensive Overview of Large Language Models
ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Lin Wang
OffRL
794
1,158
0
12 Jul 2023
SeqAug: Sequential Feature Resampling as a modality agnostic augmentation method
Efthymios Georgiou
Alexandros Potamianos
164
2
0
03 May 2023
DropDim: A Regularization Method for Transformer Networks
IEEE Signal Processing Letters (IEEE SPL), 2023
Hao Zhang
Dan Qu
Kejia Shao
Xu Yang
185
14
0
20 Apr 2023
Optimum Output Long Short-Term Memory Cell for High-Frequency Trading Forecasting
Adamantios Ntakaris
Moncef Gabbouj
Juho Kanniainen
AI4TS
209
2
0
17 Apr 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
274
241
0
03 Mar 2023
A Review of the Role of Causality in Developing Trustworthy AI Systems
Niloy Ganguly
Dren Fazlija
Maryam Badar
M. Fisichella
Sandipan Sikdar
...
Koustav Rudra
Manolis Koubarakis
Gourab K. Patro
W. Z. E. Amri
Wolfgang Nejdl
CML
313
26
0
14 Feb 2023
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Cheng Wang
Carolin (Haas) Lawrence
Mathias Niepert
211
3
0
10 Dec 2022
Efficient Transformers with Dynamic Token Pooling
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Piotr Nawrot
J. Chorowski
Adrian Lañcucki
Edoardo Ponti
225
69
0
17 Nov 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
International Journal of Computer Vision (IJCV), 2022
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
266
0
0
20 Jul 2022
ZoDIAC: Zoneout Dropout Injection Attention Calculation
Zanyar Zohourianshahzadi
Terrance Boult
Jugal Kalita
225
0
0
28 Jun 2022
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shanghua Gao
Zhong-Yu Li
Qi Han
Ming-Ming Cheng
Liang Wang
276
41
0
14 Jun 2022
Efficient recurrent architectures through activity sparsity and sparse back-propagation through time
International Conference on Learning Representations (ICLR), 2022
Anand Subramoney
Khaleelulla Khan Nazeer
Mark Schöne
Christian Mayr
David Kappel
291
29
0
13 Jun 2022
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Neural Information Processing Systems (NeurIPS), 2022
Aniket Didolkar
Kshitij Gupta
Anirudh Goyal
Nitesh B. Gundavarapu
Alex Lamb
Nan Rosemary Ke
Yoshua Bengio
AI4CE
434
21
0
30 May 2022
A Survey on Dropout Methods and Experimental Verification in Recommendation
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Yongqian Li
Weizhi Ma
C. L. Philip Chen
Hao Fei
Yiqun Liu
Shaoping Ma
Yue Yang
294
16
0
05 Apr 2022
Look Backward and Forward: Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation
Xuan Zhang
Libin Shen
Disheng Pan
Liangguo Wang
Yanjun Miao
183
1
0
10 Mar 2022
Improving End-to-End Models for Set Prediction in Spoken Language Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
H. Kuo
Zoltán Tüske
Samuel Thomas
Brian Kingsbury
G. Saon
124
0
0
28 Jan 2022
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention
Artem Gorodetskii
Ivan Ozhiganov
174
4
0
25 Jan 2022
Sparse-Dyn: Sparse Dynamic Graph Multi-representation Learning via Event-based Sparse Temporal Attention Network
International Journal of Intelligent Systems (IJIS), 2022
Yan Pang
Chao Liu
305
10
0
04 Jan 2022
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Alexandra Vioni
Myrsini Christidou
Nikolaos Ellinas
G. Vamvoukakis
Panos Kakoulidis
Taehoon Kim
June Sig Sung
Hyoungmin Park
Aimilios Chalamandaris
Pirros Tsiakoulis
130
12
0
19 Nov 2021
Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
K. Markopoulos
Nikolaos Ellinas
Alexandra Vioni
Myrsini Christidou
Panos Kakoulidis
...
Georgia Maniati
June Sig Sung
Hyoungmin Park
Pirros Tsiakoulis
Aimilios Chalamandaris
143
2
0
17 Nov 2021
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Nikolaos Ellinas
G. Vamvoukakis
K. Markopoulos
Aimilios Chalamandaris
Georgia Maniati
Panos Kakoulidis
S. Raptis
June Sig Sung
Hyoungmin Park
Pirros Tsiakoulis
194
39
0
17 Nov 2021
Meta-Forecasting by combining Global Deep Representations with Local Adaptation
Riccardo Grazzi
Valentin Flunkert
David Salinas
Tim Januschowski
Matthias Seeger
Cédric Archambeau
AI4TS
AI4CE
178
7
0
05 Nov 2021
Preventing posterior collapse in variational autoencoders for text generation via decoder regularization
Alban Petit
Caio Corro
DRL
213
3
0
28 Oct 2021
Long Expressive Memory for Sequence Modeling
International Conference on Learning Representations (ICLR), 2021
T. Konstantin Rusch
Siddhartha Mishra
N. Benjamin Erichson
Michael W. Mahoney
AI4TS
435
57
0
10 Oct 2021
ChiNet: Deep Recurrent Convolutional Learning for Multimodal Spacecraft Pose Estimation
IEEE Transactions on Aerospace and Electronic Systems (T-AES), 2021
Duarte Rondao
Nabil Aouf
M.A. Richardson
3DPC
154
20
0
23 Aug 2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation
International Conference on Machine Learning (ICML), 2021
Ye Jia
Michelle Tadmor Ramanovich
Tal Remez
Roi Pomerantz
416
93
0
19 Jul 2021
Discrete-Valued Neural Communication
Dianbo Liu DianboLiu
Alex Lamb
Kenji Kawaguchi
Anirudh Goyal
Chen Sun
Michael C. Mozer
Yoshua Bengio
226
53
0
06 Jul 2021
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training
Neural Information Processing Systems (NeurIPS), 2021
Anup Sarma
Sonali Singh
Huaipan Jiang
Rui Zhang
M. Kandemir
Chita R. Das
81
1
0
22 Jun 2021
Recurrent Neural Network from Adder's Perspective: Carry-lookahead RNN
Neural Networks (NN), 2021
Haowei Jiang
Fei-wei Qin
Jin Cao
Yong Peng
Yanli Shao
LRM
ODL
130
51
0
22 Jun 2021
Adaptive Low-Rank Regularization with Damping Sequences to Restrict Lazy Weights in Deep Networks
Mohammad Mahdi Bejani
M. Ghatee
AI4CE
68
0
0
17 Jun 2021
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Interspeech (Interspeech), 2021
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
Najim Dehak
William Chan
DiffM
197
97
0
17 Jun 2021
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
183
53
0
03 May 2021
Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models
Interspeech (Interspeech), 2021
Mohammad Zeineldeen
Aleksandr Glushko
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
AuLLM
176
44
0
12 Apr 2021
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures
Automatic Speech Recognition & Understanding (ASRU), 2021
Nick Rossenbach
Mohammad Zeineldeen
Benedikt Hilmes
Ralf Schluter
Hermann Ney
170
12
0
12 Apr 2021
UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Zhen Wu
Lijun Wu
Qi Meng
Ziheng Lu
Shufang Xie
Tao Qin
Xinyu Dai
Tie-Yan Liu
200
25
0
11 Apr 2021
Librispeech Transducer Model with Internal Language Model Prior Correction
Interspeech (Interspeech), 2021
Albert Zeyer
André Merboldt
Wilfried Michel
Ralf Schluter
Hermann Ney
134
34
0
07 Apr 2021
Noise Injection-based Regularization for Point Cloud Processing
Xiao Zang
Yi Xie
Siyu Liao
Jie Chen
Bo Yuan
3DPC
122
3
0
28 Mar 2021
LocalDrop: A Hybrid Regularization for Deep Neural Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Ziqing Lu
Chang Xu
Bo Du
Takashi Ishida
Guang Dai
Masashi Sugiyama
177
17
0
01 Mar 2021
Zero Training Overhead Portfolios for Learning to Solve Combinatorial Problems
Yiwei Bai
Wenting Zhao
Daniel Schwalbe-Koda
198
1
0
05 Feb 2021
Distilling Large Language Models into Tiny and Effective Students using pQRNN
P. Kaliamoorthi
Aditya Siddhant
Edward Li
Melvin Johnson
MQ
131
18
0
21 Jan 2021
Sequential Deep Learning for Credit Risk Monitoring with Tabular Financial Data
Jillian M. Clements
Di Xu
N. Yousefi
Dmitry Efimov
166
51
0
30 Dec 2020
Regularizing Recurrent Neural Networks via Sequence Mixup
Armin Karamzade
Amir Najafi
S. Motahari
118
0
0
27 Nov 2020
1
2
3
4
Next