Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.00144
Cited By
Learning Longer-term Dependencies in RNNs with Auxiliary Losses
1 March 2018
Trieu H. Trinh
Andrew M. Dai
Thang Luong
Quoc V. Le
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Longer-term Dependencies in RNNs with Auxiliary Losses"
42 / 92 papers shown
Title
Multivariate Temporal Autoencoder for Predictive Reconstruction of Deep Sequences
J. Aungiers
AI4TS
9
0
0
07 Oct 2020
HiPPO: Recurrent Memory with Optimal Polynomial Projections
Albert Gu
Tri Dao
Stefano Ermon
Atri Rudra
Christopher Ré
17
484
0
17 Aug 2020
Learning Transition Models with Time-delayed Causal Relations
Junchi Liang
Abdeslam Boularias
OffRL
12
3
0
04 Aug 2020
Principles and Algorithms for Forecasting Groups of Time Series: Locality and Globality
Pablo Montero-Manso
Rob J. Hyndman
AI4TS
11
133
0
02 Aug 2020
Distributed Associative Memory Network with Memory Refreshing Loss
Taewon Park
Inchul Choi
Minho Lee
CLL
9
6
0
21 Jul 2020
Attention Sequence to Sequence Model for Machine Remaining Useful Life Prediction
Mohamed Ragab
Zhenghua Chen
Min-man Wu
C. Kwoh
Ruqiang Yan
Xiaoli Li
9
5
0
20 Jul 2020
Auxiliary Learning by Implicit Differentiation
Aviv Navon
Idan Achituve
Haggai Maron
Gal Chechik
Ethan Fetaya
23
59
0
22 Jun 2020
Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding
Xinya Du
Claire Cardie
16
101
0
13 May 2020
Learning a Simple and Effective Model for Multi-turn Response Generation with Auxiliary Tasks
Yufan Zhao
Can Xu
Wei Yu Wu
Lei Yu
24
28
0
04 Apr 2020
Explainable Deep Relational Networks for Predicting Compound-Protein Affinities and Contacts
Mostafa Karimi
Di Wu
Zhangyang Wang
Yang Shen
27
46
0
29 Dec 2019
On the Initialization of Long Short-Term Memory Networks
Mostafa Mehdipour-Ghazi
Mads Nielsen
A. Pai
Marc Modat
M. Jorge Cardoso
Sebastien Ourselin
Lauge Sørensen
ODL
9
14
0
22 Dec 2019
Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling
Ruizhe Zhao
Brian K. Vogel
Tanvir Ahmed
Wayne Luk
17
37
0
14 Nov 2019
Compositional Generalization with Tree Stack Memory Units
Forough Arabshahi
Zhichu Lu
Pranay Mundra
Sameer Singh
Anima Anandkumar
9
10
0
05 Nov 2019
Improving the Gating Mechanism of Recurrent Neural Networks
Albert Gu
Çağlar Gülçehre
T. Paine
Matthew W. Hoffman
Razvan Pascanu
AI4CE
9
2
0
22 Oct 2019
Relaxed Softmax for learning from Positive and Unlabeled data
Ugo Tanielian
Flavian Vasile
13
10
0
17 Sep 2019
Deep Equilibrium Models
Shaojie Bai
J. Zico Kolter
V. Koltun
12
658
0
03 Sep 2019
Quantity doesn't buy quality syntax with neural language models
Marten van Schijndel
Aaron Mueller
Tal Linzen
13
68
0
31 Aug 2019
Calibration, Entropy Rates, and Memory in Language Models
M. Braverman
Xinyi Chen
Sham Kakade
Karthik Narasimhan
Cyril Zhang
Yi Zhang
9
38
0
11 Jun 2019
Pretraining Methods for Dialog Context Representation Learning
Shikib Mehri
E. Razumovskaia
Tiancheng Zhao
M. Eskénazi
14
84
0
02 Jun 2019
Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer
Yanshuai Cao
Peng-Tao Xu
9
2
0
28 May 2019
A Cross-Domain Transferable Neural Coherence Model
Peng-Tao Xu
H. Saghir
Jin Sung Kang
Teng Long
A. Bose
Yanshuai Cao
Jackie C.K. Cheung
6
46
0
28 May 2019
Population-based Global Optimisation Methods for Learning Long-term Dependencies with RNNs
Bryan Lim
S. Zohren
Stephen J. Roberts
6
2
0
23 May 2019
Quantifying Long Range Dependence in Language and User Behavior to improve RNNs
Francois Belletti
Minmin Chen
Ed H. Chi
AI4TS
6
21
0
23 May 2019
Optimizing Sequential Medical Treatments with Auto-Encoding Heuristic Search in POMDPs
Luchen Li
Matthieu Komorowski
Aldo A. Faisal
OffRL
16
13
0
17 May 2019
Efficient Optimization of Loops and Limits with Randomized Telescoping Sums
Alex Beatson
Ryan P. Adams
9
21
0
16 May 2019
Meta reinforcement learning as task inference
Jan Humplik
Alexandre Galashov
Leonard Hasenclever
Pedro A. Ortega
Yee Whye Teh
N. Heess
OffRL
18
127
0
15 May 2019
Neural-Attention-Based Deep Learning Architectures for Modeling Traffic Dynamics on Lane Graphs
Matthew A. Wright
Simon F. G. Ehlers
R. Horowitz
AI4CE
GNN
12
4
0
18 Apr 2019
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
Kuan Fang
Alexander Toshev
Li Fei-Fei
Silvio Savarese
OffRL
11
199
0
09 Mar 2019
Alternating Synthetic and Real Gradients for Neural Language Modeling
Fangxin Shang
Hao Zhang
16
1
0
27 Feb 2019
Dynamical Isometry and a Mean Field Theory of LSTMs and GRUs
D. Gilboa
B. Chang
Minmin Chen
Greg Yang
S. Schoenholz
Ed H. Chi
Jeffrey Pennington
34
39
0
25 Jan 2019
State-Regularized Recurrent Neural Networks
Cheng Wang
Mathias Niepert
18
39
0
25 Jan 2019
Reducing state updates via Gaussian-gated LSTMs
Matthew Thornton
Jithendar Anumula
Shih-Chii Liu
19
1
0
22 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
10
3,671
0
09 Jan 2019
Learning to Remember More with Less Memorization
Hung Le
T. Tran
Svetha Venkatesh
19
38
0
05 Jan 2019
Image-based Natural Language Understanding Using 2D Convolutional Neural Networks
Erinc Merdivan
Anastasios Vafeiadis
D. Kalatzis
S. Hanke
J. Kropf
...
Dimitrios Giakoumis
Dimitrios Tzovaras
Liming Luke Chen
R. Hamzaoui
M. Geist
VLM
17
2
0
24 Oct 2018
Trellis Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
15
145
0
15 Oct 2018
h-detach: Modifying the LSTM Gradient Towards Better Optimization
Devansh Arpit
Bhargav Kanuparthi
Giancarlo Kerg
Nan Rosemary Ke
Ioannis Mitliagkas
Yoshua Bengio
23
32
0
06 Oct 2018
General Value Function Networks
M. Schlegel
Andrew Jacobsen
Zaheer Abbas
Andrew Patterson
Adam White
Martha White
19
30
0
18 Jul 2018
IGLOO: Slicing the Features Space to Represent Sequences
Vsevolod Sourkov
VLM
18
5
0
09 Jul 2018
The challenge of realistic music generation: modelling raw audio at scale
Sander Dieleman
Aaron van den Oord
Karen Simonyan
13
184
0
26 Jun 2018
State-Denoised Recurrent Neural Networks
Michael C. Mozer
Denis Kazakov
Robert V. Lindsey
AI4TS
14
7
0
22 May 2018
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,923
0
17 Aug 2015
Previous
1
2