ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.00144
  4. Cited By
Learning Longer-term Dependencies in RNNs with Auxiliary Losses

Learning Longer-term Dependencies in RNNs with Auxiliary Losses

1 March 2018
Trieu H. Trinh
Andrew M. Dai
Thang Luong
Quoc V. Le
ArXivPDFHTML

Papers citing "Learning Longer-term Dependencies in RNNs with Auxiliary Losses"

50 / 92 papers shown
Title
A Diagonal Structured State Space Model on Loihi 2 for Efficient
  Streaming Sequence Processing
A Diagonal Structured State Space Model on Loihi 2 for Efficient Streaming Sequence Processing
Svea Marie Meyer
Philipp Weidel
Philipp Plank
L. Campos-Macias
Sumit Bam Shrestha
Philipp Stratmann
M. R
36
4
0
23 Sep 2024
Reparameterized Multi-Resolution Convolutions for Long Sequence
  Modelling
Reparameterized Multi-Resolution Convolutions for Long Sequence Modelling
Harry Jake Cunningham
Giorgio Giannone
Mingtian Zhang
M. Deisenroth
28
0
0
18 Aug 2024
Short-Long Convolutions Help Hardware-Efficient Linear Attention to
  Focus on Long Sequences
Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences
Zicheng Liu
Siyuan Li
Li Wang
Zedong Wang
Yunfan Liu
Stan Z. Li
33
7
0
12 Jun 2024
LongVQ: Long Sequence Modeling with Vector Quantization on Structured
  Memory
LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory
Zicheng Liu
Li Wang
Siyuan Li
Zedong Wang
Haitao Lin
Stan Z. Li
VLM
27
4
0
17 Apr 2024
Parallelized Spatiotemporal Binding
Parallelized Spatiotemporal Binding
Gautam Singh
Yue Wang
Jiawei Yang
B. Ivanovic
Sungjin Ahn
Marco Pavone
Tong Che
48
1
0
26 Feb 2024
HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full
  Context Interaction
HyperZ⋅\cdot⋅Z⋅\cdot⋅W Operator Connects Slow-Fast Networks for Full Context Interaction
Harvie Zhang
31
0
0
31 Jan 2024
Enhancing Molecular Property Prediction with Auxiliary Learning and
  Task-Specific Adaptation
Enhancing Molecular Property Prediction with Auxiliary Learning and Task-Specific Adaptation
Vishal Dey
Xia Ning
AAML
AI4CE
19
0
0
29 Jan 2024
Learning under Label Proportions for Text Classification
Learning under Label Proportions for Text Classification
Jatin Chauhan
Xiaoxuan Wang
Wei Wang
25
1
0
18 Oct 2023
Contraction Properties of the Global Workspace Primitive
Contraction Properties of the Global Workspace Primitive
Michaela Ennis
L. Kozachkov
Jean-Jacques E. Slotine
22
0
0
02 Oct 2023
Parallelizing non-linear sequential models over the sequence length
Parallelizing non-linear sequential models over the sequence length
Yi Heng Lim
Qi Zhu
Joshua Selfridge
M. F. Kasim
22
13
0
21 Sep 2023
Neural Machine Translation for Mathematical Formulae
Neural Machine Translation for Mathematical Formulae
Felix Petersen
M. Schubotz
André Greiner-Petter
Bela Gipp
15
7
0
25 May 2023
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For
  Vision-and-Language Navigation
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
Liuyi Wang
Chengju Liu
Zongtao He
Shu Li
Qingqing Yan
Huiyi Chen
Qi Chen
21
9
0
19 May 2023
Sequence Modeling with Multiresolution Convolutional Memory
Sequence Modeling with Multiresolution Convolutional Memory
Jiaxin Shi
Ke Alexander Wang
E. Fox
39
13
0
02 May 2023
SMPConv: Self-moving Point Representations for Continuous Convolution
SMPConv: Self-moving Point Representations for Continuous Convolution
Sanghyeon Kim
Eunbyung Park
3DPC
34
12
0
05 Apr 2023
End-to-End Speech Recognition: A Survey
End-to-End Speech Recognition: A Survey
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
21
148
0
03 Mar 2023
State-Regularized Recurrent Neural Networks to Extract Automata and
  Explain Predictions
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
Cheng Wang
Carolin (Haas) Lawrence
Mathias Niepert
15
3
0
10 Dec 2022
Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
N. Benjamin Erichson
S. H. Lim
Michael W. Mahoney
13
6
0
01 Dec 2022
Lempel-Ziv Networks
Lempel-Ziv Networks
Rebecca Saul
Mohammad Mahmudul Alam
John Hurwitz
Edward Raff
Tim Oates
James Holt
13
2
0
23 Nov 2022
Liquid Structural State-Space Models
Liquid Structural State-Space Models
Ramin Hasani
Mathias Lechner
Tsun-Hsuan Wang
Makram Chahine
Alexander Amini
Daniela Rus
AI4TS
97
95
0
26 Sep 2022
Image Classification using Sequence of Pixels
Image Classification using Sequence of Pixels
Gajraj Kuldeep
6
0
0
23 Sep 2022
On the Parameterization and Initialization of Diagonal State Space
  Models
On the Parameterization and Initialization of Diagonal State Space Models
Albert Gu
Ankit Gupta
Karan Goel
Christopher Ré
14
297
0
23 Jun 2022
Improving Multi-Document Summarization through Referenced Flexible
  Extraction with Credit-Awareness
Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness
Yun-Zhu Song
Yi-Syuan Chen
Hong-Han Shuai
35
20
0
04 May 2022
LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough
  Differential Equations
LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential Equations
Jaehoon Lee
Jinsung Jeon
Sheo Yon Jhin
Jihyeon Hyeong
Jayoung Kim
Minju Jo
Kook Seungji
Noseong Park
AI4TS
8
2
0
19 Apr 2022
Path Development Network with Finite-dimensional Lie Group
  Representation
Path Development Network with Finite-dimensional Lie Group Representation
Han Lou
Siran Li
Hao Ni
11
7
0
02 Apr 2022
MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient
  Magnitudes of Auxiliary Tasks
MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks
Yun He
Xuening Feng
Cheng Cheng
Geng Ji
Yunsong Guo
James Caverlee
6
42
0
14 Mar 2022
Retrieval-Augmented Reinforcement Learning
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
29
53
0
17 Feb 2022
Lerna: Transformer Architectures for Configuring Error Correction Tools
  for Short- and Long-Read Genome Sequencing
Lerna: Transformer Architectures for Configuring Error Correction Tools for Short- and Long-Read Genome Sequencing
Atul Sharma
Pranjali Jain
Ashraf Y. Mahgoub
Zihan Zhou
K. Mahadik
Somali Chaterji
18
8
0
19 Dec 2021
Transfer Learning in Conversational Analysis through Reusing
  Preprocessing Data as Supervisors
Transfer Learning in Conversational Analysis through Reusing Preprocessing Data as Supervisors
Joshua Y. Kim
Tongliang Liu
K. Yacef
14
0
0
02 Dec 2021
MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via
  Deep-Learning UWB Radar
MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB Radar
Tianyue Zheng
Zhe Chen
Shujie Zhang
Chao Cai
Jun-Jie Luo
10
93
0
16 Nov 2021
Efficiently Modeling Long Sequences with Structured State Spaces
Efficiently Modeling Long Sequences with Structured State Spaces
Albert Gu
Karan Goel
Christopher Ré
25
1,648
0
31 Oct 2021
Combining Recurrent, Convolutional, and Continuous-time Models with
  Linear State-Space Layers
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers
Albert Gu
Isys Johnson
Karan Goel
Khaled Kamal Saab
Tri Dao
Atri Rudra
Christopher Ré
37
546
0
26 Oct 2021
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel
  Sizes
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes
David W. Romero
Robert-Jan Bruintjes
Jakub M. Tomczak
Erik J. Bekkers
Mark Hoogendoorn
J. C. V. Gemert
80
81
0
15 Oct 2021
On the difficulty of learning chaotic dynamics with RNNs
On the difficulty of learning chaotic dynamics with RNNs
Jonas M. Mikhaeil
Zahra Monfared
Daniel Durstewitz
57
50
0
14 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
8
103
0
11 Oct 2021
Combining Rules and Embeddings via Neuro-Symbolic AI for Knowledge Base
  Completion
Combining Rules and Embeddings via Neuro-Symbolic AI for Knowledge Base Completion
Prithviraj Sen
B. W. Carvalho
Ibrahim Abdelaziz
Pavan Kapanipathi
F. Luus
Salim Roukos
Alexander G. Gray
NAI
29
6
0
16 Sep 2021
RNNs of RNNs: Recursive Construction of Stable Assemblies of Recurrent
  Neural Networks
RNNs of RNNs: Recursive Construction of Stable Assemblies of Recurrent Neural Networks
L. Kozachkov
Michaela Ennis
Jean-Jacques E. Slotine
14
18
0
16 Jun 2021
PILOT: Introducing Transformers for Probabilistic Sound Event
  Localization
PILOT: Introducing Transformers for Probabilistic Sound Event Localization
C. Schymura
Benedikt T. Bönninghoff
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Tomohiro Nakatani
S. Araki
D. Kolossa
17
24
0
07 Jun 2021
Improving Computer Generated Dialog with Auxiliary Loss Functions and
  Custom Evaluation Metrics
Improving Computer Generated Dialog with Auxiliary Loss Functions and Custom Evaluation Metrics
T. Conley
Jack St. Clair
Jugal Kalita
39
1
0
04 Jun 2021
Warming up recurrent neural networks to maximise reachable
  multistability greatly improves learning
Warming up recurrent neural networks to maximise reachable multistability greatly improves learning
Gaspard Lambrechts
Florent De Geeter
Nicolas Vecoven
D. Ernst
G. Drion
17
2
0
02 Jun 2021
Predictive Representation Learning for Language Modeling
Predictive Representation Learning for Language Modeling
Qingfeng Lan
Luke N. Kumar
Martha White
Alona Fyshe
OffRL
AI4TS
16
1
0
29 May 2021
On the Memory Mechanism of Tensor-Power Recurrent Models
On the Memory Mechanism of Tensor-Power Recurrent Models
Hejia Qiu
Chao Li
Ying Weng
Zhun Sun
Xingyu He
Qibin Zhao
8
6
0
02 Mar 2021
Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and
  Cost Map
Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map
Elmira Amirloo
Mohsen Rohani
Ershad Banijamali
Jun-Jie Luo
Pascal Poupart
SSL
24
4
0
01 Mar 2021
Extremal learning: extremizing the output of a neural network in
  regression problems
Extremal learning: extremizing the output of a neural network in regression problems
Zakaria Patel
M. Rummel
17
4
0
06 Feb 2021
CKConv: Continuous Kernel Convolution For Sequential Data
CKConv: Continuous Kernel Convolution For Sequential Data
David W. Romero
Anna Kuzina
Erik J. Bekkers
Jakub M. Tomczak
Mark Hoogendoorn
20
123
0
04 Feb 2021
Deep Inertial Odometry with Accurate IMU Preintegration
Deep Inertial Odometry with Accurate IMU Preintegration
Rooholla Khorrambakht
Chris Xiaoxuan Lu
H. Damirchi
Zhenghua Chen
Zhengguo Li
13
1
0
18 Jan 2021
Taxonomy Completion via Triplet Matching Network
Taxonomy Completion via Triplet Matching Network
Jieyu Zhang
Xiangchen Song
Ying Zeng
Jiaze Chen
Jiaming Shen
Yuning Mao
Lei Li
20
37
0
06 Jan 2021
Kaleidoscope: An Efficient, Learnable Representation For All Structured
  Linear Maps
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps
Tri Dao
N. Sohoni
Albert Gu
Matthew Eichhorn
Amit Blonder
Megan Leszczynski
Atri Rudra
Christopher Ré
17
43
0
29 Dec 2020
Simple or Complex? Learning to Predict Readability of Bengali Texts
Simple or Complex? Learning to Predict Readability of Bengali Texts
Susmoy Chakraborty
Mir Tafseer Nayeem
Wasi Uddin Ahmad
16
19
0
09 Dec 2020
Supervised Learning Achieves Human-Level Performance in MOBA Games: A
  Case Study of Honor of Kings
Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings
Deheng Ye
Guibin Chen
P. Zhao
Fuhao Qiu
Bo Yuan
...
Liang Wang
Tengfei Shi
Qiang Fu
Wei Yang
Lanxiao Huang
24
48
0
25 Nov 2020
Language Through a Prism: A Spectral Approach for Multiscale Language
  Representations
Language Through a Prism: A Spectral Approach for Multiscale Language Representations
Alex Tamkin
Dan Jurafsky
Noah D. Goodman
21
42
0
09 Nov 2020
12
Next