Learning Longer-term Dependencies in RNNs with Auxiliary Losses

1 March 2018

Papers citing "Learning Longer-term Dependencies in RNNs with Auxiliary Losses"

50 / 92 papers shown

Title
A Diagonal Structured State Space Model on Loihi 2 for Efficient Streaming Sequence Processing Svea Marie Meyer Philipp Weidel Philipp Plank L. Campos-Macias Sumit Bam Shrestha Philipp Stratmann M. R 36 4 0 23 Sep 2024
Reparameterized Multi-Resolution Convolutions for Long Sequence Modelling Harry Jake Cunningham Giorgio Giannone Mingtian Zhang M. Deisenroth 28 0 0 18 Aug 2024
Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences Zicheng Liu Siyuan Li Li Wang Zedong Wang Yunfan Liu Stan Z. Li 33 7 0 12 Jun 2024
LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory Zicheng Liu Li Wang Siyuan Li Zedong Wang Haitao Lin Stan Z. Li VLM 27 4 0 17 Apr 2024
Parallelized Spatiotemporal Binding Gautam Singh Yue Wang Jiawei Yang B. Ivanovic Sungjin Ahn Marco Pavone Tong Che 48 1 0 26 Feb 2024
$HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction$ HyperZ $\cdot$ Z $\cdot$ W Operator Connects Slow-Fast Networks for Full Context Interaction Harvie Zhang 31 0 0 31 Jan 2024
Enhancing Molecular Property Prediction with Auxiliary Learning and Task-Specific Adaptation Vishal Dey Xia Ning AAML AI4CE 19 0 0 29 Jan 2024
Learning under Label Proportions for Text Classification Jatin Chauhan Xiaoxuan Wang Wei Wang 25 1 0 18 Oct 2023
Contraction Properties of the Global Workspace Primitive Michaela Ennis L. Kozachkov Jean-Jacques E. Slotine 22 0 0 02 Oct 2023
Parallelizing non-linear sequential models over the sequence length Yi Heng Lim Qi Zhu Joshua Selfridge M. F. Kasim 22 13 0 21 Sep 2023
Neural Machine Translation for Mathematical Formulae Felix Petersen M. Schubotz André Greiner-Petter Bela Gipp 15 7 0 25 May 2023
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation Liuyi Wang Chengju Liu Zongtao He Shu Li Qingqing Yan Huiyi Chen Qi Chen 21 9 0 19 May 2023
Sequence Modeling with Multiresolution Convolutional Memory Jiaxin Shi Ke Alexander Wang E. Fox 39 13 0 02 May 2023
SMPConv: Self-moving Point Representations for Continuous Convolution Sanghyeon Kim Eunbyung Park 3DPC 34 12 0 05 Apr 2023
End-to-End Speech Recognition: A Survey Rohit Prabhavalkar Takaaki Hori Tara N. Sainath Ralf Schluter Shinji Watanabe VLM 21 148 0 03 Mar 2023
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions Cheng Wang Carolin (Haas) Lawrence Mathias Niepert 15 3 0 10 Dec 2022
Gated Recurrent Neural Networks with Weighted Time-Delay Feedback N. Benjamin Erichson S. H. Lim Michael W. Mahoney 13 6 0 01 Dec 2022
Lempel-Ziv Networks Rebecca Saul Mohammad Mahmudul Alam John Hurwitz Edward Raff Tim Oates James Holt 13 2 0 23 Nov 2022
Liquid Structural State-Space Models Ramin Hasani Mathias Lechner Tsun-Hsuan Wang Makram Chahine Alexander Amini Daniela Rus AI4TS 97 95 0 26 Sep 2022
Image Classification using Sequence of Pixels Gajraj Kuldeep 6 0 0 23 Sep 2022
On the Parameterization and Initialization of Diagonal State Space Models Albert Gu Ankit Gupta Karan Goel Christopher Ré 14 297 0 23 Jun 2022
Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness Yun-Zhu Song Yi-Syuan Chen Hong-Han Shuai 35 20 0 04 May 2022
LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential Equations Jaehoon Lee Jinsung Jeon Sheo Yon Jhin Jihyeon Hyeong Jayoung Kim Minju Jo Kook Seungji Noseong Park AI4TS 8 2 0 19 Apr 2022
Path Development Network with Finite-dimensional Lie Group Representation Han Lou Siran Li Hao Ni 11 7 0 02 Apr 2022
MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks Yun He Xuening Feng Cheng Cheng Geng Ji Yunsong Guo James Caverlee 6 42 0 14 Mar 2022
Retrieval-Augmented Reinforcement Learning Anirudh Goyal A. Friesen Andrea Banino T. Weber Nan Rosemary Ke ... Michal Valko Simon Osindero Timothy Lillicrap N. Heess Charles Blundell OffRL 29 53 0 17 Feb 2022
Lerna: Transformer Architectures for Configuring Error Correction Tools for Short- and Long-Read Genome Sequencing Atul Sharma Pranjali Jain Ashraf Y. Mahgoub Zihan Zhou K. Mahadik Somali Chaterji 18 8 0 19 Dec 2021
Transfer Learning in Conversational Analysis through Reusing Preprocessing Data as Supervisors Joshua Y. Kim Tongliang Liu K. Yacef 14 0 0 02 Dec 2021
MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB Radar Tianyue Zheng Zhe Chen Shujie Zhang Chao Cai Jun-Jie Luo 10 93 0 16 Nov 2021
Efficiently Modeling Long Sequences with Structured State Spaces Albert Gu Karan Goel Christopher Ré 25 1,648 0 31 Oct 2021
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers Albert Gu Isys Johnson Karan Goel Khaled Kamal Saab Tri Dao Atri Rudra Christopher Ré 37 546 0 26 Oct 2021
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes David W. Romero Robert-Jan Bruintjes Jakub M. Tomczak Erik J. Bekkers Mark Hoogendoorn J. C. V. Gemert 80 81 0 15 Oct 2021
On the difficulty of learning chaotic dynamics with RNNs Jonas M. Mikhaeil Zahra Monfared Daniel Durstewitz 57 50 0 14 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs Tianwei Ni Benjamin Eysenbach Ruslan Salakhutdinov 8 103 0 11 Oct 2021
Combining Rules and Embeddings via Neuro-Symbolic AI for Knowledge Base Completion Prithviraj Sen B. W. Carvalho Ibrahim Abdelaziz Pavan Kapanipathi F. Luus Salim Roukos Alexander G. Gray NAI 29 6 0 16 Sep 2021
RNNs of RNNs: Recursive Construction of Stable Assemblies of Recurrent Neural Networks L. Kozachkov Michaela Ennis Jean-Jacques E. Slotine 14 18 0 16 Jun 2021
PILOT: Introducing Transformers for Probabilistic Sound Event Localization C. Schymura Benedikt T. Bönninghoff Tsubasa Ochiai Marc Delcroix K. Kinoshita Tomohiro Nakatani S. Araki D. Kolossa 17 24 0 07 Jun 2021
Improving Computer Generated Dialog with Auxiliary Loss Functions and Custom Evaluation Metrics T. Conley Jack St. Clair Jugal Kalita 39 1 0 04 Jun 2021
Warming up recurrent neural networks to maximise reachable multistability greatly improves learning Gaspard Lambrechts Florent De Geeter Nicolas Vecoven D. Ernst G. Drion 17 2 0 02 Jun 2021
Predictive Representation Learning for Language Modeling Qingfeng Lan Luke N. Kumar Martha White Alona Fyshe OffRL AI4TS 16 1 0 29 May 2021
On the Memory Mechanism of Tensor-Power Recurrent Models Hejia Qiu Chao Li Ying Weng Zhun Sun Xingyu He Qibin Zhao 8 6 0 02 Mar 2021
Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map Elmira Amirloo Mohsen Rohani Ershad Banijamali Jun-Jie Luo Pascal Poupart SSL 24 4 0 01 Mar 2021
Extremal learning: extremizing the output of a neural network in regression problems Zakaria Patel M. Rummel 17 4 0 06 Feb 2021
CKConv: Continuous Kernel Convolution For Sequential Data David W. Romero Anna Kuzina Erik J. Bekkers Jakub M. Tomczak Mark Hoogendoorn 20 123 0 04 Feb 2021
Deep Inertial Odometry with Accurate IMU Preintegration Rooholla Khorrambakht Chris Xiaoxuan Lu H. Damirchi Zhenghua Chen Zhengguo Li 13 1 0 18 Jan 2021
Taxonomy Completion via Triplet Matching Network Jieyu Zhang Xiangchen Song Ying Zeng Jiaze Chen Jiaming Shen Yuning Mao Lei Li 20 37 0 06 Jan 2021
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps Tri Dao N. Sohoni Albert Gu Matthew Eichhorn Amit Blonder Megan Leszczynski Atri Rudra Christopher Ré 17 43 0 29 Dec 2020
Simple or Complex? Learning to Predict Readability of Bengali Texts Susmoy Chakraborty Mir Tafseer Nayeem Wasi Uddin Ahmad 16 19 0 09 Dec 2020
Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings Deheng Ye Guibin Chen P. Zhao Fuhao Qiu Bo Yuan ... Liang Wang Tengfei Shi Qiang Fu Wei Yang Lanxiao Huang 24 48 0 25 Nov 2020
Language Through a Prism: A Spectral Approach for Multiscale Language Representations Alex Tamkin Dan Jurafsky Noah D. Goodman 21 42 0 09 Nov 2020