ResearchTrend.AI
On the Long-Term Memory of Deep Recurrent Networks
25 October 2017
Yoav Levine, Or Sharir, Alon Ziv, Amnon Shashua

Papers citing "On the Long-Term Memory of Deep Recurrent Networks"

10 / 10 papers shown:

1. A Tensor Decomposition Perspective on Second-order RNNs
   M. Lizaire, Michael Rizvi-Martel, Marawan Gamal Abdel Hameed, Guillaume Rabusseau
   07 Jun 2024

2. Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs
   Or Sharir, Anima Anandkumar
   27 Jul 2023

3. On the Ability of Graph Neural Networks to Model Interactions Between Vertices
   Noam Razin, Tom Verbin, Nadav Cohen
   29 Nov 2022

4. Transformer Vs. MLP-Mixer: Exponential Expressive Gap For NLP Problems
   D. Navon, A. Bronstein
   17 Aug 2022

5. High-Order Pooling for Graph Neural Networks with Tensor Decomposition
   Chenqing Hua, Guillaume Rabusseau, Jian Tang
   24 May 2022

6. Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks
   Noam Razin, Asaf Maman, Nadav Cohen
   27 Jan 2022

7. The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design
   Yoav Levine, Noam Wies, Daniel Jannai, D. Navon, Yedid Hoshen, Amnon Shashua
   09 Oct 2021

8. Which transformer architecture fits my data? A vocabulary bottleneck in self-attention
   Noam Wies, Yoav Levine, Daniel Jannai, Amnon Shashua
   09 May 2021

9. The Depth-to-Width Interplay in Self-Attention
   Yoav Levine, Noam Wies, Or Sharir, Hofit Bata, Amnon Shashua
   22 Jun 2020

10. Learning Relevant Features of Data with Multi-scale Tensor Networks
    Tayssir Doghri
    31 Dec 2017