arXiv: 2011.04006
Long Range Arena: A Benchmark for Efficient Transformers

8 November 2020
Yi Tay
Mostafa Dehghani
Samira Abnar
Songlin Yang
Dara Bahri
Philip Pham
J. Rao
Liu Yang
Sebastian Ruder
Donald Metzler
arXiv (abs) · PDF · HTML · HuggingFace (1 upvote) · GitHub (757★)

Papers citing "Long Range Arena: A Benchmark for Efficient Transformers"

50 / 571 papers shown

DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone
Vaibhav Singh, Oleksiy Ostapenko, Pierre-Andre Noel, Torsten Scholak
19 Nov 2025

Semantic Multiplexing
Mohammad Abdi, Francesca Meneghello, Francesco Restuccia
16 Nov 2025

Belief Net: A Filter-Based Framework for Learning Hidden Markov Models from Observations
Reginald Zhiyan Chen, Heng-Sheng Chang, P. Mehta
13 Nov 2025

BudgetMem: Learning Selective Memory Policies for Cost-Efficient Long-Context Processing in Language Models
Chandra Vamsi Krishna Alla, Harish Naidu Gaddam, Manohar Kommi
07 Nov 2025

EchoLSTM: A Self-Reflective Recurrent Network for Stabilizing Long-Range Memory
Prasanth K K, Shubham Sharma
03 Nov 2025

Hankel Singular Value Regularization for Highly Compressible State Space Models
Paul Schwerdtner, Jules Berman, Benjamin Peherstorfer
27 Oct 2025

A Deep State-Space Model Compression Method using Upper Bound on Output Error
Hiroki Sakamoto, Kazuhiro Sato
16 Oct 2025

Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024
Tuowei Wang, Kun Li, Zixu Hao, Donglin Bai, Ju Ren, Yaoxue Zhang, Ting Cao, M. Yang
12 Oct 2025

Task-Level Insights from Eigenvalues across Sequence Models
Rahel Rickenbach, Jelena Trisovic, A. Didier, Jerome Sieber, Melanie Zeilinger
10 Oct 2025

Design Principles for Sequence Models via Coefficient Dynamics
Jerome Sieber, Antonio Orvieto, Melanie Zeilinger, Carmen Amo Alonso
10 Oct 2025

Beyond independent component analysis: identifiability and algorithms
Alvaro Ribot, Anna Seigal, Piotr Zwiernik
08 Oct 2025

The End of Transformers? On Challenging Attention and the Rise of Sub-Quadratic Architectures
Alexander Fichtl, Jeremias Bohn, Josefin Kelber, Edoardo Mosca, Georg Groh
06 Oct 2025

RACE Attention: A Strictly Linear-Time Attention for Long-Sequence Training
Sahil Joshi, Agniva Chowdhury, Amar Kanakamedala, Ekam Singh, Evan Tu, Anshumali Shrivastava
05 Oct 2025

Wave-PDE Nets: Trainable Wave-Equation Layers as an Alternative to Attention
Harshil Vejendla
05 Oct 2025

The Curious Case of In-Training Compression of State Space Models
Makram Chahine, Philipp Nazari, Daniela Rus, T. Konstantin Rusch
03 Oct 2025

Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models
JingChuan Guan, T. Kubota, Yasuo Kuniyoshi, Kohei Nakajima
01 Oct 2025

Where to Add PDE Diffusion in Transformers
Yukun Zhang, Xueqing Zhou
27 Sep 2025

Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models
Aleksandar Terzić, Nicolas Menet, Michael Hersche, Thomas Hofmann, Abbas Rahimi
26 Sep 2025

Aligning Inductive Bias for Data-Efficient Generalization in State Space Models
Qiyu Chen, Guozhang Chen
25 Sep 2025

Myosotis: structured computation for attention like layer
Evgenii Egorov, Hanno Ackermann, Markus Nagel, H. Cai
24 Sep 2025

Mamba Modulation: On the Length Generalization of Mamba
Peng Lu, Jerry Huang, Qiuhao Zeng, X. Wang, Boxing Wang, Philippe Langlais, Yufei Cui
23 Sep 2025

An overview of neural architectures for self-supervised audio representation learning from masked spectrograms
Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan
23 Sep 2025

CogniLoad: A Synthetic Natural Language Reasoning Benchmark With Tunable Length, Intrinsic Difficulty, and Distractor Density
Daniel Kaiser, Arnoldo Frigessi, Ali Ramezani-Kebrya, Benjamin Ricaud
22 Sep 2025

Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling
Dehao Zhang, Malu Zhang, Shuai Wang, Jingya Wang, Wenjie Wei, Zeyu Ma, Guoqing Wang, Yang Yang, Haizhou Li
21 Sep 2025

Holographic Transformers for Complex-Valued Signal Processing: Integrating Phase Interference into Self-Attention
Enhao Huang, Zhiyu Zhang, Tianxiang Xu, Chunshu Xia, Kaichun Hu, Yuchen Yang, Tongtong Pan, Dong Dong, Zhan Qin
14 Sep 2025

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
Akshit Sinha, Arvindh Arun, Shashwat Goel, Steffen Staab, Jonas Geiping
11 Sep 2025

Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling
Rishiraj Acharya
30 Aug 2025

Uncovering the Spectral Bias in Diagonal State Space Models
Rubén Solozabal, Velibor Bojkovic, Hilal AlQuabeh, Kentaro Inui, Martin Takáč
28 Aug 2025

Revisiting associative recall in modern recurrent models
Destiny Okpekpe, Antonio Orvieto
26 Aug 2025

Small transformer architectures for task switching
International Conference on Artificial Neural Networks (ICANN), 2025
Claudius Gros
06 Aug 2025

Systolic Array-based Accelerator for Structured State-Space Models
Shiva Raja, Cansu Demirkiran, Aakash Sarkar, Milos Popovic, A. Joshi
29 Jul 2025

Modality Agnostic Efficient Long Range Encoder
T. Parag, Ahmed Elgammal
25 Jul 2025

SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models
Wonjun Jeong, Dongseok Kim, Taegkeun Whangbo
24 Jul 2025

Compression Method for Deep Diagonal State Space Model Based on $H^2$ Optimal Reduction
IEEE Control Systems Letters (L-CSS), 2025
Hiroki Sakamoto, Kazuhiro Sato
14 Jul 2025

A Quantile Regression Approach for Remaining Useful Life Estimation with State Space Models
Davide Frizzo, Francesco Borsatti, Gian Antonio Susto
20 Jun 2025

From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation
Zhihan Guo, Jiele Wu, Wenqian Cui, Yifei Zhang, Minda Hu, Yufei Wang, Irwin King
19 Jun 2025

SKOLR: Structured Koopman Operator Linear RNN for Time-Series Forecasting
Yitian Zhang, Liheng Ma, Antonios Valkanas, Boris N. Oreshkin, Mark Coates
17 Jun 2025

A Scalable Hybrid Training Approach for Recurrent Spiking Neural Networks
Maximilian Baronig, Yeganeh Bahariasl, Ozan Özdenizci, Robert Legenstein
17 Jun 2025

Scaling Algorithm Distillation for Continuous Control with Mamba
Samuel Beaussant, Mehdi Mounsif
16 Jun 2025

Revisiting Transformers with Insights from Image Filtering and Boosting
Laziz U. Abdullaev, Maksim Tkachenko, Tan M. Nguyen
12 Jun 2025

Uncovering the Computational Roles of Nonlinearity in Sequence Modeling Using Almost-Linear RNNs
Manuel Brenner, G. Koppe
09 Jun 2025

Improving the Efficiency of Long Document Classification using Sentence Ranking Approach
Prathamesh Kokate, Mitali Sarnaik, Manavi Khopade, Raviraj Joshi
08 Jun 2025

Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models
Z. Babaiee, Peyman M. Kiasari, Daniela Rus, Radu Grosu
06 Jun 2025

Numerical Investigation of Sequence Modeling Theory using Controllable Memory Functions
Haotian Jiang, Zeyu Bao, Shida Wang, Qianxiao Li
06 Jun 2025

Context Is Not Comprehension
Alex Pan, Mary-Anne Williams
05 Jun 2025

SiLIF: Structured State Space Model Dynamics and Parametrization for Spiking Neural Networks
Maxime Fabre, Lyubov Dudchenko, Emre Neftci
04 Jun 2025

Mamba Drafters for Speculative Decoding
Daewon Choi, Seunghyuk Oh, Saket Dingliwal, Jihoon Tack, Kyuyoung Kim, ..., Insu Han, Jinwoo Shin, Aram Galstyan, Shubham Katiyar, S. Bodapati
01 Jun 2025

Weight-Space Linear Recurrent Neural Networks
Roussel Desmond Nzoyem, Nawid Keshtmand, Enrique Crespo Fernandez, Idriss Tsayem, Raúl Santos-Rodríguez, David A.W. Barton, Tom Deakin
01 Jun 2025

Adaptive Two Sided Laplace Transforms: A Learnable, Interpretable, and Scalable Replacement for Self-Attention
Andrew Kiruluta
01 Jun 2025

ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations
Yiming Lei, Zhizheng Yang, Zeming Liu, Haitao Leng, Shaoguo Liu, Tingting Gao, Qingjie Liu, Yunhong Wang
29 May 2025

Page 1 of 12