2106.04279
Staircase Attention for Recurrent Processing of Sequences
8 June 2021
Da Ju
Stephen Roller
Sainbayar Sukhbaatar
Jason Weston
Papers citing "Staircase Attention for Recurrent Processing of Sequences"
4 / 4 papers shown
Title
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
39
1
0
01 Feb 2024
Recurrent Memory Transformer
Aydar Bulatov
Yuri Kuratov
Mikhail Burtsev
CLL
11
101
0
14 Jul 2022
Exploring Length Generalization in Large Language Models
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM
LRM
33
158
0
11 Jul 2022
Block-Recurrent Transformers
DeLesley S. Hutchins
Imanol Schlag
Yuhuai Wu
Ethan Dyer
Behnam Neyshabur
18
94
0
11 Mar 2022