2402.04239
CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers
6 February 2024
Adjorn van Engelenhoven
Nicola Strisciuglio
Estefanía Talavera
Papers citing
"CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers"
3 / 3 papers shown
Title
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
239
2,554
0
04 May 2021
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
249
1,982
0
28 Jul 2020
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
214
7,687
0
17 Aug 2015