Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.09193
Cited By
Smart Bird: Learnable Sparse Attention for Efficient and Effective Transformer
20 August 2021
Chuhan Wu
Fangzhao Wu
Tao Qi
Binxing Jiao
Daxin Jiang
Yongfeng Huang
Xing Xie
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Smart Bird: Learnable Sparse Attention for Efficient and Effective Transformer"
3 / 3 papers shown
Title
Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation
Botao Yu
Peiling Lu
Rui Wang
Wei Hu
Xu Tan
Wei Ye
Shikun Zhang
Tao Qin
Tie-Yan Liu
MGen
16
54
0
19 Oct 2022
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
249
2,009
0
28 Jul 2020
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
196
1,363
0
06 Jun 2016
1