2405.17025
SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs
27 May 2024
Zhenyu Bai, Pranav Dangi, Huize Li, Tulika Mitra
Papers citing "SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs"
3 / 3 papers shown
Title
AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-Design
Yanbiao Liang, Huihong Shi, Haikuo Shao, Zhongfeng Wang
18
0
0
07 Apr 2025
SALO: An Efficient Spatial Accelerator Enabling Hybrid Sparse Attention Mechanisms for Long Sequences
Guan Shen, Jieru Zhao, Quan Chen, Jingwen Leng, C. Li, Minyi Guo
34
26
0
29 Jun 2022
Big Bird: Transformers for Longer Sequences
Manzil Zaheer, Guru Guruganesh, Kumar Avinava Dubey, Joshua Ainslie, Chris Alberti, ..., Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed
VLM
249
1,982
0
28 Jul 2020