Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.09347
Cited By
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
14 March 2024
Sun Ao
Weilin Zhao
Xu Han
Cheng Yang
Zhiyuan Liu
Chuan Shi
Maosong Sun
GNN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences"
3 / 3 papers shown
Title
NBF at SemEval-2025 Task 5: Light-Burst Attention Enhanced System for Multilingual Subject Recommendation
Baharul Islam
Nasim Ahmad
F. Barbhuiya
Kuntal Dey
28
0
0
06 May 2025
SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog
Jennifer D’Souza
Sameer Sadruddin
Holger Israel
Mathias Begoin
Diana Slawig
52
5
0
09 Apr 2025
ZeRO-Offload: Democratizing Billion-Scale Model Training
Jie Ren
Samyam Rajbhandari
Reza Yazdani Aminabadi
Olatunji Ruwase
Shuangyang Yang
Minjia Zhang
Dong Li
Yuxiong He
MoE
160
399
0
18 Jan 2021
1