Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.12085
Cited By
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs
17 February 2025
Yuxiang Huang
Mingye Li
Xu Han
Chaojun Xiao
Weilin Zhao
Sun Ao
Hao Zhou
Jie Zhou
Zhiyuan Liu
Maosong Sun
Re-assign community
ArXiv
PDF
HTML
Papers citing
"APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs"
Title
No papers