ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.12085
  4. Cited By
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

17 February 2025
Yuxiang Huang
Mingye Li
Xu Han
Chaojun Xiao
Weilin Zhao
Sun Ao
Hao Zhou
Jie Zhou
Zhiyuan Liu
Maosong Sun
ArXivPDFHTML

Papers citing "APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs"

Title
No papers