ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.07942
  4. Cited By
Towards Low-bit Communication for Tensor Parallel LLM Inference

Towards Low-bit Communication for Tensor Parallel LLM Inference

12 November 2024
Harry Dong
Tyler Johnson
Minsik Cho
Emad Soroush
    MQ
ArXiv (abs)PDFHTML

Papers citing "Towards Low-bit Communication for Tensor Parallel LLM Inference"

1 / 1 papers shown
Federated Attention: A Distributed Paradigm for Collaborative LLM Inference over Edge Networks
Federated Attention: A Distributed Paradigm for Collaborative LLM Inference over Edge Networks
Xiumei Deng
Zehui Xiong
Binbin Chen
Dong In Kim
Mérouane Debbah
H. Vincent Poor
FedML
165
0
0
04 Nov 2025
1