Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2411.07942
Cited By
Towards Low-bit Communication for Tensor Parallel LLM Inference
12 November 2024
Harry Dong
Tyler Johnson
Minsik Cho
Emad Soroush
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Towards Low-bit Communication for Tensor Parallel LLM Inference"
1 / 1 papers shown
Federated Attention: A Distributed Paradigm for Collaborative LLM Inference over Edge Networks
Xiumei Deng
Zehui Xiong
Binbin Chen
Dong In Kim
Mérouane Debbah
H. Vincent Poor
FedML
165
0
0
04 Nov 2025
1