2410.17954
Cited By
ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference
23 October 2024
Xin He
Shunkang Zhang
Yuxin Wang
Haiyan Yin
Zihao Zeng
Shaohuai Shi
Zhenheng Tang
Xiaowen Chu
Ivor Tsang
Ong Yew Soon
MoE
Papers citing "ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference"
2 / 2 papers shown
Title
FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion
Zhenheng Tang
Yonggang Zhang
Peijie Dong
Y. Cheung
Amelie Chi Zhou
Bo Han
Xiaowen Chu
FedML
MoMe
AI4CE
45
6
0
27 Oct 2024
FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression
Zhenheng Tang
Xueze Kang
Yiming Yin
Xinglin Pan
Yuxin Wang
...
Shaohuai Shi
Amelie Chi Zhou
Bo Li
Bingsheng He
Xiaowen Chu
AI4CE
50
1
0
16 Oct 2024