Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.04398
Cited By
Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling
6 March 2025
Yan Li
Pengfei Zheng
Shuang Chen
Zewei Xu
Yuanhao Lai
Yunfei Du
Z. Wang
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling"
Title
No papers