ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.09716
  4. Cited By

MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based Batching

12 March 2025
Tairan Xu
Leyang Xue
Zhan Lu
Adrian Jackson
Luo Mai
    MoE
ArXivPDFHTML

Papers citing "MoE-Gen: High-Throughput MoE Inference on a Single GPU with Module-Based Batching"

1 / 1 papers shown
Title
MoE-Lens: Towards the Hardware Limit of High-Throughput MoE LLM Serving Under Resource Constraints
MoE-Lens: Towards the Hardware Limit of High-Throughput MoE LLM Serving Under Resource Constraints
Yichao Yuan
Lin Ma
Nishil Talati
MoE
54
0
0
12 Apr 2025
1