ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.11873
  4. Cited By
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

21 January 2025
Z. Qiu
Zeyu Huang
Bo Zheng
Kaiyue Wen
Z. Wang
Rui Men
Ivan Titov
Dayiheng Liu
Jingren Zhou
Junyang Lin
    MoE
ArXivPDFHTML

Papers citing "Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models"

1 / 1 papers shown
Title
Neural network task specialization via domain constraining
Neural network task specialization via domain constraining
Roman Malashin
Daniil Ilyukhin
49
0
0
28 Apr 2025
1