ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.19261
  4. Cited By
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

26 February 2025
Taishi Nakamura
Takuya Akiba
Kazuki Fujii
Yusuke Oda
Rio Yokota
Jun Suzuki
    MoMe
    MoE
ArXivPDFHTML

Papers citing "Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization"

1 / 1 papers shown
Title
Mixture of Group Experts for Learning Invariant Representations
Mixture of Group Experts for Learning Invariant Representations
Lei Kang
Jia Li
Mi Tian
Hua Huang
MoE
25
0
0
12 Apr 2025
1