ResearchTrend.AI

LLaDA-MoE: A Sparse MoE Diffusion Language Model

arXiv: 2509.24389

29 September 2025
Fengqi Zhu
Zebin You
Yipeng Xing
Zenan Huang
Lin Liu
Yihong Zhuang
Guoshan Lu
Kangyu Wang
Xudong Wang
Lanning Wei
Hongrui Guo
Jiaqi Hu
Wentao Ye
Tieyuan Chen
Chenchen Li
Chengfu Tang
Haibo Feng
Jun Hu
Jun Zhou
Xiaolu Zhang
Zhenzhong Lan
Junbo Zhao
Da Zheng
Chongxuan Li
Jianguo Li
J. Wen
    MoE
ArXiv (abs) · PDF · HTML · HuggingFace (1 upvote)

Papers citing "LLaDA-MoE: A Sparse MoE Diffusion Language Model"

5 / 5 papers shown
Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules
Amr Mohamed
Yang Zhang
Michalis Vazirgiannis
Guokan Shang
AI4CE
186
0
0
02 Dec 2025
Orchestrating Dual-Boundaries: An Arithmetic Intensity Inspired Acceleration Framework for Diffusion Language Models
Linye Wei
Wenjue Chen
Pingzhi Tang
Xiaotian Guo
Le Ye
Runsheng Wang
Meng Li
AI4CE
109
0
0
24 Nov 2025
Reasoning in Diffusion Large Language Models is Concentrated in Dynamic Confusion Zones
Ranfei Chen
Ming Chen
Kaifei Wang
DiffM, AI4CE, LRM
207
0
0
19 Nov 2025
Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
Marianne Arriola
Yair Schiff
Hao Phung
Aaron Gokaslan
Volodymyr Kuleshov
146
1
0
26 Oct 2025
CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits
Kangyu Wang
Zhiyun Jiang
Haibo Feng
Weijia Zhao
Lin Liu
Jianguo Li
Zhenzhong Lan
Weiyao Lin
122
3
0
07 Oct 2025