Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.02265
Cited By
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
4 November 2024
X. Sun
Yanfeng Chen
Y. Huang
Ruobing Xie
Jiaqi Zhu
K. Zhang
Shuaipeng Li
Zhen Yang
J. N. Han
Xiaobo Shu
Jiahao Bu
Z. Chen
Xuemeng Huang
Fengzong Lian
S. M. I. Simon X. Yang
Jianfeng Yan
Yuyuan Zeng
Xiaoqin Ren
Chao Yu
Lulu Wu
Yue Mao
Jun Xia
Tao Yang
S. Zheng
Kan Wu
Dian Jiao
J. Xue
X. Zhang
Decheng Wu
Kai Liu
Dengpeng Wu
Guanghui Xu
S. Chen
Shuang Chen
Xiao Feng
Yigeng Hong
Junqiang Zheng
Chengcheng Xu
Z. Li
Xiong Kuang
Jianglu Hu
Yiqi Chen
Yuchi Deng
Guiyang Li
Ao Liu
Chenchen Zhang
Shihui Hu
Zilong Zhao
Zifan Wu
Yao Ding
W. Wang
Han Liu
R. Wang
Hao Fei
Peijie Yu
Ze Zhao
Xun Cao
Hai Wang
Fusheng Xiang
Mengyuan Huang
Zhiyuan Xiong
Bin Hu
Xuebin Hou
Lei Jiang
Jianqiang Ma
Jiajia Wu
Yaping Deng
Yi Shen
Qian Wang
Weijie Liu
Jie Liu
Meng Chen
Liang Dong
W. Jia
H. Chen
F. Liu
Rui Yuan
Huilin Xu
Zhenxiang Yan
Tengfei Cao
Zhichao Hu
Xinhua Feng
Dong Du
T. Yu
Yangyu Tao
Feng Zhang
Jianchen Zhu
C. Xu
X. Li
Chong Zha
Wen Ouyang
Yinben Xia
Xiang Li
Zekun He
Rongpeng Chen
Jiawei Song
Ruibin Chen
F. Jiang
Chongqing Zhao
B. Wang
Hao Gong
Rong Gan
Winston Hu
Zhanhui Kang
Yong Yang
Yuhong Liu
Di Wang
Jie Jiang
MoE
ALM
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent"
6 / 6 papers shown
Title
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
Ashwinee Panda
Vatsal Baherwani
Zain Sarwar
Benjamin Thérien
Supriyo Chakraborty
Tom Goldstein
MoE
37
0
0
16 Apr 2025
Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions
Peijie Yu
Yifan Yang
J. Li
Zelong Zhang
Haorui Wang
Xiao Feng
Feng Zhang
LLMAG
109
0
0
03 Apr 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
120
1
0
10 Mar 2025
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs
Minxuan Lv
Zhenpeng Su
Leiyu Pan
Yizhe Xiong
Zijia Lin
...
Guiguang Ding
Cheng Luo
Di Zhang
Kun Gai
Songlin Hu
MoE
39
0
0
18 Feb 2025
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
Guangzhi Sun
Yudong Yang
Jimin Zhuang
Changli Tang
Y. Li
W. Li
Z. Ma
Chao Zhang
LRM
MLLM
VLM
64
3
0
17 Feb 2025
Scaling Laws for Floating Point Quantization Training
X. Sun
Shuaipeng Li
Ruobing Xie
Weidong Han
Kan Wu
...
Yangyu Tao
Zhanhui Kang
C. Xu
Di Wang
Jie Jiang
MQ
AIFin
58
0
0
05 Jan 2025
1