Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.05139
Cited By
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
7 March 2025
Ling Team
B. Zeng
C. Huang
Chao Zhang
Changxin Tian
C. Chen
Dingnan Jin
Feng Yu
Feng Zhu
Feng Yuan
Fakang Wang
G. Wang
Guangyao Zhai
Haitao Zhang
Huizhong Li
Jun Zhou
Jia-Ling Liu
Junpeng Fang
Junjie Ou
Jun Hu
Ji Luo
J. Zhang
Jian Liu
Jian Sha
Jianxue Qian
J. Wu
Junping Zhao
J. Li
Jubao Feng
Jingchao Di
Junming Xu
J. Yao
Kuan Xu
Kewei Du
Longfei Li
Lei Liang
Lu Yu
Li Tang
Lin Ju
Peng Xu
Qing Cui
Song Liu
Shicheng Li
S.
Song Yan
Tengwei Cai
Tianyi Chen
Ting Guo
Ting Huang
Tao Feng
Tao Wu
Wei Wu
Xiaolu Zhang
X. J. Yang
Xin Zhao
Xiaobo Hu
Xin Lin
Yao Zhao
Y. Wang
Yongzhen Guo
Y. Wang
Yue Yang
Yang Cao
Yuhao Fu
Y. Xiong
Y. Li
Zhe Li
Zhiqiang Zhang
Ziqi Liu
Zhaoxin Huan
Zujie Wen
Zhenhang Sun
Zhuoxuan Du
Z. He
MoE
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs"
2 / 2 papers shown
Title
Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models
Ling Team
Caizhi Tang
Chilin Fu
Chunwei Wu
Jia Guo
...
Shuaicheng Li
Y. Zhang
Yingting Wu
Y. Liu
Zhenyu Huang
LRM
14
0
0
09 Apr 2025
Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM
Codefuse
Ling Team
Wenting Cai
Yuchen Cao
C. Chen
...
Wei Zhang
Z. Zhang
Hailin Zhao
Xunjin Zheng
Jun Zhou
ALM
MoE
49
0
0
22 Mar 2025
1