2501.02156
The Race to Efficiency: A New Perspective on AI Scaling Laws
4 January 2025
Chien-Ping Lu
Papers citing "The Race to Efficiency: A New Perspective on AI Scaling Laws"
1 / 1 papers shown
Title
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
Xing Hu, Zhixuan Chen, Dawei Yang, Zukang Xu, Chen Xu, Zhihang Yuan, Sifan Zhou, Jiangyong Yu
MoE
MQ
02 May 2025