Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.08681
Cited By
A Mean Field Ansatz for Zero-Shot Weight Transfer
16 August 2024
Xingyuan Chen
Wenwei Kuang
Lei Deng
Wei Han
Bo Bai
Goncalo dos Reis
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Mean Field Ansatz for Zero-Shot Weight Transfer"
2 / 2 papers shown
Title
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Xiaozhe Ren
Pingyi Zhou
Xinfan Meng
Xinjing Huang
Yadao Wang
...
Jiansheng Wei
Xin Jiang
Teng Su
Qun Liu
Jun Yao
ALM
MoE
44
59
0
20 Mar 2023
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
217
3,054
0
23 Jan 2020
1