Title |
---|
![]() Scaling Smart: Accelerating Large Language Model Pre-training with Small
Model Initialization Mohammad Samragh Iman Mirzadeh Keivan Alizadeh Vahid Fartash Faghri Minsik Cho Moin Nabi Devang Naik Mehrdad Farajtabar |
![]() 52B to 1T: Lessons Learned via Tele-FLM Series Xiang Li Yiqun Yao Xin Jiang Xuezhi Fang Chao Wang ...Yequan Wang Zhongjiang He Zhongyuan Wang Xuelong Li Tiejun Huang |
![]() FLM-101B: An Open LLM and How to Train It with Xiang Li Yiqun Yao Xin Jiang Xuezhi Fang Xuying Meng ...LI DU Bowen Qin Zheng-Wei Zhang Aixin Sun Yequan Wang |