
![]() Tele-FLM Technical Report Xiang Li Yiqun Yao Xin Jiang Xuezhi Fang Chao Wang ...Yequan Wang Zhongjiang He Zhongyuan Wang Xuelong Li Tiejun Huang |
![]() MuPT: A Generative Symbolic Music Pretrained TransformerInternational Conference on Learning Representations (ICLR), 2024 |
![]() Sailor: Open Language Models for South-East AsiaConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() ViTamin: Designing Scalable Vision Models in the Vision-Language EraComputer Vision and Pattern Recognition (CVPR), 2024 |
![]() Accelerating Transformer Pre-training with 2:4 SparsityInternational Conference on Machine Learning (ICML), 2024 |
![]() KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable
AdaptationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
![]() ExeGPT: Constraint-Aware Resource Scheduling for LLM InferenceInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2024 |
![]() Revealing the Parallel Multilingual Learning within Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() Language models scale reliably with over-training and on downstream
tasksInternational Conference on Learning Representations (ICLR), 2024 |
![]() Rethinking Generative Large Language Model Evaluation for Semantic
ComprehensionInternational Conference on Machine Learning (ICML), 2024 |