arXiv:2401.02954
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
5 January 2024
DeepSeek-AI:
Xiao Bi
Deli Chen
Guanting Chen
Shanhuang Chen
Damai Dai
Chengqi Deng
Honghui Ding
Kai Dong
Qiushi Du
Zhe Fu
Huazuo Gao
Kaige Gao
W. Gao
Ruiqi Ge
Kang Guan
Daya Guo
Jianzhong Guo
Guangbo Hao
Zhewen Hao
Ying He
Wen-Hui Hu
Panpan Huang
Erhang Li
Guowei Li
Jiashi Li
Yao Li
Y. K. Li
W. Liang
Fangyun Lin
A. Liu
Bo Liu
Wen Liu
Xiaodong Liu
Xin Liu
Yiyuan Liu
Haoyu Lu
Shanghao Lu
Fuli Luo
Shirong Ma
Xiaotao Nie
Tian Pei
Yishi Piao
Junjie Qiu
Hui Qu
Tongzheng Ren
Zehui Ren
Chong Ruan
Zhangli Sha
Zhihong Shao
Jun-Mei Song
Xuecheng Su
Jingxiang Sun
Yaofeng Sun
Min Tang
Bing-Li Wang
Peiyi Wang
Shiyu Wang
Yaohui Wang
Yongji Wang
Tong Wu
Yu-Huan Wu
Xin Xie
Zhenda Xie
Ziwei Xie
Yi Xiong
Hanwei Xu
R. X. Xu
Yanhong Xu
Dejian Yang
Yu-mei You
Shuiping Yu
Xin-yuan Yu
Bo Zhang
Haowei Zhang
Lecong Zhang
Liyue Zhang
Mingchuan Zhang
Minghu Zhang
Wentao Zhang
Yichao Zhang
Chenggang Zhao
Yao Zhao
Shangyan Zhou
Shunfeng Zhou
Qihao Zhu
Yuheng Zou
ALM
Papers citing "DeepSeek LLM: Scaling Open-Source Language Models with Longtermism"
AlignBench: Benchmarking Chinese Alignment of Large Language Models
Xiao Liu
Xuanyu Lei
Sheng-Ping Wang
Yue Huang
Zhuoer Feng
...
Hongning Wang
Jing Zhang
Minlie Huang
Yuxiao Dong
Jie Tang
ELM
LM&MA
ALM
60
16
0
30 Nov 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
198
8,441
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
ReLM
LM&Ro
203
5,177
0
28 Jan 2022
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
124
1,508
0
31 Dec 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
142
3,054
0
23 Jan 2020
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
157
1,436
0
17 Sep 2019