Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.00413
Cited By
CPM: A Large-scale Generative Chinese Pre-trained Language Model
AI Open (AO), 2020
1 December 2020
Zhengyan Zhang
Xu Han
Hao Zhou
Pei Ke
Yuxian Gu
Deming Ye
Yujia Qin
Yusheng Su
Haozhe Ji
Jian Guan
Fanchao Qi
Xiaozhi Wang
Yanan Zheng
Guoyang Zeng
Huanqi Cao
S. Chen
Daixuan Li
Zhenbo Sun
Zhiyuan Liu
Shiyu Huang
Wentao Han
Jie Tang
Juan-Zi Li
Xiaoyan Zhu
Maosong Sun
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1585★)
Papers citing
"CPM: A Large-scale Generative Chinese Pre-trained Language Model"
50 / 56 papers shown
CAP-LLM: Context-Augmented Personalized Large Language Models for News Headline Generation
Raymond Wilson
Cole Graham
Chase Carter
Zefeng Yang
Ruiqi Gu
130
0
0
05 Aug 2025
Uncovering the Fragility of Trustworthy LLMs through Chinese Textual Ambiguity
Xinwei Wu
Haojie Li
Hongyu Liu
Xinyu Ji
Ruohan Li
Yule Chen
Yigeng Zhang
169
3
0
30 Jul 2025
HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration
Jiaqi Lv
Xufeng He
Yanchen Liu
Xu Dai
Aocheng Shen
Yinghao Li
Jiachen Hao
Jianrong Ding
Yang Hu
Shouyi Yin
336
2
0
12 Jun 2025
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling
International Conference on Learning Representations (ICLR), 2024
Ruochen Zhang
Qinan Yu
Matianyu Zang
Carsten Eickhoff
Ellie Pavlick
293
17
0
11 Oct 2024
Can Pre-trained Language Models Understand Chinese Humor?
Yuyan Chen
Zhixu Li
Jiaqing Liang
Yanghua Xiao
Bang Liu
Yunwen Chen
238
26
0
04 Jul 2024
Modeling Comparative Logical Relation with Contrastive Learning for Text Generation
Yuhao Dan
Junfeng Tian
Jie Zhou
Ming Yan
Ji Zhang
Qin Chen
Liang He
301
0
0
13 Jun 2024
Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback
Jingwei Sun
Zhixu Du
Yiran Chen
KELM
290
4
0
30 May 2024
SambaLingo: Teaching Large Language Models New Languages
Zoltan Csaki
Bo Li
Jonathan Li
Qiantong Xu
Pian Pawakapan
Leon Zhang
Yun Du
Hengyu Zhao
Changran Hu
Urmish Thakker
257
14
0
08 Apr 2024
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
329
90
0
15 Feb 2024
STADEE: STAtistics-based DEEp Detection of Machine Generated Text
International Conference on Intelligent Computing (ICIC), 2023
Zheng Chen
Huming Liu
DeLMO
133
11
0
04 Dec 2023
Character-level Chinese Backpack Language Models
Hao Sun
John Hewitt
166
1
0
19 Oct 2023
FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models
International Conference on Machine Learning (ICML), 2023
Jingwei Sun
Ziyue Xu
Hongxu Yin
Dong Yang
Daguang Xu
Yiran Chen
Holger R. Roth
VLM
281
37
0
02 Oct 2023
At Which Training Stage Does Code Data Help LLMs Reasoning?
International Conference on Learning Representations (ICLR), 2023
Xiaogang Jia
Yue Liu
Yue Yu
Yuanliang Zhang
Yu Jiang
Changjian Wang
Shanshan Li
LRM
SyDa
400
96
0
28 Sep 2023
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
International Conference on Learning Representations (ICLR), 2023
Jinyi Hu
Yuan Yao
Chong Wang
Shanonan Wang
Yinxu Pan
...
Yankai Lin
Jiao Xue
Dahai Li
Zhiyuan Liu
Maosong Sun
MLLM
VLM
301
78
0
23 Aug 2023
Shuo Wen Jie Zi: Rethinking Dictionaries and Glyphs for Chinese Language Pre-training
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuxuan Wang
Jianghui Wang
Dongyan Zhao
Zilong Zheng
193
5
0
30 May 2023
WebCPM: Interactive Web Search for Chinese Long-form Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yujia Qin
Zihan Cai
Di Jin
Lan Yan
Shi Liang
...
Ruobing Xie
Fanchao Qi
Zhiyuan Liu
Maosong Sun
Jie Zhou
RALM
257
115
0
11 May 2023
ChatGPT: Vision and Challenges
Internet of Things and Cyber-Physical Systems (IoT-CPS), 2023
S. Gill
Rupinder Kaur
191
173
0
08 May 2023
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Xiaozhe Ren
Pingyi Zhou
Xinfan Meng
Xinjing Huang
Yadao Wang
...
Jiansheng Wei
Xin Jiang
Teng Su
Qun Liu
Jun Yao
ALM
MoE
266
86
0
20 Mar 2023
TextBox 2.0: A Text Generation Library with Pre-trained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianyi Tang
Junyi Li
Zhongfu Chen
Yiwen Hu
Zhuohao Yu
...
Xiaoxue Cheng
Yuhao Wang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
145
9
0
26 Dec 2022
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP Tasks
Weilong Dong
Xinwei Wu
Junzhuo Li
Shuangzhi Wu
Chao Bian
Deyi Xiong
FedML
266
7
0
16 Dec 2022
SLING: Sino Linguistic Evaluation of Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yixiao Song
Kalpesh Krishna
R. Bhatt
Mohit Iyyer
290
15
0
21 Oct 2022
WeLM: A Well-Read Pre-trained Language Model for Chinese
Hui Su
Xiao Zhou
Houjin Yu
Xiaoyu Shen
Yuwen Chen
Zilin Zhu
Yang Yu
Jie Zhou
294
25
0
21 Sep 2022
In conversation with Artificial Intelligence: aligning language models with human values
Philosophy & Technology (PT), 2022
Atoosa Kasirzadeh
Iason Gabriel
427
139
0
01 Sep 2022
Domain-Specific Text Generation for Machine Translation
Conference of the Association for Machine Translation in the Americas (AMTA), 2022
Yasmin Moslem
Rejwanul Haque
John D. Kelleher
Andy Way
206
25
0
11 Aug 2022
Machine Learning Model Sizes and the Parameter Gap
Pablo Villalobos
J. Sevilla
T. Besiroglu
Lennart Heim
A. Ho
Marius Hobbhahn
ALM
ELM
AI4CE
279
81
0
05 Jul 2022
Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Benyou Wang
Xiang Wu
Xiaokang Liu
Jianquan Li
Prayag Tiwari
Qianqian Xie
192
8
0
02 Jul 2022
JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding
Knowledge Discovery and Data Mining (KDD), 2022
Wayne Xin Zhao
Kun Zhou
Zheng Gong
Beichen Zhang
Yuanhang Zhou
Jing Sha
Zhigang Chen
Shijin Wang
Cong Liu
Ji-Rong Wen
213
22
0
13 Jun 2022
Nominal Metaphor Generation with Multitask Learning
International Conference on Natural Language Generation (INLG), 2022
Yucheng Li
Chenghua Lin
Frank Guerin
210
15
0
10 Jun 2022
Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information
Zhipeng Zhang
Xinglin Hou
K. Niu
Zhongzhen Huang
Bo Xiao
Yuning Jiang
Qi Wu
Peifeng Wang
168
5
0
07 May 2022
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training
Machine Intelligence Research (MIR), 2022
Yuxian Gu
Jiaxin Wen
Hao Sun
Yi Song
Pei Ke
...
Zheng Zhang
Jianzhu Yao
Lei Liu
Xiaoyan Zhu
Shiyu Huang
276
57
0
17 Mar 2022
Exploring and Adapting Chinese GPT to Pinyin Input Method
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Minghuan Tan
Yong Dai
Duyu Tang
Zhangyin Feng
Guoping Huang
Jing Jiang
Jiwei Li
Shuming Shi
AI4CE
300
12
0
01 Mar 2022
Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jingyan Zhou
Deng Jiawen
Fei Mi
Yitong Li
Yasheng Wang
Shiyu Huang
Xin Jiang
Qun Liu
Helen Meng
277
48
0
16 Feb 2022
Compute Trends Across Three Eras of Machine Learning
IEEE International Joint Conference on Neural Network (IJCNN), 2022
J. Sevilla
Lennart Heim
A. Ho
T. Besiroglu
Marius Hobbhahn
Pablo Villalobos
651
380
0
11 Feb 2022
ZeroPrompt: Scaling Prompt-Based Pretraining to 1,000 Tasks Improves Zero-Shot Generalization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hanwei Xu
Yujun Chen
Yulun Du
Nan Shao
Yanggang Wang
Haiyu Li
Zhilin Yang
VLM
LRM
AI4CE
262
72
0
18 Jan 2022
COLD: A Benchmark for Chinese Offensive Language Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Deng Jiawen
Jingyan Zhou
Hao Sun
Chujie Zheng
Fei Mi
Helen M. Meng
Shiyu Huang
428
144
0
16 Jan 2022
A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
ACM Computing Surveys (ACM CSUR), 2022
Hanqing Zhang
Haolin Song
Shaoyu Li
Ming Zhou
Dawei Song
632
313
0
14 Jan 2022
Pretrained Language Models for Text Generation: A Survey
ACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
582
284
0
14 Jan 2022
Black-Box Tuning for Language-Model-as-a-Service
International Conference on Machine Learning (ICML), 2022
Tianxiang Sun
Yunfan Shao
Hong Qian
Xuanjing Huang
Xipeng Qiu
VLM
546
334
0
10 Jan 2022
A Survey on Gender Bias in Natural Language Processing
Karolina Stañczak
Isabelle Augenstein
239
149
0
28 Dec 2021
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Tri Dao
Beidi Chen
Kaizhao Liang
Jiaming Yang
Zhao Song
Atri Rudra
Christopher Ré
466
93
0
30 Nov 2021
FPM: A Collection of Large-scale Foundation Pre-trained Language Models
Dezhou Shen
AI4CE
150
0
0
09 Nov 2021
Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese
Zhuosheng Zhang
Hanqing Zhang
Keming Chen
Yuhang Guo
Jingyun Hua
Yulong Wang
Ming Zhou
VLM
264
79
0
13 Oct 2021
Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot Learning
Shaohua Wu
Xudong Zhao
Tong Yu
Rongguo Zhang
C. Shen
...
Feng Li
Hong Zhu
Jiangang Luo
Liang Xu
Xuanwei Zhang
ALM
303
68
0
10 Oct 2021
M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Junyang Lin
An Yang
Jinze Bai
Chang Zhou
Le Jiang
...
Jie Zhang
Yong Li
Jialin Li
Jingren Zhou
Hongxia Yang
MoE
362
46
0
08 Oct 2021
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
Siqi Bao
H. He
Fan Wang
Hua Wu
Haifeng Wang
...
Xinxian Huang
Xin Tian
Xinchao Xu
Yingzhan Lin
Zhengyu Niu
VLM
ALM
228
69
0
20 Sep 2021
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
Yunfan Shao
Zhichao Geng
Yitao Liu
Junqi Dai
Hang Yan
Fei Yang
Li Zhe
Hujun Bao
Xipeng Qiu
MedIm
376
172
0
13 Sep 2021
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation
Transactions of the Association for Computational Linguistics (TACL), 2021
Jian Guan
Zhuoer Feng
Yamei Chen
Ru He
Xiaoxi Mao
Changjie Fan
Shiyu Huang
247
35
0
30 Aug 2021
EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training
Hao Zhou
Pei Ke
Zheng Zhang
Yuxian Gu
Yinhe Zheng
...
Xiaocong Yang
Bosi Wen
Xiaoyan Zhu
Shiyu Huang
Jie Tang
149
57
0
03 Aug 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
ACM Computing Surveys (CSUR), 2021
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
812
5,079
0
28 Jul 2021
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Yu Sun
Shuohuan Wang
Shikun Feng
Siyu Ding
Chao Pang
...
Ouyang Xuan
Dianhai Yu
Hao Tian
Hua Wu
Haifeng Wang
254
572
0
05 Jul 2021
1
2
Next
Page 1 of 2