ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.03197
  4. Cited By
Unified Language Model Pre-training for Natural Language Understanding
  and Generation

Unified Language Model Pre-training for Natural Language Understanding and Generation

8 May 2019
Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu-Chiang Frank Wang
Jianfeng Gao
M. Zhou
H. Hon
    ELM
    AI4CE
ArXivPDFHTML

Papers citing "Unified Language Model Pre-training for Natural Language Understanding and Generation"

50 / 847 papers shown
Title
It is AI's Turn to Ask Humans a Question: Question-Answer Pair
  Generation for Children's Story Books
It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books
Bingsheng Yao
Dakuo Wang
Tongshuang Wu
Zheng Zhang
Toby Jia-Jun Li
Mo Yu
Ying Xu
AI4Ed
17
43
0
08 Sep 2021
DialogLM: Pre-trained Model for Long Dialogue Understanding and
  Summarization
DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization
Ming Zhong
Yang Liu
Yichong Xu
Chenguang Zhu
Michael Zeng
VLM
AI4CE
27
123
0
06 Sep 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for
  Code Understanding and Generation
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
210
1,489
0
02 Sep 2021
Faithful or Extractive? On Mitigating the Faithfulness-Abstractiveness
  Trade-off in Abstractive Summarization
Faithful or Extractive? On Mitigating the Faithfulness-Abstractiveness Trade-off in Abstractive Summarization
Faisal Ladhak
Esin Durmus
He He
Claire Cardie
Kathleen McKeown
14
64
0
31 Aug 2021
Differentiable Prompt Makes Pre-trained Language Models Better Few-shot
  Learners
Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
Ningyu Zhang
Luoqiu Li
Xiang Chen
Shumin Deng
Zhen Bi
Chuanqi Tan
Fei Huang
Huajun Chen
VLM
28
170
0
30 Aug 2021
Scheduled Sampling Based on Decoding Steps for Neural Machine
  Translation
Scheduled Sampling Based on Decoding Steps for Neural Machine Translation
Yijin Liu
Fandong Meng
Yufeng Chen
Jinan Xu
Jie Zhou
20
16
0
30 Aug 2021
Generating Answer Candidates for Quizzes and Answer-Aware Question
  Generators
Generating Answer Candidates for Quizzes and Answer-Aware Question Generators
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
19
5
0
29 Aug 2021
Analyzing and Mitigating Interference in Neural Architecture Search
Analyzing and Mitigating Interference in Neural Architecture Search
Jin Xu
Xu Tan
Kaitao Song
Renqian Luo
Yichong Leng
Tao Qin
Tie-Yan Liu
Jian Li
MoMe
18
29
0
29 Aug 2021
Smoothing Dialogue States for Open Conversational Machine Reading
Smoothing Dialogue States for Open Conversational Machine Reading
Zhuosheng Zhang
Siru Ouyang
Hai Zhao
Masao Utiyama
Eiichiro Sumita
24
6
0
28 Aug 2021
Self-training Improves Pre-training for Few-shot Learning in
  Task-oriented Dialog Systems
Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
Fei Mi
Wanhao Zhou
Feng Cai
Lingjing Kong
Minlie Huang
Boi Faltings
27
32
0
28 Aug 2021
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
Pierre Colombo
Guillaume Staerman
Chloé Clavel
Pablo Piantanida
27
41
0
27 Aug 2021
LayoutReader: Pre-training of Text and Layout for Reading Order
  Detection
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang
Yiheng Xu
Lei Cui
Jingbo Shang
Furu Wei
11
75
0
26 Aug 2021
Regularizing Transformers With Deep Probabilistic Layers
Regularizing Transformers With Deep Probabilistic Layers
Aurora Cobo Aguilera
Pablo Martínez Olmos
Antonio Artés-Rodríguez
Fernando Pérez-Cruz
8
7
0
23 Aug 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive
  Machine Translation
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
26
11
0
19 Aug 2021
CUSTOM: Aspect-Oriented Product Summarization for E-Commerce
CUSTOM: Aspect-Oriented Product Summarization for E-Commerce
Jiahui Liang
Junwei Bao
Yifan Wang
Youzheng Wu
Xiaodong He
Bowen Zhou
24
2
0
18 Aug 2021
On Multi-Modal Learning of Editing Source Code
On Multi-Modal Learning of Editing Source Code
Saikat Chakraborty
Baishakhi Ray
KELM
16
58
0
15 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural
  Language Processing
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLM
LM&MA
26
258
0
12 Aug 2021
ICAF: Iterative Contrastive Alignment Framework for Multimodal
  Abstractive Summarization
ICAF: Iterative Contrastive Alignment Framework for Multimodal Abstractive Summarization
Zijian Zhang
Chang Shu
Youxin Chen
Jing Xiao
Qian Zhang
Lu Zheng
13
5
0
11 Aug 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
23
3,828
0
28 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
13
37
0
15 Jul 2021
A Survey on Dialogue Summarization: Recent Advances and New Frontiers
A Survey on Dialogue Summarization: Recent Advances and New Frontiers
Xiachong Feng
Xiaocheng Feng
Bing Qin
21
100
0
07 Jul 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge
  Transfer
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Mohit Bansal
VLM
22
29
0
06 Jul 2021
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and
  Generation
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation
Jing Liu
Xinxin Zhu
Fei Liu
Longteng Guo
Zijia Zhao
...
Weining Wang
Hanqing Lu
Shiyu Zhou
Jiajun Zhang
Jinqiao Wang
23
36
0
01 Jul 2021
Improving Factual Consistency of Abstractive Summarization on Customer
  Feedback
Improving Factual Consistency of Abstractive Summarization on Customer Feedback
Yang Liu
Yifei Sun
Vincent Gao
HILM
11
6
0
30 Jun 2021
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA
Zewen Chi
Shaohan Huang
Li Dong
Shuming Ma
Bo Zheng
...
Payal Bajaj
Xia Song
Xian-Ling Mao
Heyan Huang
Furu Wei
39
118
0
30 Jun 2021
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin
  Information
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information
Zijun Sun
Xiaoya Li
Xiaofei Sun
Yuxian Meng
Xiang Ao
Qing He
Fei Wu
Jiwei Li
SSeg
29
183
0
30 Jun 2021
The Values Encoded in Machine Learning Research
The Values Encoded in Machine Learning Research
Abeba Birhane
Pratyusha Kalluri
Dallas Card
William Agnew
Ravit Dotan
Michelle Bao
14
273
0
29 Jun 2021
SCARF: Self-Supervised Contrastive Learning using Random Feature
  Corruption
SCARF: Self-Supervised Contrastive Learning using Random Feature Corruption
Dara Bahri
Heinrich Jiang
Yi Tay
Donald Metzler
SSL
17
163
0
29 Jun 2021
DeltaLM: Encoder-Decoder Pre-training for Language Generation and
  Translation by Augmenting Pretrained Multilingual Encoders
DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders
Shuming Ma
Li Dong
Shaohan Huang
Dongdong Zhang
Alexandre Muzio
Saksham Singhal
Hany Awadalla
Xia Song
Furu Wei
SLR
AI4CE
25
80
0
25 Jun 2021
Learning to Sample Replacements for ELECTRA Pre-Training
Learning to Sample Replacements for ELECTRA Pre-Training
Y. Hao
Li Dong
Hangbo Bao
Ke Xu
Furu Wei
MU
6
11
0
25 Jun 2021
Adapt-and-Distill: Developing Small, Fast and Effective Pretrained
  Language Models for Domains
Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains
Yunzhi Yao
Shaohan Huang
Wenhui Wang
Li Dong
Furu Wei
VLM
ALM
13
46
0
25 Jun 2021
Domain-Specific Pretraining for Vertical Search: Case Study on
  Biomedical Literature
Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature
Yu-Chiang Frank Wang
Jinchao Li
Tristan Naumann
Chenyan Xiong
Hao Cheng
...
Yang Qin
Eric Horvitz
Paul N. Bennett
Jianfeng Gao
Hoifung Poon
OOD
25
13
0
25 Jun 2021
BARTScore: Evaluating Generated Text as Text Generation
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
11
804
0
22 Jun 2021
How well do you know your summarization datasets?
How well do you know your summarization datasets?
Priyam Tejaswin
Dhruv Naik
Peng Liu
22
26
0
21 Jun 2021
Enhancing Question Generation with Commonsense Knowledge
Enhancing Question Generation with Commonsense Knowledge
Xin Jia
Hao Wang
D. Yin
Yunfang Wu
6
6
0
19 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
10
2,742
0
15 Jun 2021
SAS: Self-Augmentation Strategy for Language Model Pre-training
SAS: Self-Augmentation Strategy for Language Model Pre-training
Yifei Xu
Jingqiao Zhang
Ru He
Liangzhu Ge
Chao Yang
Cheng Yang
Ying Wu
26
1
0
14 Jun 2021
Pre-Trained Models: Past, Present and Future
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
24
811
0
14 Jun 2021
To Beam Or Not To Beam: That is a Question of Cooperation for Language
  GANs
To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs
Thomas Scialom
Paul-Alexis Dray
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
19
19
0
11 Jun 2021
UniKeyphrase: A Unified Extraction and Generation Framework for
  Keyphrase Prediction
UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction
Huanqin Wu
Wei Liu
Lei Li
Dan Nie
Tao Chen
Feng Zhang
Di Wang
6
22
0
09 Jun 2021
FastSeq: Make Sequence Generation Faster
FastSeq: Make Sequence Generation Faster
Yu Yan
Fei Hu
Jiusheng Chen
Nikhil Bhendawade
Ting Ye
Yeyun Gong
Nan Duan
Desheng Cui
Bingyu Chi
Ruifei Zhang
VLM
16
15
0
08 Jun 2021
Diverse Pretrained Context Encodings Improve Document Translation
Diverse Pretrained Context Encodings Improve Document Translation
Domenic Donato
Lei Yu
Chris Dyer
14
15
0
07 Jun 2021
BERTGEN: Multi-task Generation through BERT
BERTGEN: Multi-task Generation through BERT
Faidon Mitzalis
Ozan Caglayan
Pranava Madhyastha
Lucia Specia
VLM
19
7
0
07 Jun 2021
Real-Time Cognitive Evaluation of Online Learners through Automatically
  Generated Questions
Real-Time Cognitive Evaluation of Online Learners through Automatically Generated Questions
Ritu Gala
Revathi Vijayaraghavan
V. Nikam
Arvind W. Kiwelekar
6
4
0
06 Jun 2021
Visual Question Rewriting for Increasing Response Rate
Visual Question Rewriting for Increasing Response Rate
Jiayi Wei
Xilian Li
Yi Zhang
Xin Eric Wang
20
2
0
04 Jun 2021
Addressing Inquiries about History: An Efficient and Practical Framework
  for Evaluating Open-domain Chatbot Consistency
Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot Consistency
Zekang Li
Jinchao Zhang
Zhengcong Fei
Yang Feng
Jie Zhou
6
14
0
04 Jun 2021
Defending Against Backdoor Attacks in Natural Language Generation
Defending Against Backdoor Attacks in Natural Language Generation
Xiaofei Sun
Xiaoya Li
Yuxian Meng
Xiang Ao
Fei Wu
Jiwei Li
Tianwei Zhang
AAML
SILM
18
47
0
03 Jun 2021
Evaluating the Efficacy of Summarization Evaluation across Languages
Evaluating the Efficacy of Summarization Evaluation across Languages
Fajri Koto
Jey Han Lau
Timothy Baldwin
42
19
0
02 Jun 2021
One Teacher is Enough? Pre-trained Language Model Distillation from
  Multiple Teachers
One Teacher is Enough? Pre-trained Language Model Distillation from Multiple Teachers
Chuhan Wu
Fangzhao Wu
Yongfeng Huang
13
63
0
02 Jun 2021
Question-aware Transformer Models for Consumer Health Question
  Summarization
Question-aware Transformer Models for Consumer Health Question Summarization
S. Yadav
D. Gupta
Asma Ben Abacha
Dina Demner-Fushman
ELM
MedIm
17
28
0
01 Jun 2021
Previous
123...101112...151617
Next