ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.11928
  4. Cited By
GLGE: A New General Language Generation Evaluation Benchmark
v1v2v3 (latest)

GLGE: A New General Language Generation Evaluation Benchmark

Findings (Findings), 2020
24 November 2020
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
Jian Jiao
Weizhu Chen
Jie Fu
Linjun Shou
Ming Gong
Pengcheng Wang
Jiusheng Chen
Daxin Jiang
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
    ELM
ArXiv (abs)PDFHTMLGithub (57★)

Papers citing "GLGE: A New General Language Generation Evaluation Benchmark"

49 / 49 papers shown
Idiom Understanding as a Tool to Measure the Dialect Gap
Idiom Understanding as a Tool to Measure the Dialect Gap
David Beauchemin
Yan Tremblay
Mohamed Amine Youssef
Richard Khoury
232
2
0
06 Oct 2025
QFrBLiMP: a Quebec-French Benchmark of Linguistic Minimal Pairs
QFrBLiMP: a Quebec-French Benchmark of Linguistic Minimal Pairs
David Beauchemin
Pier-Luc Veilleux
Richard Khoury
Johanna-Pascale Roy
216
1
0
30 Sep 2025
QFrCoLA: a Quebec-French Corpus of Linguistic Acceptability Judgments
QFrCoLA: a Quebec-French Corpus of Linguistic Acceptability Judgments
David Beauchemin
Richard Khoury
189
2
0
23 Aug 2025
Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?
Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?
Khloud Al Jallad
Nada Ghneim
Ghaida Rebdawi
LM&MAELM
286
0
0
27 Jul 2025
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic
  Reasoning Tasks
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLMCoGeReLMVLMLRM
242
1
0
17 Oct 2024
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks
  in English
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English
T. Y. S. S. Santosh
Cornelius Weiss
Matthias Grabmair
AILawELM
559
14
0
12 Oct 2024
IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning
IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning
Abhinav Joshi
Shounak Paul
Akshat Sharma
Pawan Goyal
Saptarshi Ghosh
Ashutosh Modi
AILawELM
260
39
0
07 Jul 2024
Language Generation with Strictly Proper Scoring Rules
Language Generation with Strictly Proper Scoring Rules
Chenze Shao
Fandong Meng
Yijin Liu
Jie Zhou
375
7
0
29 May 2024
UT5: Pretraining Non autoregressive T5 with unrolled denoising
UT5: Pretraining Non autoregressive T5 with unrolled denoising
Mahmoud G. Salem
Jiayu Ye
Chu-Cheng Lin
Frederick Liu
AI4CE
186
0
0
14 Nov 2023
Beyond MLE: Convex Learning for Text Generation
Beyond MLE: Convex Learning for Text GenerationNeural Information Processing Systems (NeurIPS), 2023
Chenze Shao
Zhengrui Ma
Min Zhang
Yang Feng
297
4
0
26 Oct 2023
NoCoLA: The Norwegian Corpus of Linguistic Acceptability
NoCoLA: The Norwegian Corpus of Linguistic AcceptabilityNordic Conference of Computational Linguistics (NODALIDA), 2023
Matias Jentoft
David Samuel
270
17
0
13 Jun 2023
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
Dolphin: A Challenging and Diverse Benchmark for Arabic NLGConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
El Moatez Billah Nagoudi
AbdelRahim Elmadany
Ahmed Oumar El-Shangiti
Muhammad Abdul-Mageed
LM&MA
373
29
0
24 May 2023
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
AR-Diffusion: Auto-Regressive Diffusion Model for Text GenerationNeural Information Processing Systems (NeurIPS), 2023
Tong Wu
Zhihao Fan
Xiao Liu
Yeyun Gong
Yelong Shen
...
Juntao Li
Zhongyu Wei
Jian Guo
Nan Duan
Weizhu Chen
VLM
451
134
0
16 May 2023
STORYWARS: A Dataset and Instruction Tuning Baselines for Collaborative
  Story Understanding and Generation
STORYWARS: A Dataset and Instruction Tuning Baselines for Collaborative Story Understanding and GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yulun Du
Lydia B. Chilton
282
8
0
14 May 2023
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive
  Text Generation
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text GenerationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Kun Zhou
Yifan Li
Wayne Xin Zhao
Ji-Rong Wen
DiffM
302
34
0
06 May 2023
NorBench -- A Benchmark for Norwegian Language Models
NorBench -- A Benchmark for Norwegian Language ModelsNordic Conference of Computational Linguistics (NODALIDA), 2023
David Samuel
Andrey Kutuzov
Samia Touileb
Erik Velldal
Lilja Ovrelid
Egil Rønningstad
Elina Sigdel
Anna Palatkina
318
34
0
06 May 2023
Directed Acyclic Transformer Pre-training for High-quality
  Non-autoregressive Text Generation
Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text GenerationTransactions of the Association for Computational Linguistics (TACL), 2023
Fei Huang
Pei Ke
Shiyu Huang
AI4CE
233
13
0
24 Apr 2023
TextBox 2.0: A Text Generation Library with Pre-trained Language Models
TextBox 2.0: A Text Generation Library with Pre-trained Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianyi Tang
Junyi Li
Zhongfu Chen
Yiwen Hu
Zhuohao Yu
...
Xiaoxue Cheng
Yuhao Wang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
165
9
0
26 Dec 2022
Swing Distillation: A Privacy-Preserving Knowledge Distillation
  Framework
Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework
Junzhuo Li
Xinwei Wu
Weilong Dong
Shuangzhi Wu
Chao Bian
Deyi Xiong
417
5
0
16 Dec 2022
Collaborating Heterogeneous Natural Language Processing Tasks via
  Federated Learning
Collaborating Heterogeneous Natural Language Processing Tasks via Federated Learning
Chenhe Dong
Yuexiang Xie
Bolin Ding
Ying Shen
Yaliang Li
FedML
176
6
0
12 Dec 2022
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP
  benchmark for Polish
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for PolishNeural Information Processing Systems (NeurIPS), 2022
Lukasz Augustyniak
Kamil Tagowski
Albert Sawczyn
Denis Janiak
Roman Bartusiak
...
Arkadiusz Janz
Piotr Szymañski
M. Morzy
Tomasz Kajdanowicz
Maciej Piasecki
355
14
0
23 Nov 2022
A Survey of Knowledge Enhanced Pre-trained Language Models
A Survey of Knowledge Enhanced Pre-trained Language ModelsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Linmei Hu
Zeyi Liu
Ziwang Zhao
Lei Hou
Liqiang Nie
Juanzi Li
KELMVLM
525
212
0
11 Nov 2022
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and
  Effective Text Generation
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
308
20
0
24 Oct 2022
P$^3$LM: Probabilistically Permuted Prophet Language Modeling for
  Generative Pre-Training
P3^33LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Junwei Bao
Yifan Wang
Jiangyong Ying
Yeyun Gong
Jing Zhao
Youzheng Wu
Xiaodong He
246
1
0
22 Oct 2022
Draft, Command, and Edit: Controllable Text Editing in E-Commerce
Draft, Command, and Edit: Controllable Text Editing in E-Commerce
Kexin Yang
Dayiheng Liu
Wenqiang Lei
Baosong Yang
Qian Qu
Jiancheng Lv
330
0
0
11 Aug 2022
Joint Generator-Ranker Learning for Natural Language Generation
Joint Generator-Ranker Learning for Natural Language GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Weizhou Shen
Yeyun Gong
Yelong Shen
Song Wang
Xiaojun Quan
Nan Duan
Weizhu Chen
368
6
0
28 Jun 2022
MVP: Multi-task Supervised Pre-training for Natural Language Generation
MVP: Multi-task Supervised Pre-training for Natural Language GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Tianyi Tang
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
320
30
0
24 Jun 2022
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating
  Low-Resource Natural Language Generation in Bangla
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in BanglaFindings (Findings), 2022
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Rifat Shahriyar
AIMatLM&MA
420
54
0
23 May 2022
Near-Negative Distinction: Giving a Second Life to Human Evaluation
  Datasets
Near-Negative Distinction: Giving a Second Life to Human Evaluation DatasetsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Philippe Laban
Chien-Sheng Wu
Wenhao Liu
Caiming Xiong
335
4
0
13 May 2022
Learning to Transfer Prompts for Text Generation
Learning to Transfer Prompts for Text GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Junyi Li
Tianyi Tang
J. Nie
Ji-Rong Wen
Wayne Xin Zhao
234
44
0
03 May 2022
Variational Autoencoder with Disentanglement Priors for Low-Resource
  Task-Specific Natural Language Generation
Variational Autoencoder with Disentanglement Priors for Low-Resource Task-Specific Natural Language GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhuang Li
Zhuang Li
Xingliang Yuan
Tongtong Wu
Tianyang Zhan
Gholamreza Haffari
CoGeUDDRL
343
5
0
27 Feb 2022
MuLD: The Multitask Long Document Benchmark
MuLD: The Multitask Long Document BenchmarkInternational Conference on Language Resources and Evaluation (LREC), 2022
G. Hudson
Noura Al Moubayed
253
12
0
15 Feb 2022
Pretrained Language Models for Text Generation: A Survey
Pretrained Language Models for Text Generation: A SurveyACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
652
288
0
14 Jan 2022
CUGE: A Chinese Language Understanding and Generation Evaluation
  Benchmark
CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark
Yuan Yao
Qingxiu Dong
Jian Guan
Boxi Cao
Zhengyan Zhang
...
Zhiyuan Liu
Xianpei Han
Erhong Yang
Zhifang Sui
Maosong Sun
ALMELM
294
22
0
27 Dec 2021
Improving Non-autoregressive Generation with Mixup Training
Improving Non-autoregressive Generation with Mixup Training
Ting Jiang
Shaohan Huang
Zihan Zhang
Deqing Wang
Fuzhen Zhuang
Furu Wei
Haizhen Huang
Liangjie Zhang
Tao Gui
152
10
0
21 Oct 2021
Compression, Transduction, and Creation: A Unified Framework for
  Evaluating Natural Language Generation
Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation
Mingkai Deng
Bowen Tan
Zhengzhong Liu
Eric Xing
Zhiting Hu
237
80
0
14 Sep 2021
Asking Questions Like Educational Experts: Automatically Generating
  Question-Answer Pairs on Real-World Examination Data
Asking Questions Like Educational Experts: Automatically Generating Question-Answer Pairs on Real-World Examination Data
Fanyi Qu
Xin Jia
Hao Sun
AI4Ed
299
27
0
11 Sep 2021
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text
  Understanding and Generation
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and GenerationTransactions of the Association for Computational Linguistics (TACL), 2021
Jian Guan
Zhuoer Feng
Yamei Chen
Ru He
Xiaoxi Mao
Changjie Fan
Shiyu Huang
281
37
0
30 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural
  Language Processing
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLMLM&MA
410
321
0
12 Aug 2021
Human Evaluation of Creative NLG Systems: An Interdisciplinary Survey on
  Recent Papers
Human Evaluation of Creative NLG Systems: An Interdisciplinary Survey on Recent Papers
Mika Hämäläinen
Khalid Alnajjar
ELMLM&MA
251
22
0
31 Jul 2021
Indian Legal NLP Benchmarks : A Survey
Indian Legal NLP Benchmarks : A Survey
Prathamesh Kalamkar
Janani Venugopalan
Vivek Raghavan
ELMAILawVLM
188
8
0
13 Jul 2021
GEM: A General Evaluation Benchmark for Multimodal Tasks
GEM: A General Evaluation Benchmark for Multimodal TasksFindings (Findings), 2021
Lin Su
Nan Duan
Edward Cui
Lei Ji
Chenfei Wu
Huaishao Luo
Yongfei Liu
Ming Zhong
Taroon Bharti
Arun Sacheti
VLM
274
22
0
18 Jun 2021
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language
  Generation
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xin Liu
Baosong Yang
Dayiheng Liu
Haibo Zhang
Weihua Luo
Min Zhang
Haiying Zhang
Jinsong Su
187
20
0
11 Jun 2021
EL-Attention: Memory Efficient Lossless Attention for Generation
EL-Attention: Memory Efficient Lossless Attention for GenerationInternational Conference on Machine Learning (ICML), 2021
Yu Yan
Jiusheng Chen
Weizhen Qi
Nikhil Bhendawade
Yeyun Gong
Nan Duan
Ruofei Zhang
VLM
213
9
0
11 May 2021
Prediction, Selection, and Generation: Exploration of Knowledge-Driven
  Conversation System
Prediction, Selection, and Generation: Exploration of Knowledge-Driven Conversation System
Cheng Luo
Dayiheng Liu
Chanjuan Li
Li Lu
Jiancheng Lv
175
0
0
23 Apr 2021
Problems and Countermeasures in Natural Language Processing Evaluation
Problems and Countermeasures in Natural Language Processing Evaluation
Qingxiu Dong
Zhifang Sui
Weidong Zhan
Baobao Chang
ELM
122
3
0
20 Apr 2021
The GEM Benchmark: Natural Language Generation, its Evaluation and
  Metrics
The GEM Benchmark: Natural Language Generation, its Evaluation and MetricsIEEE Games Entertainment Media Conference (IEEE GEM), 2021
Sebastian Gehrmann
Tosin Adewumi
Karmanya Aggarwal
Pawan Sasanka Ammanamanchi
Aremu Anuoluwapo
...
Nishant Subramani
Wei Xu
Diyi Yang
Akhila Yerukola
Jiawei Zhou
VLM
972
315
0
02 Feb 2021
BANG: Bridging Autoregressive and Non-autoregressive Generation with
  Large Scale Pretraining
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale PretrainingInternational Conference on Machine Learning (ICML), 2020
Weizhen Qi
Yeyun Gong
Jian Jiao
Yu Yan
Weizhu Chen
...
Houqiang Li
Jiusheng Chen
Ruofei Zhang
Ming Zhou
Nan Duan
375
52
0
31 Dec 2020
A Survey of Knowledge-Enhanced Text Generation
A Survey of Knowledge-Enhanced Text GenerationACM Computing Surveys (ACM CSUR), 2020
Wenhao Yu
Chenguang Zhu
Zaitang Li
Zhiting Hu
Qingyun Wang
Heng Ji
Meng Jiang
538
333
0
09 Oct 2020
1
Page 1 of 1