v1v2v3 (latest)

GLGE: A New General Language Generation Evaluation Benchmark

Findings (Findings), 2020

24 November 2020

Dayiheng Liu

ArXiv (abs)PDF HTML Github (57★)

Papers citing "GLGE: A New General Language Generation Evaluation Benchmark"

49 / 49 papers shown

Idiom Understanding as a Tool to Measure the Dialect Gap

David Beauchemin

Yan Tremblay

Mohamed Amine Youssef

Richard Khoury

240

06 Oct 2025

QFrBLiMP: a Quebec-French Benchmark of Linguistic Minimal Pairs

226

30 Sep 2025

QFrCoLA: a Quebec-French Corpus of Linguistic Acceptability Judgments

David Beauchemin

Richard Khoury

200

23 Aug 2025

Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?

288

27 Jul 2025

VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks

Shailaja Keyur Sampat

Yezhou Yang

MLLM CoGe ReLM VLM LRM

251

17 Oct 2024

LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English

563

12 Oct 2024

IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning

265

07 Jul 2024

Language Generation with Strictly Proper Scoring Rules

382

29 May 2024

UT5: Pretraining Non autoregressive T5 with unrolled denoising

189

14 Nov 2023

Beyond MLE: Convex Learning for Text GenerationNeural Information Processing Systems (NeurIPS), 2023

299

26 Oct 2023

NoCoLA: The Norwegian Corpus of Linguistic AcceptabilityNordic Conference of Computational Linguistics (NODALIDA), 2023

Matias Jentoft

David Samuel

278

13 Jun 2023

Dolphin: A Challenging and Diverse Benchmark for Arabic NLGConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

El Moatez Billah Nagoudi

AbdelRahim Elmadany

Ahmed Oumar El-Shangiti

Muhammad Abdul-Mageed

LM&MA

377

24 May 2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text GenerationNeural Information Processing Systems (NeurIPS), 2023

...

462

138

16 May 2023

STORYWARS: A Dataset and Instruction Tuning Baselines for Collaborative Story Understanding and GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Yulun Du

Lydia B. Chilton

294

14 May 2023

Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text GenerationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

306

06 May 2023

NorBench -- A Benchmark for Norwegian Language ModelsNordic Conference of Computational Linguistics (NODALIDA), 2023

329

06 May 2023

Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text GenerationTransactions of the Association for Computational Linguistics (TACL), 2023

Fei Huang

Pei Ke

Shiyu Huang

AI4CE

238

24 Apr 2023

TextBox 2.0: A Text Generation Library with Pre-trained Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

...

166

26 Dec 2022

Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework

425

16 Dec 2022

Collaborating Heterogeneous Natural Language Processing Tasks via Federated Learning

180

12 Dec 2022

This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for PolishNeural Information Processing Systems (NeurIPS), 2022

...

355

23 Nov 2022

A Survey of Knowledge Enhanced Pre-trained Language ModelsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022

Lei Hou

Juanzi Li

549

214

11 Nov 2022

ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

311

24 Oct 2022

P$^3$LM: Probabilistically Permuted Prophet Language Modeling for
Generative Pre-Training

^3

LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

249

22 Oct 2022

Draft, Command, and Edit: Controllable Text Editing in E-Commerce

Dayiheng Liu

335

11 Aug 2022

Joint Generator-Ranker Learning for Natural Language GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

386

28 Jun 2022

MVP: Multi-task Supervised Pre-training for Natural Language GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

324

24 Jun 2022

BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in BanglaFindings (Findings), 2022

430

23 May 2022

Near-Negative Distinction: Giving a Second Life to Human Evaluation DatasetsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

352

13 May 2022

Learning to Transfer Prompts for Text GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

243

03 May 2022

Variational Autoencoder with Disentanglement Priors for Low-Resource Task-Specific Natural Language GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

348

27 Feb 2022

MuLD: The Multitask Long Document BenchmarkInternational Conference on Language Resources and Evaluation (LREC), 2022

G. Hudson

Noura Al Moubayed

254

15 Feb 2022

Pretrained Language Models for Text Generation: A SurveyACM Computing Surveys (ACM CSUR), 2022

665

288

14 Jan 2022

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark

Yuan Yao

Qingxiu Dong

Jian Guan

Boxi Cao

Zhengyan Zhang

...

Zhiyuan Liu

Xianpei Han

Erhong Yang

Zhifang Sui

Maosong Sun

ALM ELM

294

27 Dec 2021

Improving Non-autoregressive Generation with Mixup Training

153

21 Oct 2021

Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation

240

14 Sep 2021

Asking Questions Like Educational Experts: Automatically Generating Question-Answer Pairs on Real-World Examination Data

303

11 Sep 2021

LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and GenerationTransactions of the Association for Computational Linguistics (TACL), 2021

Changjie Fan

291

30 Aug 2021

AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing

Katikapalli Subramanyam Kalyan

A. Rajasekharan

S. Sangeetha

VLM LM&MA

437

322

12 Aug 2021

Human Evaluation of Creative NLG Systems: An Interdisciplinary Survey on Recent Papers

Mika Hämäläinen

Khalid Alnajjar

ELM LM&MA

254

31 Jul 2021

Indian Legal NLP Benchmarks : A Survey

191

13 Jul 2021

GEM: A General Evaluation Benchmark for Multimodal TasksFindings (Findings), 2021

278

18 Jun 2021

Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Dayiheng Liu

Min Zhang

193

11 Jun 2021

EL-Attention: Memory Efficient Lossless Attention for GenerationInternational Conference on Machine Learning (ICML), 2021

229

11 May 2021

Prediction, Selection, and Generation: Exploration of Knowledge-Driven Conversation System

Dayiheng Liu

176

23 Apr 2021

Problems and Countermeasures in Natural Language Processing Evaluation

Qingxiu Dong

Zhifang Sui

Weidong Zhan

Baobao Chang

ELM

128

20 Apr 2021

The GEM Benchmark: Natural Language Generation, its Evaluation and MetricsIEEE Games Entertainment Media Conference (IEEE GEM), 2021

Sebastian Gehrmann

Tosin Adewumi

Karmanya Aggarwal

Pawan Sasanka Ammanamanchi

...

Diyi Yang

982

316

02 Feb 2021

BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale PretrainingInternational Conference on Machine Learning (ICML), 2020

...

398

31 Dec 2020

A Survey of Knowledge-Enhanced Text GenerationACM Computing Surveys (ACM CSUR), 2020

Wenhao Yu

Chenguang Zhu

Zaitang Li

Zhiting Hu

Qingyun Wang

Heng Ji

Meng Jiang

561

333

09 Oct 2020