v1v2 (latest)

Text Generation by Learning from Demonstrations

16 September 2020

Papers citing "Text Generation by Learning from Demonstrations"

46 / 46 papers shown

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

310

07 Aug 2025

Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards

328

25 Jun 2025

Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs

Eric Thibodeau-Laufer

Sándor Toth

Sam Work

OffRL

577

18 Mar 2025

Sequence-level Large Language Model Training with Contrastive Preference OptimizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

450

23 Feb 2025

Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization

Eng Siong Chng

268

02 Jul 2024

Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation

383

14 Jan 2024

Successor Features for Efficient Multisubject Controlled Text Generation

Mengyao Cao

Mehdi Fatemi

Jackie Chi Kit Cheung

Samira Shabanian

BDL

208

03 Nov 2023

Beyond MLE: Convex Learning for Text GenerationNeural Information Processing Systems (NeurIPS), 2023

299

26 Oct 2023

Building Persona Consistent Dialogue Agents with Offline Reinforcement LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Ryan Shea

Zhou Yu

OffRL

364

16 Oct 2023

EMO: Earth Mover Distance Optimization for Auto-Regressive Language ModelingInternational Conference on Learning Representations (ICLR), 2023

Siyu Ren

Zhiyong Wu

Kenny Q. Zhu

441

07 Oct 2023

Language Model Decoding as Direct Metrics OptimizationInternational Conference on Learning Representations (ICLR), 2023

392

02 Oct 2023

Reinforcement Learning for Generative AI: A Survey

Yuanjiang Cao

595

28 Aug 2023

Prompt-Based Length Controlled Generation with Reinforcement Learning

Renlong Jie

Xiaojun Meng

Lifeng Shang

Xin Jiang

Qun Liu

443

23 Aug 2023

Reinforced Self-Training (ReST) for Language Modeling

...

512

423

17 Aug 2023

ESRL: Efficient Sampling-based Reinforcement Learning for Sequence GenerationAAAI Conference on Artificial Intelligence (AAAI), 2023

Jingbo Zhu

295

04 Aug 2023

On the Effectiveness of Offline RL for Dialogue Response GenerationInternational Conference on Machine Learning (ICML), 2023

242

23 Jul 2023

On the Efficacy of Sampling AdaptersAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

418

07 Jul 2023

Semi-Offline Reinforcement Learning for Optimized Text GenerationInternational Conference on Machine Learning (ICML), 2023

Rui Yan

257

16 Jun 2023

MiniLLM: Knowledge Distillation of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

735

14 Jun 2023

Preference-grounded Token-level Guidance for Language Model Fine-tuningNeural Information Processing Systems (NeurIPS), 2023

573

01 Jun 2023

Zero-shot Visual Question Answering with Language Model FeedbackAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

383

26 May 2023

Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language ModelsInternational Conference on Learning Representations (ICLR), 2023

Faeze Brahman

465

24 May 2023

On Learning to Summarize with Large Language Models as ReferencesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

Arman Cohan

568

131

23 May 2023

Think Outside the Code: Brainstorming Boosts Large Language Models in Code Generation

249

18 May 2023

Self-Edit: Fault-Aware Code Editor for Code GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

625

164

06 May 2023

GEMINI: Controlling the Sentence-level Writing Style for Abstractive Text SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Guangsheng Bao

Zebin Ou

Yue Zhang

255

07 Apr 2023

SPEC: Summary Preference Decomposition for Low-Resource Abstractive SummarizationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Yi-Syuan Chen

Yun-Zhu Song

Hong-Han Shuai

230

24 Mar 2023

Tailoring Language Generation Models under Total Variation DistanceInternational Conference on Learning Representations (ICLR), 2023

311

26 Feb 2023

Learning with Rejection for Abstractive Text SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

228

16 Feb 2023

Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation

Xiang Lin

Prathyusha Jwalapuram

Shafiq Joty

DiffM

248

31 Jan 2023

Weakly-Supervised Questions for Zero-Shot Relation ExtractionConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Saeed Najafi

Alona Fyshe

289

21 Jan 2023

Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

...

403

164

15 Dec 2022

KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Kun Qian

333

30 Nov 2022

GoSum: Extractive Summarization of Long Documents by Reinforcement Learning and Graph Organized discourse stateKnowledge and Information Systems (KAIS), 2022

336

18 Nov 2022

Reward Gaming in Conditional Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

457

16 Nov 2022

Teacher Forcing Recovers Reward Functions for Text GenerationNeural Information Processing Systems (NeurIPS), 2022

510

17 Oct 2022

Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization

Rajkumar Ramamurthy

Prithviraj Ammanabrolu

Yejin Choi

686

289

03 Oct 2022

Text Summarization with Oracle ExpectationInternational Conference on Learning Representations (ICLR), 2022

Yumo Xu

Mirella Lapata

VLM

208

26 Sep 2022

MAD for Robust Reinforcement Learning in Machine Translation

284

18 Jul 2022

Coarse-to-Fine Vision-Language Pre-training with Fusion in the BackboneNeural Information Processing Systems (NeurIPS), 2022

...

343

159

15 Jun 2022

Offline RL for Natural Language Generation with Implicit Language Q LearningInternational Conference on Learning Representations (ICLR), 2022

515

143

05 Jun 2022

Knowledge Infused DecodingInternational Conference on Learning Representations (ICLR), 2022

Ruibo Liu

Ahmed Hassan Awadallah

KELM

269

06 Apr 2022

BRIO: Bringing Order to Abstractive SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Yixin Liu

Pengfei Liu

Dragomir R. Radev

Graham Neubig

465

325

31 Mar 2022

Amortized Noisy Channel Neural Machine Translation

Richard Yuanzhe Pang

He He

Dong Wang

255

16 Dec 2021

Improving Scheduled Sampling with Elastic Weight Consolidation for Neural Machine Translation

Michalis Korakakis

Andreas Vlachos

CLL

267

13 Sep 2021

AgreeSum: Agreement-Oriented Multi-Document SummarizationFindings (Findings), 2021

256

04 Jun 2021