v1v2 (latest)

Teacher Forcing Recovers Reward Functions for Text Generation

Neural Information Processing Systems (NeurIPS), 2022

17 October 2022

Papers citing "Teacher Forcing Recovers Reward Functions for Text Generation"

8 / 8 papers shown

An Inertial Sequence Learning Framework for Vehicle Speed Estimation via Smartphone IMU

Xuan Xiao

Xiaotong Ren

Haitao Li

192

24 May 2025

KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation

473

26 Apr 2025

Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning

330

16 Oct 2024

LLMR: Knowledge Distillation with a Large Language Model-Induced RewardInternational Conference on Language Resources and Evaluation (LREC), 2024

Dongheng Li

Yongchang Hao

Lili Mou

365

19 Sep 2024

A Critical Look At Tokenwise Reward-Guided Text Generation

630

12 Jun 2024

MiniLLM: Knowledge Distillation of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

682

14 Jun 2023

Token Imbalance Adaptation for Radiology Report GenerationACM Conference on Health, Inference, and Learning (CHIL), 2023

226

18 Apr 2023

Inverse Reinforcement Learning for Text SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Yujiao Fu

Deyi Xiong

Yue Dong

334

19 Dec 2022