Teacher Forcing Recovers Reward Functions for Text Generation

Neural Information Processing Systems (NeurIPS), 2022
17 October 2022
Yongchang Hao
Yuxin Liu
Lili Mou
OffRL

Papers citing "Teacher Forcing Recovers Reward Functions for Text Generation"

8 / 8 papers shown
An Inertial Sequence Learning Framework for Vehicle Speed Estimation via Smartphone IMU
Xuan Xiao
Xiaotong Ren
Haitao Li
192
0
0
24 May 2025
KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation
Jiabin Fan
Guoqing Luo
Michael Bowling
Lili Mou
OffRL
473
0
0
26 Apr 2025
Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning
Jared Joselowitz
Ritam Majumdar
Arjun Jagota
Matthieu Bou
Nyal Patel
Satyapriya Krishna
Sonali Parbhoo
330
0
0
16 Oct 2024
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
International Conference on Language Resources and Evaluation (LREC), 2024
Dongheng Li
Yongchang Hao
Lili Mou
365
6
0
19 Sep 2024
A Critical Look At Tokenwise Reward-Guided Text Generation
Ahmad Rashid
Ruotian Wu
Julia Grosse
Agustinus Kristiadi
Pascal Poupart
OffRL
630
6
0
12 Jun 2024
MiniLLM: Knowledge Distillation of Large Language Models
International Conference on Learning Representations (ICLR), 2023
Yuxian Gu
Li Dong
Furu Wei
Shiyu Huang
ALM
682
94
0
14 Jun 2023
Token Imbalance Adaptation for Radiology Report Generation
ACM Conference on Health, Inference, and Learning (CHIL), 2023
Yuexin Wu
I. Huang
Xiaolei Huang
MedIm
226
13
0
18 Apr 2023
Inverse Reinforcement Learning for Text Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yujiao Fu
Deyi Xiong
Yue Dong
334
5
0
19 Dec 2022