Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.13578
Cited By
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
20 March 2024
Do June Min
Verónica Pérez-Rosas
Kenneth Resnicow
Rada Mihalcea
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation"
4 / 4 papers shown
Title
EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning
Lingxiao Kong
Cong Yang
Susanne Neufang
Oya Beyan
Zeyd Boukhers
OffRL
22
0
0
05 May 2025
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
121
101
0
05 Jun 2022
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
49
15
0
13 Oct 2021
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
192
1,325
0
05 Jun 2016
1