2303.07046
Cited By
Deploying Offline Reinforcement Learning with Human Feedback
13 March 2023
Ziniu Li
Kelvin Xu
Liu Liu
Lanqing Li
Deheng Ye
P. Zhao
OffRL
Papers citing "Deploying Offline Reinforcement Learning with Human Feedback"
5 / 5 papers shown
Title
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
89
1
0
13 Feb 2025
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
328
11,953
0
04 Mar 2022
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
50
20
0
15 Oct 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning
Yifan Zhang
P. Zhao
Qingyao Wu
Bin Li
Junzhou Huang
Mingkui Tan
OOD
60
95
0
06 Mar 2020