2006.06600
Cited By
Zeroth-Order Supervised Policy Improvement
11 June 2020
Hao Sun
Ziping Xu
Yuhang Song
Meng Fang
Jiechao Xiong
Bo Dai
Bolei Zhou
OffRL
Papers citing "Zeroth-Order Supervised Policy Improvement"
2 / 2 papers shown
Title
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012