Zeroth-Order Supervised Policy Improvement

11 June 2020

Papers citing "Zeroth-Order Supervised Policy Improvement"

2 / 2 papers shown

Title
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond Hao Sun OffRL 34 21 0 09 Oct 2023
Off-Policy Actor-Critic T. Degris Martha White R. Sutton OffRL CML 163 220 0 22 May 2012