Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

12 July 2019

Papers citing "Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling"

3 / 3 papers shown

Title
SR-Reward: Taking The Path More Traveled Seyed Mahdi Basiri Azad Zahra Padar Gabriel Kalweit Joschka Boedecker OffRL 67 0 0 04 Jan 2025
Social NCE: Contrastive Learning of Socially-aware Motion Representations Yuejiang Liu Qi Yan Alexandre Alahi 29 101 0 21 Dec 2020
Off-Policy Actor-Critic T. Degris Martha White R. Sutton OffRL CML 160 220 0 22 May 2012