1907.05634
Cited By
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
12 July 2019
Yuping Luo
Huazhe Xu
Tengyu Ma
SSL
Papers citing
"Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling"
3 / 3 papers shown
Title
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
67
0
0
04 Jan 2025
Social NCE: Contrastive Learning of Socially-aware Motion Representations
Yuejiang Liu
Qi Yan
Alexandre Alahi
29
101
0
21 Dec 2020
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
160
220
0
22 May 2012