2306.03615
Cited By
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation
6 June 2023
Runze Liu
Yali Du
Fengshuo Bai
Jiafei Lyu
Xiu Li
Papers citing
"PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation"
6 / 6 papers shown
Title
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani, M. E. Taylor
OffRL
38 | 2 | 0
30 Apr 2024

Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Siddhant Haldar, Vaibhav Mathur, Denis Yarats, Lerrel Pinto
46 | 62 | 0
30 Jun 2022

Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe
OSLM, ALM
303 | 11,881 | 0
04 Mar 2022

Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov, Ashvin Nair, Sergey Levine
OffRL
212 | 832 | 0
12 Oct 2021

Cross-Domain Imitation Learning via Optimal Transport
Arnaud Fickinger, Samuel N. Cohen, Stuart J. Russell, Brandon Amos
OT
40 | 47 | 0
07 Oct 2021

What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar, Danfei Xu, J. Wong, Soroush Nasiriany, Chen Wang, Rohun Kulkarni, Li Fei-Fei, Silvio Savarese, Yuke Zhu, Roberto Martín-Martín
OffRL
147 | 469 | 0
06 Aug 2021