Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.05610
Cited By
Representation Alignment from Human Feedback for Cross-Embodiment Reward Learning from Mixed-Quality Demonstrations
10 August 2024
Connor Mattson
Anurag Aribandi
Daniel S. Brown
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Representation Alignment from Human Feedback for Cross-Embodiment Reward Learning from Mixed-Quality Demonstrations"
5 / 5 papers shown
Title
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery
Zohre Karimi
Shing-Hei Ho
Bao Thach
Alan Kuntz
Daniel S. Brown
OffRL
27
7
0
10 Apr 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
Cross-Domain Imitation Learning via Optimal Transport
Arnaud Fickinger
Samuel N. Cohen
Stuart J. Russell
Brandon Amos
OT
40
47
0
07 Oct 2021
Learning Reward Functions from Scale Feedback
Nils Wilde
Erdem Biyik
Dorsa Sadigh
Stephen L. Smith
39
32
0
01 Oct 2021
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
147
469
0
06 Aug 2021
1