ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.05382
  4. Cited By
Skill Preferences: Learning to Extract and Execute Robotic Skills from
  Human Feedback

Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

11 August 2021
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
ArXivPDFHTML

Papers citing "Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback"

29 / 29 papers shown
Title
Effects of Robot Competency and Motion Legibility on Human Correction Feedback
Effects of Robot Competency and Motion Legibility on Human Correction Feedback
Shuangge Wang
Anjiabei Wang
Sofiya Goncharova
Brian Scassellati
Tesca Fitzgerald
36
1
0
08 Jan 2025
Large Language Model guided Deep Reinforcement Learning for Decision
  Making in Autonomous Driving
Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving
Hao Pang
Zhenpo Wang
Guoqiang Li
43
1
0
24 Dec 2024
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from
  Intervention
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
Yuxin Chen
Chen Tang
Chenran Li
Ran Tian
Peter Stone
Masayoshi Tomizuka
Wei Zhan
23
1
0
24 Jun 2024
Efficient Preference-based Reinforcement Learning via Aligned Experience
  Estimation
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
Fengshuo Bai
Rui Zhao
Hongming Zhang
Sijia Cui
Ying Wen
Yaodong Yang
Bo Xu
Lei Han
OffRL
32
6
0
29 May 2024
TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
Yunfan Jiang
Chen Wang
Ruohan Zhang
Jiajun Wu
Fei-Fei Li
OnRL
37
26
0
16 May 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
Matthew E. Taylor
OffRL
46
2
0
30 Apr 2024
Diffusion Model for Data-Driven Black-Box Optimization
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
40
9
0
20 Mar 2024
Sample-Efficient Preference-based Reinforcement Learning with Dynamics
  Aware Rewards
Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards
Katherine Metcalf
Miguel Sarabia
Natalie Mackraz
B. Theobald
37
6
0
28 Feb 2024
PREDILECT: Preferences Delineated with Zero-Shot Language-based
  Reasoning in Reinforcement Learning
PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning
Simon Holk
Daniel Marta
Iolanda Leite
42
12
0
23 Feb 2024
Scalable Interactive Machine Learning for Future Command and Control
Scalable Interactive Machine Learning for Future Command and Control
Anna Madison
Ellen R. Novoseller
Vinicius G. Goecks
Benjamin T. Files
Nicholas R. Waytowich
Alfred Yu
Vernon J. Lawhern
Steven Thurman
Christopher Kelshaw
Kaleb McDowell
35
4
0
09 Feb 2024
Accelerating Reinforcement Learning of Robotic Manipulations via
  Feedback from Large Language Models
Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models
Kun-Mo Chu
Xufeng Zhao
C. Weber
Mengdi Li
Stefan Wermter
LLMAG
LM&Ro
46
14
0
04 Nov 2023
Socially Cognizant Robotics for a Technology Enhanced Society
Socially Cognizant Robotics for a Technology Enhanced Society
Kristin J. Dana
Clinton Andrews
Kostas Bekris
Jacob Feldman
Matthew Stone
Pernille Hemmer
Aaron Mazzeo
Hal Salzman
Jingang Yi
18
0
0
27 Oct 2023
Sample Complexity of Preference-Based Nonparametric Off-Policy
  Evaluation with Deep Networks
Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks
Zihao Li
Xiang Ji
Minshuo Chen
Mengdi Wang
OffRL
29
0
0
16 Oct 2023
Learning Optimal Advantage from Preferences and Mistaking it for Reward
Learning Optimal Advantage from Preferences and Mistaking it for Reward
W. B. Knox
Stephane Hatgis-Kessell
Sigurdur O. Adalgeirsson
Serena Booth
Anca D. Dragan
Peter Stone
S. Niekum
30
12
0
03 Oct 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward
  Learning
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
Dingwen Kong
Lin F. Yang
34
9
0
18 Apr 2023
Controlled Diversity with Preference : Towards Learning a Diverse Set of
  Desired Skills
Controlled Diversity with Preference : Towards Learning a Diverse Set of Desired Skills
Maxence Hussonnois
Thommen George Karimpanal
Santu Rana
36
4
0
07 Mar 2023
Open Problems in Applied Deep Learning
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
42
2
0
26 Jan 2023
Few-Shot Preference Learning for Human-in-the-Loop RL
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
32
92
0
06 Dec 2022
Discovering Generalizable Spatial Goal Representations via Graph-based
  Active Reward Learning
Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning
Aviv Netanyahu
Tianmin Shu
J. Tenenbaum
Pulkit Agrawal
24
5
0
24 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning
  During Deployment
Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment
Huihan Liu
Soroush Nasiriany
Lance Zhang
Zhiyao Bao
Yuke Zhu
38
56
0
15 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based
  Reinforcement Learning
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
38
4
0
12 Nov 2022
Dichotomy of Control: Separating What You Can Control from What You
  Cannot
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
25
42
0
24 Oct 2022
When to Ask for Help: Proactive Interventions in Autonomous
  Reinforcement Learning
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
Annie Xie
Fahim Tajwar
Archit Sharma
Chelsea Finn
45
14
0
19 Oct 2022
ASPiRe:Adaptive Skill Priors for Reinforcement Learning
ASPiRe:Adaptive Skill Priors for Reinforcement Learning
Mengda Xu
Manuela Veloso
Shuran Song
CLL
OffRL
24
10
0
30 Sep 2022
Transformers are Adaptable Task Planners
Transformers are Adaptable Task Planners
Vidhi Jain
Yixin Lin
Eric Undersander
Yonatan Bisk
Akshara Rai
25
24
0
06 Jul 2022
Models of human preference for learning reward functions
Models of human preference for learning reward functions
W. B. Knox
Stephane Hatgis-Kessell
Serena Booth
S. Niekum
Peter Stone
A. Allievi
27
43
0
05 Jun 2022
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
Mengjiao Yang
Sergey Levine
Ofir Nachum
OffRL
41
42
0
27 Oct 2021
Fine-Tuning Language Models from Human Preferences
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
298
1,610
0
18 Sep 2019
1