1711.02827
Inverse Reward Design
8 November 2017
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
Papers citing
"Inverse Reward Design"
50 / 265 papers shown
Dataset Poisoning Attacks on Behavioral Cloning Policies
Akansha Kalra
Soumil Datta
Ethan Gilmore
Duc La
Guanhong Tao
Daniel S. Brown
AAML
OffRL
159
0
0
26 Nov 2025
Learning Where, What and How to Transfer: A Multi-Role Reinforcement Learning Approach for Evolutionary Multitasking
Jiajun Zhan
Zeyuan Ma
Yue-Jiao Gong
Kay Chen Tan
OffRL
154
0
0
19 Nov 2025
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning
Shengjie Sun
Jiafei Lyu
Runze Liu
Mengbei Yan
Bo Liu
Deheng Ye
Xiu Li
OffRL
190
0
0
14 Nov 2025
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
Addison J. Wu
Ryan Liu
Xuechunzi Bai
Thomas Griffiths
104
0
0
08 Nov 2025
Restoring Noisy Demonstration for Imitation Learning With Diffusion Models
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025
Shang-Fu Chen
Co Yong
Shao-Hua Sun
DiffM
88
0
0
16 Oct 2025
Training LLM Agents to Empower Humans
Evan Ellis
Vivek Myers
Jens Tuyls
Sergey Levine
Anca Dragan
Benjamin Eysenbach
134
0
0
15 Oct 2025
Repairing Reward Functions with Human Feedback to Mitigate Reward Hacking
Stephane Hatgis-Kessell
Logan Mondal Bhamidipaty
Emma Brunskill
81
0
0
14 Oct 2025
Control Synthesis of Cyber-Physical Systems for Real-Time Specifications through Causation-Guided Reinforcement Learning
Xiaochen Tang
Zhenya Zhang
Miaomiao Zhang
Jie An
60
0
0
09 Oct 2025
Learning from Failures: Understanding LLM Alignment through Failure-Aware Inverse RL
Nyal Patel
Matthieu Bou
Arjun Jagota
Satyapriya Krishna
Sonali Parbhoo
53
0
0
07 Oct 2025
Failure Modes of Maximum Entropy RLHF
Ömer Veysel Çağatan
Barış Akgün
61
0
0
24 Sep 2025
Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
Chirayu Nimonkar
Shlok Shah
Catherine Ji
Benjamin Eysenbach
118
1
0
12 Sep 2025
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Xue Yang
Yuxin Zuo
Jiale Yu
Xicheng Zhang
Z. Yang
...
Shanghang Zhang
Y. Wang
Yao Mu
Bowen Zhou
Ning Ding
OffRL
LRM
103
17
0
11 Sep 2025
Symmetry-Guided Multi-Agent Inverse Reinforcement Learning
Yongkai Tian
Yirong Qi
Xin Yu
Wenjun Wu
Jie Luo
123
0
0
10 Sep 2025
Text2Touch: Tactile In-Hand Manipulation with LLM-Designed Reward Functions
Harrison Field
Max Yang
Yijiong Lin
Efi Psomopoulou
David A.W. Barton
Nathan Lepora
52
0
0
09 Sep 2025
An Economy of AI Agents
Gillian K. Hadfield
Andrew Koh
152
4
0
01 Sep 2025
GPLight+: A Genetic Programming Method for Learning Symmetric Traffic Signal Control Policy
IEEE Transactions on Evolutionary Computation (IEEE Trans. Evol. Comput.), 2025
Xiao-Cheng Liao
Yi Mei
Mengjie Zhang
61
2
0
22 Aug 2025
Learning from Preferences and Mixed Demonstrations in General Settings
Jason Brown
Carl Henrik Ek
Robert D. Mullins
76
0
0
19 Aug 2025
Causal Reward Adjustment: Mitigating Reward Hacking in External Reasoning via Backdoor Correction
Ruike Song
Zeen Song
Huijie Guo
Wenwen Qiang
LRM
68
0
0
06 Aug 2025
Policy Learning from Large Vision-Language Model Feedback without Reward Modeling
Tung M. Luu
Donghoon Lee
Younghwan Lee
Chang D. Yoo
OffRL
123
0
0
31 Jul 2025
Inference-Time Reward Hacking in Large Language Models
Hadi Khalaf
C. M. Verdun
Alex Oesterling
Himabindu Lakkaraju
Flavio du Pin Calmon
169
1
0
24 Jun 2025
PB$^2$: Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning
Brahim Driss
Alex Davey
Riad Akrour
136
0
0
16 Jun 2025
Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design
Andreas Schlaginhaufen
Reda Ouhamma
Maryam Kamgarpour
180
1
0
11 Jun 2025
Provable Reinforcement Learning from Human Feedback with an Unknown Link Function
Qining Zhang
Lei Ying
196
0
0
03 Jun 2025
Apprenticeship learning with prior beliefs using inverse optimization
Mauricio Junca
Esteban Leiva
157
0
0
27 May 2025
Learning Pareto-Optimal Rewards from Noisy Preferences: A Framework for Multi-Objective Inverse Reinforcement Learning
Kalyan Cherukuri
Aarav Lala
169
0
0
17 May 2025
Super Co-alignment of Human and AI for Sustainable Symbiotic Society
Yi Zeng
Yijiao Wang
Enmeng Lu
Dongcheng Zhao
Bing Han
...
Chao Liu
Yaodong Yang
Yi Zeng
Boyuan Chen
Jinyu Fan
492
1
0
24 Apr 2025
FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions
IEEE International Conference on Robotics and Automation (ICRA), 2025
Daniel Marta
Simon Holk
Miguel Vasco
Jens Lundell
Timon Homberger
F. L. Busch
Olov Andersson
Iolanda Leite
317
2
0
14 Apr 2025
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Younghwan Lee
Tung M. Luu
Donghoon Lee
Chang D. Yoo
3DV
VLM
OffRL
262
1
0
03 Apr 2025
Reward Training Wheels: Adaptive Auxiliary Rewards for Robotics Reinforcement Learning
Linji Wang
Tong Xu
Yuanjie Lu
Xuesu Xiao
261
1
0
19 Mar 2025
Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm
Haksub Kim
Kanghoon Lee
Minjun Kim
Jiachen Li
Jinkyoo Park
323
3
0
05 Mar 2025
Societal Alignment Frameworks Can Improve LLM Alignment
Karolina Stańczak
Nicholas Meade
Mehar Bhatia
Hattie Zhou
Konstantin Böttinger
...
Timothy P. Lillicrap
Ana Marasović
Sylvie Delacroix
Gillian K. Hadfield
Siva Reddy
944
3
0
27 Feb 2025
Your Learned Constraint is Secretly a Backward Reachable Tube
Mohamad Qadri
Gokul Swamy
Jonathan Francis
Michael Kaess
Andrea Bajcsy
365
5
0
26 Jan 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
280
6
0
22 Jan 2025
Learning to Assist Humans without Inferring Rewards
Neural Information Processing Systems (NeurIPS), 2024
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
494
10
0
17 Jan 2025
Robustness in the Face of Partial Identifiability in Reward Learning
Filippo Lazzati
Alberto Maria Metelli
165
1
0
10 Jan 2025
Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation
IEEE/ACM International Conference on Human-Robot Interaction (HRI), 2025
N. Dennler
Stefanos Nikolaidis
Maja J. Matarić
845
1
0
03 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
IEEE Access (IEEE Access), 2024
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
242
46
0
31 Dec 2024
Imitation Learning from Suboptimal Demonstrations via Meta-Learning An Action Ranker
Jiangdong Fan
Hongcai He
Paul Weng
Hui Xu
Jie Shao
196
2
0
31 Dec 2024
LEASE: Offline Preference-based Reinforcement Learning with High Sample Efficiency
Xiao-Yin Liu
Guotao Li
Xiao-Hu Zhou
Z. Hou
OffRL
289
1
0
30 Dec 2024
Active Inference and Human–Computer Interaction
R. Murray-Smith
J. Williamson
Sebastian Stein
AI4CE
118
3
0
19 Dec 2024
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement
Tewodros Ayalew
Xiao Zhang
Kevin Yuanbo Wu
Tianchong Jiang
Michael Maire
Matthew R. Walter
OffRL
361
2
0
26 Nov 2024
Robot See, Robot Do: Imitation Reward for Noisy Financial Environments
BigData Congress [Services Society] (BSS), 2024
Sven Goluža
Tomislav Kovačević
Stjepan Begušić
Z. Kostanjčar
166
0
0
13 Nov 2024
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment
Neural Information Processing Systems (NeurIPS), 2024
Weichao Zhou
Wenchao Li
199
2
0
31 Oct 2024
A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning
Knowledge-Based Systems (KBS), 2024
Shengjie Sun
Runze Liu
Jiafei Lyu
J. Yang
L. Zhang
Xiu Li
LRM
195
17
0
18 Oct 2024
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
Zhaohui Jiang
Xuening Feng
Paul Weng
Yifei Zhu
Yan Song
Tianze Zhou
Yujing Hu
Tangjie Lv
Changjie Fan
278
3
0
08 Oct 2024
Adaptive Language-Guided Abstraction from Contrastive Explanations
Conference on Robot Learning (CoRL), 2024
Andi Peng
Belinda Z. Li
Ilia Sucholutsky
Nishanth Kumar
Julie A. Shah
Jacob Andreas
Andreea Bobu
OffRL
164
5
0
12 Sep 2024
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
IEEE International Conference on Robotics and Automation (ICRA), 2024
Z. Liu
Junjie Xu
Xingjiao Wu
J. Yang
Liang He
275
1
0
11 Sep 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
International Conference on Learning Representations (ICLR), 2024
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
351
8
0
11 Aug 2024
Preference-Guided Reinforcement Learning for Efficient Exploration
Guojian Wang
Faguo Wu
Xinyuan Li
Tianyuan Chen
Xiao Zhang
Xuyang Chen
197
0
0
09 Jul 2024
Quantifying Misalignment Between Agents: Towards a Sociotechnical Understanding of Alignment
Aidan Kierans
Avijit Ghosh
Hananel Hazan
Shiri Dori-Hacohen
193
7
0
06 Jun 2024