Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.02827
Cited By
v1
v2 (latest)
Inverse Reward Design
8 November 2017
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Inverse Reward Design"
50 / 265 papers shown
Learning Perceptual Concepts by Bootstrapping from Human Queries
IEEE Robotics and Automation Letters (RA-L), 2021
Andreea Bobu
Chris Paxton
Wei Yang
Balakumar Sundaralingam
Yu-Wei Chao
Maya Cakmak
Dieter Fox
SSL
263
17
0
09 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
323
125
0
04 Nov 2021
On the Expressivity of Markov Reward
Neural Information Processing Systems (NeurIPS), 2021
David Abel
Will Dabney
Anna Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
247
97
0
01 Nov 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
140
15
0
10 Oct 2021
Medical Dead-ends and Learning to Identify High-risk States and Treatments
Neural Information Processing Systems (NeurIPS), 2021
Mehdi Fatemi
Taylor W. Killian
J. Subramanian
Marzyeh Ghassemi
OffRL
229
47
0
08 Oct 2021
Reactive and Safe Road User Simulations using Neural Barrier Certificates
Yue Meng
Zengyi Qin
Chuchu Fan
249
20
0
14 Sep 2021
Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning
Ning Wei
Jiahua Liang
Di Xie
Shiliang Pu
165
0
0
06 Sep 2021
Balancing Performance and Human Autonomy with Implicit Guidance Agent
Frontiers in Artificial Intelligence (Front. Artif. Intell.), 2021
Ryo Nakahashi
Seiji Yamada
157
6
0
01 Sep 2021
Cognitive science as a source of forward and inverse models of human decisions for robotics and control
Mark K. Ho
Thomas Griffiths
252
48
0
01 Sep 2021
A Hybrid Rule-Based and Data-Driven Approach to Driver Modeling through Particle Filtering
Raunak P. Bhattacharyya
Soyeon Jung
Liam A. Kruse
Ransalu Senanayake
Mykel Kochenderfer
153
35
0
29 Aug 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Conference on Robot Learning (CoRL), 2021
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
188
48
0
11 Aug 2021
Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration
Christian Ellis
Maggie B. Wigness
J. Rogers
Craig T. Lennon
L. Fiondella
196
7
0
31 Jul 2021
What are you optimizing for? Aligning Recommender Systems with Human Values
J. Stray
Ivan Vendrov
Jeremy Nixon
Steven Adler
Dylan Hadfield-Menell
OffRL
176
65
0
22 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
International Conference on Machine Learning (ICML), 2021
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
364
75
0
08 Jul 2021
Supervised Bayesian Specification Inference from Demonstrations
Ankit J. Shah
Pritish Kamath
Shen Li
Patrick L. Craven
Kevin J. Landers
Kevin B. Oden
J. Shah
228
4
0
06 Jul 2021
The MineRL BASALT Competition on Learning from Human Feedback
Rohin Shah
Cody Wild
Steven H. Wang
Neel Alex
Brandon Houghton
...
Stephanie Milani
Nicholay Topin
Pieter Abbeel
Stuart J. Russell
Anca Dragan
202
32
0
05 Jul 2021
Unsupervised Skill Discovery with Bottleneck Option Learning
International Conference on Machine Learning (ICML), 2021
Jaekyeom Kim
Seohong Park
Gunhee Kim
209
38
0
27 Jun 2021
Deep Reinforcement Learning for Conservation Decisions
Marcus Lapeyrolerie
Melissa S. Chapman
Kari E. A. Norman
C. Boettiger
OffRL
214
25
0
15 Jun 2021
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
International Conference on Machine Learning (ICML), 2021
Dhruv Malik
Aldo Pacchiano
Vishwak Srinivasan
Yuanzhi Li
143
7
0
15 Jun 2021
Policy Gradient Bayesian Robust Optimization for Imitation Learning
International Conference on Machine Learning (ICML), 2021
Zaynah Javed
Daniel S. Brown
Satvik Sharma
Jerry Zhu
Ashwin Balakrishna
Marek Petrik
Anca Dragan
Ken Goldberg
277
18
0
11 Jun 2021
Hard Choices in Artificial Intelligence
Artificial Intelligence (AI), 2021
Roel Dobbe
T. Gilbert
Yonatan Dov Mintz
151
70
0
10 Jun 2021
Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning
Jongwook Choi
Archit Sharma
Honglak Lee
Sergey Levine
S. Gu
DRL
210
23
0
02 Jun 2021
Goal Misgeneralization in Deep Reinforcement Learning
International Conference on Machine Learning (ICML), 2021
L. Langosco
Jack Koch
Lee D. Sharkey
J. Pfau
Laurent Orseau
David M. Krueger
503
111
0
28 May 2021
A Survey on Interactive Reinforcement Learning: Design Principles and Open Challenges
Christian Arzate Cruz
Takeo Igarashi
OffRL
214
102
0
27 May 2021
Informational Design of Dynamic Multi-Agent System
Tao Zhang
Quanyan Zhu
41
5
0
07 May 2021
Reward (Mis)design for Autonomous Driving
Artificial Intelligence (AI), 2021
W. B. Knox
A. Allievi
Holger Banzhaf
Felix Schmitt
Peter Stone
349
144
0
28 Apr 2021
Understanding and Avoiding AI Failures: A Practical Guide
R. M. Williams
Roman V. Yampolskiy
199
28
0
22 Apr 2021
Alignment of Language Agents
Zachary Kenton
Tom Everitt
Laura Weidinger
Iason Gabriel
Vladimir Mikulik
G. Irving
239
204
0
26 Mar 2021
Combining Reward Information from Multiple Sources
Dmitrii Krasheninnikov
Rohin Shah
H. V. Hoof
191
4
0
22 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
International Conference on Learning Representations (ICLR), 2021
Benjamin Eysenbach
Sergey Levine
OOD
276
220
0
10 Mar 2021
Efficient learning of goal-oriented push-grasping synergy in clutter
IEEE Robotics and Automation Letters (RA-L), 2021
Kechun Xu
Hongxiang Yu
Qianen Lai
Yue Wang
R. Xiong
280
87
0
09 Mar 2021
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization
International Conference on Learning Representations (ICLR), 2021
Zhen-Yu Tang
Chao Yu
Boyuan Chen
Huazhe Xu
Xiaolong Wang
Fei Fang
S. Du
Yu Wang
Yi Wu
230
61
0
08 Mar 2021
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
F. Memarian
Wonjoon Goo
Rudolf Lioutikov
S. Niekum
Ufuk Topcu
OffRL
306
64
0
08 Mar 2021
On the Equilibrium Elicitation of Markov Games Through Information Design
Tao Zhang
Quanyan Zhu
27
1
0
14 Feb 2021
Mitigating Negative Side Effects via Environment Shaping
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Sandhya Saisubramanian
S. Zilberstein
115
6
0
13 Feb 2021
Planning and Learning Using Adaptive Entropy Tree Search
IEEE International Joint Conference on Neural Network (IJCNN), 2021
Piotr Kozakowski
Mikolaj Pacek
Piotr Milo's
191
3
0
12 Feb 2021
Consequences of Misaligned AI
Neural Information Processing Systems (NeurIPS), 2021
Simon Zhuang
Dylan Hadfield-Menell
203
90
0
07 Feb 2021
Reinforcement Learning Assisted Beamforming for Inter-cell Interference Mitigation in 5G Massive MIMO Networks
Aidong Yang
Xinlang Yue
Ouyang Ye
142
3
0
27 Jan 2021
Choice Set Misspecification in Reward Inference
Rachel Freedman
Rohin Shah
Anca Dragan
171
19
0
19 Jan 2021
Multi-Principal Assistance Games: Definition and Collegial Mechanisms
Arnaud Fickinger
Simon Zhuang
Andrew Critch
Dylan Hadfield-Menell
Stuart J. Russell
146
5
0
29 Dec 2020
Avoiding Tampering Incentives in Deep RL via Decoupled Approval
J. Uesato
Ramana Kumar
Victoria Krakovna
Tom Everitt
Richard Ngo
Shane Legg
224
18
0
17 Nov 2020
REALab: An Embedded Perspective on Tampering
Ramana Kumar
J. Uesato
Richard Ngo
Tom Everitt
Victoria Krakovna
Shane Legg
160
10
0
17 Nov 2020
Learning Dense Rewards for Contact-Rich Manipulation Tasks
IEEE International Conference on Robotics and Automation (ICRA), 2020
Zheng Wu
Wenzhao Lian
Vaibhav Unhelkar
Masayoshi Tomizuka
S. Schaal
279
46
0
17 Nov 2020
Avoiding Side Effects By Considering Future Tasks
Victoria Krakovna
Laurent Orseau
Richard Ngo
Miljan Martic
Shane Legg
215
43
0
15 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Journal of Artificial Intelligence Research (JAIR), 2020
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
449
276
0
06 Oct 2020
Hidden Incentives for Auto-Induced Distributional Shift
David M. Krueger
Tegan Maharaj
Jan Leike
209
56
0
19 Sep 2020
Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems
Sandhya Saisubramanian
S. Zilberstein
Ece Kamar
283
24
0
24 Aug 2020
Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach
IEEE Robotics and Automation Letters (RA-L), 2020
Boris Ivanovic
Karen Leung
Edward Schmerling
Marco Pavone
VGen
DRL
231
123
0
10 Aug 2020
Bayesian Robust Optimization for Imitation Learning
Neural Information Processing Systems (NeurIPS), 2020
Daniel S. Brown
S. Niekum
Marek Petrik
444
40
0
24 Jul 2020
Multi-Principal Assistance Games
Arnaud Fickinger
Simon Zhuang
Dylan Hadfield-Menell
Stuart J. Russell
130
13
0
19 Jul 2020
Previous
1
2
3
4
5
6
Next