v1v2 (latest)

Inverse Reward Design

8 November 2017

Dylan Hadfield-Menell

Pieter Abbeel

Papers citing "Inverse Reward Design"

50 / 265 papers shown

Learning Perceptual Concepts by Bootstrapping from Human QueriesIEEE Robotics and Automation Letters (RA-L), 2021

Andreea Bobu

Chris Paxton

Wei Yang

Balakumar Sundaralingam

263

09 Nov 2021

B-Pref: Benchmarking Preference-Based Reinforcement Learning

Pieter Abbeel

323

125

04 Nov 2021

On the Expressivity of Markov RewardNeural Information Processing Systems (NeurIPS), 2021

Michael L. Littman

247

01 Nov 2021

Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization

S. Gu

Manfred Diaz

Daniel Freeman

Hiroki Furuta

Seyed Kamyar Seyed Ghasemipour

Olivier Bachem

140

10 Oct 2021

Medical Dead-ends and Learning to Identify High-risk States and TreatmentsNeural Information Processing Systems (NeurIPS), 2021

229

08 Oct 2021

Reactive and Safe Road User Simulations using Neural Barrier Certificates

Yue Meng

Zengyi Qin

Chuchu Fan

249

14 Sep 2021

Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning

165

06 Sep 2021

Balancing Performance and Human Autonomy with Implicit Guidance AgentFrontiers in Artificial Intelligence (Front. Artif. Intell.), 2021

Ryo Nakahashi

Seiji Yamada

157

01 Sep 2021

Cognitive science as a source of forward and inverse models of human decisions for robotics and control

Mark K. Ho

Thomas Griffiths

252

01 Sep 2021

A Hybrid Rule-Based and Data-Driven Approach to Driver Modeling through Particle Filtering

Raunak P. Bhattacharyya

Soyeon Jung

Liam A. Kruse

Ransalu Senanayake

Mykel Kochenderfer

153

29 Aug 2021

Skill Preferences: Learning to Extract and Execute Robotic Skills from Human FeedbackConference on Robot Learning (CoRL), 2021

Pieter Abbeel

188

11 Aug 2021

Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration

196

31 Jul 2021

What are you optimizing for? Aligning Recommender Systems with Human Values

Dylan Hadfield-Menell

OffRL

176

22 Jul 2021

Offline Meta-Reinforcement Learning with Online Self-SupervisionInternational Conference on Machine Learning (ICML), 2021

364

08 Jul 2021

Supervised Bayesian Specification Inference from Demonstrations

Shen Li

228

06 Jul 2021

The MineRL BASALT Competition on Learning from Human Feedback

...

Pieter Abbeel

202

05 Jul 2021

Unsupervised Skill Discovery with Bottleneck Option LearningInternational Conference on Machine Learning (ICML), 2021

Jaekyeom Kim

Seohong Park

Gunhee Kim

209

27 Jun 2021

Deep Reinforcement Learning for Conservation Decisions

214

15 Jun 2021

Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond LinearityInternational Conference on Machine Learning (ICML), 2021

143

15 Jun 2021

Policy Gradient Bayesian Robust Optimization for Imitation LearningInternational Conference on Machine Learning (ICML), 2021

277

11 Jun 2021

Hard Choices in Artificial IntelligenceArtificial Intelligence (AI), 2021

Roel Dobbe

T. Gilbert

Yonatan Dov Mintz

151

10 Jun 2021

Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning

210

02 Jun 2021

Goal Misgeneralization in Deep Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021

503

111

28 May 2021

A Survey on Interactive Reinforcement Learning: Design Principles and Open Challenges

Christian Arzate Cruz

Takeo Igarashi

OffRL

214

102

27 May 2021

Informational Design of Dynamic Multi-Agent System

Tao Zhang

Quanyan Zhu

07 May 2021

Reward (Mis)design for Autonomous DrivingArtificial Intelligence (AI), 2021

349

144

28 Apr 2021

Understanding and Avoiding AI Failures: A Practical Guide

R. M. Williams

Roman V. Yampolskiy

199

22 Apr 2021

Alignment of Language Agents

Iason Gabriel

239

204

26 Mar 2021

Combining Reward Information from Multiple Sources

Dmitrii Krasheninnikov

Rohin Shah

H. V. Hoof

191

22 Mar 2021

Maximum Entropy RL (Provably) Solves Some Robust RL ProblemsInternational Conference on Learning Representations (ICLR), 2021

Benjamin Eysenbach

Sergey Levine

OOD

276

220

10 Mar 2021

Efficient learning of goal-oriented push-grasping synergy in clutterIEEE Robotics and Automation Letters (RA-L), 2021

280

09 Mar 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward RandomizationInternational Conference on Learning Representations (ICLR), 2021

Chao Yu

230

08 Mar 2021

Self-Supervised Online Reward Shaping in Sparse-Reward EnvironmentsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

Ufuk Topcu

306

08 Mar 2021

On the Equilibrium Elicitation of Markov Games Through Information Design

Tao Zhang

Quanyan Zhu

14 Feb 2021

Mitigating Negative Side Effects via Environment ShapingAdaptive Agents and Multi-Agent Systems (AAMAS), 2021

Sandhya Saisubramanian

S. Zilberstein

115

13 Feb 2021

Planning and Learning Using Adaptive Entropy Tree SearchIEEE International Joint Conference on Neural Network (IJCNN), 2021

Piotr Kozakowski

Mikolaj Pacek

Piotr Milo's

191

12 Feb 2021

Consequences of Misaligned AINeural Information Processing Systems (NeurIPS), 2021

Simon Zhuang

Dylan Hadfield-Menell

203

07 Feb 2021

Reinforcement Learning Assisted Beamforming for Inter-cell Interference Mitigation in 5G Massive MIMO Networks

Aidong Yang

Xinlang Yue

Ouyang Ye

142

27 Jan 2021

Choice Set Misspecification in Reward Inference

Rachel Freedman

Rohin Shah

Anca Dragan

171

19 Jan 2021

Multi-Principal Assistance Games: Definition and Collegial Mechanisms

Arnaud Fickinger

Simon Zhuang

Andrew Critch

Dylan Hadfield-Menell

Stuart J. Russell

146

29 Dec 2020

Avoiding Tampering Incentives in Deep RL via Decoupled Approval

224

17 Nov 2020

REALab: An Embedded Perspective on Tampering

160

17 Nov 2020

Learning Dense Rewards for Contact-Rich Manipulation TasksIEEE International Conference on Robotics and Automation (ICRA), 2020

279

17 Nov 2020

Avoiding Side Effects By Considering Future Tasks

215

15 Oct 2020

Reward Machines: Exploiting Reward Function Structure in Reinforcement LearningJournal of Artificial Intelligence Research (JAIR), 2020

449

276

06 Oct 2020

Hidden Incentives for Auto-Induced Distributional Shift

David M. Krueger

Tegan Maharaj

Jan Leike

209

19 Sep 2020

Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems

Sandhya Saisubramanian

S. Zilberstein

Ece Kamar

283

24 Aug 2020

Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder ApproachIEEE Robotics and Automation Letters (RA-L), 2020

231

123

10 Aug 2020

Bayesian Robust Optimization for Imitation LearningNeural Information Processing Systems (NeurIPS), 2020

Daniel S. Brown

S. Niekum

Marek Petrik

444

24 Jul 2020

Multi-Principal Assistance Games

Arnaud Fickinger

Simon Zhuang

Dylan Hadfield-Menell

Stuart J. Russell

130

19 Jul 2020