Inverse Reward Design

v1v2 (latest)

Inverse Reward Design

8 November 2017

Dylan Hadfield-Menell

Pieter Abbeel

Stuart J. Russell

ArXiv (abs)PDF HTML

Papers citing "Inverse Reward Design"

15 / 265 papers shown

Title
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition Justin Fu Avi Singh Dibya Ghosh Larry Yang Sergey Levine BDL 299 130 0 29 May 2018
Playing hard exploration games by watching YouTube Y. Aytar Tobias Pfaff David Budden T. Paine Ziyun Wang Nando de Freitas 216 279 0 29 May 2018
Learning Safe Policies with Expert Guidance Je-chun Huang Fa Wu Doina Precup Yang Cai 160 27 0 21 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning Joshua Romoff Peter Henderson Alexandre Piché Vincent François-Lavet Joelle Pineau 290 42 0 09 May 2018
AGI Safety Literature Review Tom Everitt G. Lea Marcus Hutter AI4CE 166 126 0 03 May 2018
Incomplete Contracting and AI Alignment Dylan Hadfield-Menell Gillian Hadfield 182 99 0 12 Apr 2018
Modeling Others using Oneself in Multi-Agent Reinforcement LearningInternational Conference on Machine Learning (ICML), 2018 Roberta Raileanu Emily L. Denton Arthur Szlam Rob Fergus 241 221 0 26 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function Benjamin Eysenbach Abhishek Gupta Julian Ibarz Sergey Levine 823 1,198 0 16 Feb 2018
Counterfactual equivalence for POMDPs, and underlying deterministic environments Stuart Armstrong 80 2 0 11 Jan 2018
Índifference' methods for managing agent rewards Stuart Armstrong Xavier O'Rourke 214 20 0 18 Dec 2017
Occam's razor is insufficient to infer the preferences of irrational agents Stuart Armstrong Sören Mindermann 527 93 0 15 Dec 2017
AI Safety Gridworlds Jan Leike Miljan Martic Victoria Krakovna Pedro A. Ortega Tom Everitt Andrew Lefrancq Laurent Orseau Shane Legg 275 276 0 27 Nov 2017
"Dave...I can assure you...that it's going to be all right..." -- A definition, case for, and survey of algorithmic assurances in human-autonomy trust relationships Brett W. Israelsen Nisar R. Ahmed 296 94 0 08 Nov 2017
Robot Planning with Mathematical Models of Human State and Action Anca Dragan 123 36 0 11 May 2017
Deep Reinforcement Learning: An Overview Yuxi Li OffRL VLM 835 1,734 0 25 Jan 2017