Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.02827
Cited By
v1
v2 (latest)
Inverse Reward Design
8 November 2017
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Inverse Reward Design"
15 / 265 papers shown
Title
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
Justin Fu
Avi Singh
Dibya Ghosh
Larry Yang
Sergey Levine
BDL
299
130
0
29 May 2018
Playing hard exploration games by watching YouTube
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
216
279
0
29 May 2018
Learning Safe Policies with Expert Guidance
Je-chun Huang
Fa Wu
Doina Precup
Yang Cai
160
27
0
21 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
290
42
0
09 May 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
166
126
0
03 May 2018
Incomplete Contracting and AI Alignment
Dylan Hadfield-Menell
Gillian Hadfield
182
99
0
12 Apr 2018
Modeling Others using Oneself in Multi-Agent Reinforcement Learning
International Conference on Machine Learning (ICML), 2018
Roberta Raileanu
Emily L. Denton
Arthur Szlam
Rob Fergus
241
221
0
26 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
823
1,198
0
16 Feb 2018
Counterfactual equivalence for POMDPs, and underlying deterministic environments
Stuart Armstrong
80
2
0
11 Jan 2018
Índifference' methods for managing agent rewards
Stuart Armstrong
Xavier O'Rourke
214
20
0
18 Dec 2017
Occam's razor is insufficient to infer the preferences of irrational agents
Stuart Armstrong
Sören Mindermann
527
93
0
15 Dec 2017
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
275
276
0
27 Nov 2017
"Dave...I can assure you...that it's going to be all right..." -- A definition, case for, and survey of algorithmic assurances in human-autonomy trust relationships
Brett W. Israelsen
Nisar R. Ahmed
296
94
0
08 Nov 2017
Robot Planning with Mathematical Models of Human State and Action
Anca Dragan
123
36
0
11 May 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
835
1,734
0
25 Jan 2017
Previous
1
2
3
4
5
6