Inverse Reward Design

8 November 2017
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan

Papers citing "Inverse Reward Design"

15 / 265 papers shown
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
Justin Fu
Avi Singh
Dibya Ghosh
Larry Yang
Sergey Levine
BDL
299
130
0
29 May 2018
Playing hard exploration games by watching YouTube
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
216
279
0
29 May 2018
Learning Safe Policies with Expert Guidance
Je-chun Huang
Fa Wu
Doina Precup
Yang Cai
160
27
0
21 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
290
42
0
09 May 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
166
126
0
03 May 2018
Incomplete Contracting and AI Alignment
Dylan Hadfield-Menell
Gillian Hadfield
182
99
0
12 Apr 2018
Modeling Others using Oneself in Multi-Agent Reinforcement Learning
International Conference on Machine Learning (ICML), 2018
Roberta Raileanu
Emily L. Denton
Arthur Szlam
Rob Fergus
241
221
0
26 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
823
1,198
0
16 Feb 2018
Counterfactual equivalence for POMDPs, and underlying deterministic environments
Stuart Armstrong
80
2
0
11 Jan 2018
'Indifference' methods for managing agent rewards
Stuart Armstrong
Xavier O'Rourke
214
20
0
18 Dec 2017
Occam's razor is insufficient to infer the preferences of irrational agents
Stuart Armstrong
Sören Mindermann
527
93
0
15 Dec 2017
AI Safety Gridworlds
Jan Leike
Miljan Martic
Victoria Krakovna
Pedro A. Ortega
Tom Everitt
Andrew Lefrancq
Laurent Orseau
Shane Legg
275
276
0
27 Nov 2017
"Dave...I can assure you...that it's going to be all right..." -- A definition, case for, and survey of algorithmic assurances in human-autonomy trust relationships
Brett W. Israelsen
Nisar R. Ahmed
296
94
0
08 Nov 2017
Robot Planning with Mathematical Models of Human State and Action
Anca Dragan
123
36
0
11 May 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL, VLM
835
1,734
0
25 Jan 2017