ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.04198
  4. Cited By
Preferences Implicit in the State of the World

Preferences Implicit in the State of the World

12 February 2019
Rohin Shah
Dmitrii Krasheninnikov
Jordan Alexander
Pieter Abbeel
Anca Dragan
ArXivPDFHTML

Papers citing "Preferences Implicit in the State of the World"

12 / 12 papers shown
Title
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
Siow Meng Low
Akshat Kumar
45
0
0
17 Apr 2025
UniMASK: Unified Inference in Sequential Decision Problems
UniMASK: Unified Inference in Sequential Decision Problems
Micah Carroll
Orr Paradise
Jessy Lin
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
26
21
0
20 Nov 2022
Humans are not Boltzmann Distributions: Challenges and Opportunities for
  Modelling Human Feedback and Interaction in Reinforcement Learning
Humans are not Boltzmann Distributions: Challenges and Opportunities for Modelling Human Feedback and Interaction in Reinforcement Learning
David Lindner
Mennatallah El-Assady
OffRL
30
16
0
27 Jun 2022
Towards Flexible Inference in Sequential Decision Problems via
  Bidirectional Transformers
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Micah Carroll
Jessy Lin
Orr Paradise
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
40
10
0
28 Apr 2022
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Adam Gleave
Sam Toyer
21
13
0
22 Mar 2022
B-Pref: Benchmarking Preference-Based Reinforcement Learning
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
The MineRL BASALT Competition on Learning from Human Feedback
The MineRL BASALT Competition on Learning from Human Feedback
Rohin Shah
Cody Wild
Steven H. Wang
Neel Alex
Brandon Houghton
...
Stephanie Milani
Nicholay Topin
Pieter Abbeel
Stuart J. Russell
Anca Dragan
28
31
0
05 Jul 2021
Preference-based Learning of Reward Function Features
Preference-based Learning of Reward Function Features
Sydney M. Katz
Amir Maleki
Erdem Biyik
Mykel J. Kochenderfer
33
11
0
03 Mar 2021
Inverse Constrained Reinforcement Learning
Inverse Constrained Reinforcement Learning
Usman Anwar
Shehryar Malik
Alireza Aghasi
Ali Ahmed
18
58
0
19 Nov 2020
Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems
Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems
Sandhya Saisubramanian
S. Zilberstein
Ece Kamar
11
21
0
24 Aug 2020
Reward-rational (implicit) choice: A unifying formalism for reward
  learning
Reward-rational (implicit) choice: A unifying formalism for reward learning
Hong Jun Jeon
S. Milli
Anca Dragan
17
176
0
12 Feb 2020
SafeLife 1.0: Exploring Side Effects in Complex Environments
SafeLife 1.0: Exploring Side Effects in Complex Environments
Carroll L. Wainwright
P. Eckersley
27
12
0
03 Dec 2019
1