Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.10759
Cited By
The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models
22 April 2022
Cassidy Laidlaw
Anca Dragan
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models"
24 / 24 papers shown
Title
AssistanceZero: Scalably Solving Assistance Games
Cassidy Laidlaw
Eli Bronstein
Timothy Guo
Dylan Feng
Lukas Berglund
Justin Svegliato
Stuart J. Russell
Anca Dragan
84
1
0
09 Apr 2025
Probabilistic Quantum SVM Training on Ising Machine
Haoqi He
Yan Xiao
87
0
0
20 Mar 2025
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
146
2
0
21 Feb 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
138
5
0
17 Jan 2025
Effects of Robot Competency and Motion Legibility on Human Correction Feedback
Shuangge Wang
Anjiabei Wang
Sofiya Goncharova
Brian Scassellati
Tesca Fitzgerald
109
1
0
08 Jan 2025
Observation Interference in Partially Observable Assistance Games
Scott Emmons
Caspar Oesterheld
Vincent Conitzer
Stuart Russell
69
1
0
23 Dec 2024
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
S. Poddar
Yanming Wan
Hamish Ivison
Abhishek Gupta
Natasha Jaques
104
50
0
19 Aug 2024
Improving Context-Aware Preference Modeling for Language Models
Silviu Pitis
Ziang Xiao
Nicolas Le Roux
Alessandro Sordoni
95
12
0
20 Jul 2024
Boltzmann State-Dependent Rationality
Osher Lerner
53
0
0
26 Apr 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
116
4
0
02 Feb 2024
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
Anand Siththaranjan
Cassidy Laidlaw
Dylan Hadfield-Menell
118
72
0
13 Dec 2023
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
Olivia Macmillan-Scott
Mirco Musolesi
96
1
0
28 Nov 2023
Efficient Human-AI Coordination via Preparatory Language-based Convention
Cong Guan
Lichao Zhang
Chunpeng Fan
Yi-Chen Li
Feng Chen
Lihe Li
Yunjia Tian
Lei Yuan
Yang Yu
LM&Ro
93
8
0
01 Nov 2023
Concept Alignment as a Prerequisite for Value Alignment
Sunayana Rane
Mark K. Ho
Ilia Sucholutsky
Thomas Griffiths
CVBM
60
6
0
30 Oct 2023
Quantifying Assistive Robustness Via the Natural-Adversarial Frontier
Jerry Zhi-Yang He
Zackory M. Erickson
Daniel S. Brown
Anca Dragan
AAML
83
0
0
16 Oct 2023
Learning to Make Adherence-Aware Advice
Guanting Chen
Xiaocheng Li
Chunlin Sun
Hanzhao Wang
50
12
0
01 Oct 2023
A behavioural transformer for effective collaboration between a robot and a non-stationary human
Ruaridh Mon-Williams
Theodoros Stouraitis
S. Vijayakumar
82
2
0
25 Jul 2023
Language Instructed Reinforcement Learning for Human-AI Coordination
Hengyuan Hu
Dorsa Sadigh
LM&Ro
96
64
0
13 Apr 2023
Learning to Influence Human Behavior with Offline Reinforcement Learning
Joey Hong
Sergey Levine
Anca Dragan
OffRL
AI4CE
87
0
0
03 Mar 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Chao Yu
Jiaxuan Gao
Weiling Liu
Bo Xu
Hao Tang
Jiaqi Yang
Yu Wang
Yi Wu
109
42
0
03 Feb 2023
Modeling Mobile Health Users as Reinforcement Learning Agents
Eura Shin
S. Swaroop
Weiwei Pan
Susan Murphy
Finale Doshi-Velez
OffRL
50
3
0
01 Dec 2022
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Tuan-Duong Trinh
Haoyu Chen
Daniel S. Brown
OffRL
72
8
0
28 Nov 2022
Adapting Neural Models with Sequential Monte Carlo Dropout
Pamela Carreno-Medrano
Dana Kulic
Michael G. Burke
122
4
0
27 Oct 2022
Causal Confusion and Reward Misidentification in Preference-Based Reward Learning
J. Tien
Jerry Zhi-Yang He
Zackory M. Erickson
Anca Dragan
Daniel S. Brown
CML
105
43
0
13 Apr 2022
1