ResearchTrend.AI
The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models

22 April 2022
Cassidy Laidlaw
Anca Dragan
    OffRL

Papers citing "The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models"

24 / 24 papers shown
Title
AssistanceZero: Scalably Solving Assistance Games
Cassidy Laidlaw
Eli Bronstein
Timothy Guo
Dylan Feng
Lukas Berglund
Justin Svegliato
Stuart J. Russell
Anca Dragan
84
1
0
09 Apr 2025
Probabilistic Quantum SVM Training on Ising Machine
Haoqi He
Yan Xiao
87
0
0
20 Mar 2025
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
146
2
0
21 Feb 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
138
5
0
17 Jan 2025
Effects of Robot Competency and Motion Legibility on Human Correction Feedback
Shuangge Wang
Anjiabei Wang
Sofiya Goncharova
Brian Scassellati
Tesca Fitzgerald
109
1
0
08 Jan 2025
Observation Interference in Partially Observable Assistance Games
Scott Emmons
Caspar Oesterheld
Vincent Conitzer
Stuart Russell
69
1
0
23 Dec 2024
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
S. Poddar
Yanming Wan
Hamish Ivison
Abhishek Gupta
Natasha Jaques
104
50
0
19 Aug 2024
Improving Context-Aware Preference Modeling for Language Models
Silviu Pitis
Ziang Xiao
Nicolas Le Roux
Alessandro Sordoni
95
12
0
20 Jul 2024
Boltzmann State-Dependent Rationality
Osher Lerner
53
0
0
26 Apr 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
116
4
0
02 Feb 2024
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
Anand Siththaranjan
Cassidy Laidlaw
Dylan Hadfield-Menell
118
72
0
13 Dec 2023
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
Olivia Macmillan-Scott
Mirco Musolesi
96
1
0
28 Nov 2023
Efficient Human-AI Coordination via Preparatory Language-based Convention
Cong Guan
Lichao Zhang
Chunpeng Fan
Yi-Chen Li
Feng Chen
Lihe Li
Yunjia Tian
Lei Yuan
Yang Yu
LM&Ro
93
8
0
01 Nov 2023
Concept Alignment as a Prerequisite for Value Alignment
Sunayana Rane
Mark K. Ho
Ilia Sucholutsky
Thomas Griffiths
CVBM
60
6
0
30 Oct 2023
Quantifying Assistive Robustness Via the Natural-Adversarial Frontier
Jerry Zhi-Yang He
Zackory M. Erickson
Daniel S. Brown
Anca Dragan
AAML
83
0
0
16 Oct 2023
Learning to Make Adherence-Aware Advice
Guanting Chen
Xiaocheng Li
Chunlin Sun
Hanzhao Wang
50
12
0
01 Oct 2023
A behavioural transformer for effective collaboration between a robot and a non-stationary human
Ruaridh Mon-Williams
Theodoros Stouraitis
S. Vijayakumar
82
2
0
25 Jul 2023
Language Instructed Reinforcement Learning for Human-AI Coordination
Hengyuan Hu
Dorsa Sadigh
LM&Ro
96
64
0
13 Apr 2023
Learning to Influence Human Behavior with Offline Reinforcement Learning
Joey Hong
Sergey Levine
Anca Dragan
OffRL, AI4CE
85
0
0
03 Mar 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Chao Yu
Jiaxuan Gao
Weiling Liu
Bo Xu
Hao Tang
Jiaqi Yang
Yu Wang
Yi Wu
109
42
0
03 Feb 2023
Modeling Mobile Health Users as Reinforcement Learning Agents
Eura Shin
S. Swaroop
Weiwei Pan
Susan Murphy
Finale Doshi-Velez
OffRL
44
3
0
01 Dec 2022
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Tuan-Duong Trinh
Haoyu Chen
Daniel S. Brown
OffRL
72
8
0
28 Nov 2022
Adapting Neural Models with Sequential Monte Carlo Dropout
Pamela Carreno-Medrano
Dana Kulic
Michael G. Burke
122
4
0
27 Oct 2022
Causal Confusion and Reward Misidentification in Preference-Based Reward Learning
J. Tien
Jerry Zhi-Yang He
Zackory M. Erickson
Anca Dragan
Daniel S. Brown
CML
105
43
0
13 Apr 2022