Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

Neural Information Processing Systems (NeurIPS), 2020

17 November 2020

Sreejith Balakrishnan

Q. Nguyen

Bryan Kian Hsiang Low

Harold Soh

ArXiv (abs)PDF HTML

Papers citing "Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization"

17 / 17 papers shown

Embracing Evolution: A Call for Body-Control Co-Design in Embodied Humanoid Robot

191

03 Oct 2025

Provably Efficient Exploration in Inverse Constrained Reinforcement Learning

Bo Yue

Jian Li

Guiliang Liu

485

24 Sep 2024

A Generalized Acquisition Function for Preference-based Reward LearningIEEE International Conference on Robotics and Automation (ICRA), 2024

299

09 Mar 2024

Inverse Decision Modeling: Learning Interpretable Representations of BehaviorInternational Conference on Machine Learning (ICML), 2023

266

28 Oct 2023

Training-Free Neural Active Learning with Initialization-Robustness GuaranteesInternational Conference on Machine Learning (ICML), 2023

Apivich Hemachandra

Zhongxiang Dai

Jasraj Singh

See-Kiong Ng

K. H. Low

AAML

298

07 Jun 2023

Reward Learning with Intractable Normalizing FunctionsIEEE Robotics and Automation Letters (RA-L), 2023

Joshua Hoegerman

Dylan P. Losey

233

16 May 2023

Kernel Density Bayesian Inverse Reinforcement Learning

Barbara E. Engelhardt

OffRL BDL

558

13 Mar 2023

Active Learning and Bayesian Optimization: a Unified Perspective to Learn with a GoalArchives of Computational Methods in Engineering (ACME), 2023

Francesco Di Fiore

Michela Nardelli

L. Mainini

454

02 Mar 2023

Sample-Then-Optimize Batch Neural Thompson SamplingNeural Information Processing Systems (NeurIPS), 2022

Zhongxiang Dai

Yao Shu

Bryan Kian Hsiang Low

Patrick Jaillet

AAML

229

13 Oct 2022

Identifiability and generalizability from multiple experts in Inverse Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

370

22 Sep 2022

Bayesian Optimization under Stochastic Delayed FeedbackInternational Conference on Machine Learning (ICML), 2022

Arun Verma

Zhongxiang Dai

Bryan Kian Hsiang Low

256

19 Jun 2022

Modeling Human Behavior Part I -- Learning and Belief Approaches

Andrew Fuchs

A. Passarella

M. Conti

297

13 May 2022

Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit OptimizationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

223

15 Mar 2022

MIRROR: Differentiable Deep Social Projection for Assistive Human-Robot Communication

Kaiqi Chen

J. Fong

Harold Soh

218

06 Mar 2022

Differentially Private Federated Bayesian Optimization with Distributed Exploration

231

27 Oct 2021

Inverse Contextual Bandits: Learning How Behavior Evolves over Time

386

13 Jul 2021

Identifiability in inverse reinforcement learningNeural Information Processing Systems (NeurIPS), 2021

Haoyang Cao

Samuel N. Cohen

Lukasz Szpruch

412

07 Jun 2021