ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.06483
  4. Cited By
Scalable Bayesian Inverse Reinforcement Learning

Scalable Bayesian Inverse Reinforcement Learning

12 February 2021
Alex J. Chan
M. Schaar
    OffRL
    BDL
ArXivPDFHTML

Papers citing "Scalable Bayesian Inverse Reinforcement Learning"

44 / 44 papers shown
Title
Inverse Delayed Reinforcement Learning
Inverse Delayed Reinforcement Learning
S. Zhan
Qingyuan Wu
Zhian Ruan
Frank Yang
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
65
0
0
04 Dec 2024
Approximated Variational Bayesian Inverse Reinforcement Learning for
  Large Language Model Alignment
Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment
Yuang Cai
Yuyu Yuan
Jinsheng Shi
Qinhong Lin
43
0
0
14 Nov 2024
RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean
  Metric Space
RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space
Jingdi Chen
Hanhan Zhou
Yongsheng Mei
Carlee Joe-Wong
Gina Adam
Nathaniel D. Bastian
Tian-Shing Lan
OffRL
30
0
0
21 Oct 2024
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
Shicheng Liu
Minghui Zhu
51
1
0
21 Oct 2024
Model-Based Reward Shaping for Adversarial Inverse Reinforcement
  Learning in Stochastic Environments
Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
S. Zhan
Qingyuan Wu
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
36
1
0
04 Oct 2024
Online Control-Informed Learning
Online Control-Informed Learning
Zihao Liang
Tianyu Zhou
Zehui Lu
Shaoshuai Mou
33
1
0
04 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through
  Generative World Model
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
22
2
0
03 Oct 2024
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
Dongheng Li
Yongchang Hao
Lili Mou
53
1
0
19 Sep 2024
Markov Balance Satisfaction Improves Performance in Strictly Batch
  Offline Imitation Learning
Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
Ashutosh Nayyar
OffRL
36
0
0
17 Aug 2024
Walking the Values in Bayesian Inverse Reinforcement Learning
Walking the Values in Bayesian Inverse Reinforcement Learning
Ondrej Bajgar
Alessandro Abate
Konstantinos Gatsis
Michael A. Osborne
OffRL
BDL
30
0
0
15 Jul 2024
How to Leverage Diverse Demonstrations in Offline Imitation Learning
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue
Jiani Liu
Xingyuan Hua
Ju Ren
Sen Lin
Junshan Zhang
Yaoxue Zhang
OffRL
34
3
0
24 May 2024
Collaborative AI Teaming in Unknown Environments via Active Goal
  Deduction
Collaborative AI Teaming in Unknown Environments via Active Goal Deduction
Zuyuan Zhang
Hanhan Zhou
Mahdi Imani
Taeyoung Lee
Tian-Shing Lan
37
11
0
22 Mar 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
33
0
0
04 Feb 2024
Accelerating Inverse Reinforcement Learning with Expert Bootstrapping
Accelerating Inverse Reinforcement Learning with Expert Bootstrapping
David Wu
Sanjiban Choudhury
21
0
0
04 Feb 2024
Dense Reward for Free in Reinforcement Learning from Human Feedback
Dense Reward for Free in Reinforcement Learning from Human Feedback
Alex J. Chan
Hao Sun
Samuel Holt
M. Schaar
18
31
0
01 Feb 2024
Multi-intention Inverse Q-learning for Interpretable Behavior
  Representation
Multi-intention Inverse Q-learning for Interpretable Behavior Representation
Hao Zhu
Brice de la Crompe
Gabriel Kalweit
Artur Schneider
M. Kalweit
Ilka Diester
Joschka Boedecker
OffRL
AI4CE
24
5
0
23 Nov 2023
Optimising Human-AI Collaboration by Learning Convincing Explanations
Optimising Human-AI Collaboration by Learning Convincing Explanations
Alex J. Chan
Alihan Huyuk
M. Schaar
37
3
0
13 Nov 2023
A Novel Variational Lower Bound for Inverse Reinforcement Learning
A Novel Variational Lower Bound for Inverse Reinforcement Learning
Yikang Gui
Prashant Doshi
24
0
0
07 Nov 2023
A Bayesian Approach to Robust Inverse Reinforcement Learning
A Bayesian Approach to Robust Inverse Reinforcement Learning
Ran Wei
Siliang Zeng
Chenliang Li
Alfredo García
Anthony D. McDonald
Mingyi Hong
OffRL
28
4
0
15 Sep 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and
  Challenges
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
24
81
0
05 Sep 2023
Conditional Kernel Imitation Learning for Continuous State Environments
Conditional Kernel Imitation Learning for Continuous State Environments
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
A. Nayyar
30
0
0
24 Aug 2023
MiniLLM: Knowledge Distillation of Large Language Models
MiniLLM: Knowledge Distillation of Large Language Models
Yuxian Gu
Li Dong
Furu Wei
Minlie Huang
ALM
31
77
0
14 Jun 2023
Curricular Subgoals for Inverse Reinforcement Learning
Curricular Subgoals for Inverse Reinforcement Learning
Shunyu Liu
Yunpeng Qing
Shuqi Xu
Hongyan Wu
Jiangtao Zhang
Jingyuan Cong
Tianhao Chen
Yunfu Liu
Mingli Song
21
1
0
14 Jun 2023
Inverse Reinforcement Learning with the Average Reward Criterion
Inverse Reinforcement Learning with the Average Reward Criterion
Feiyang Wu
Jingyang Ke
Anqi Wu
35
9
0
24 May 2023
Massively Scalable Inverse Reinforcement Learning in Google Maps
Massively Scalable Inverse Reinforcement Learning in Google Maps
Matt Barnes
Matthew Abueg
Oliver F. Lange
Matt Deeds
Jason M. Trader
Denali Molitor
Markus Wulfmeier
S. O’Banion
22
6
0
18 May 2023
Replicating Complex Dialogue Policy of Humans via Offline Imitation
  Learning with Supervised Regularization
Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
Zhoujian Sun
Chenyang Zhao
Zheng-Wei Huang
Nai Ding
OffRL
38
1
0
06 May 2023
Kernel Density Bayesian Inverse Reinforcement Learning
Kernel Density Bayesian Inverse Reinforcement Learning
Aishwarya Mandyam
Didong Li
Diana Cai
Andrew Jones
Barbara E. Engelhardt
BDL
OffRL
45
3
0
13 Mar 2023
Programmatic Imitation Learning from Unlabeled and Noisy Demonstrations
Programmatic Imitation Learning from Unlabeled and Noisy Demonstrations
Jimmy Xin
Linus Zheng
Kia Rahmani
Jiayi Wei
Jarrett Holtz
Işıl Dillig
Joydeep Biswas
30
1
0
02 Mar 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood
  Framework for Offline Inverse Reinforcement Learning
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
34
13
0
15 Feb 2023
CLARE: Conservative Model-Based Reward Learning for Offline Inverse
  Reinforcement Learning
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning
Sheng Yue
Guan-Bo Wang
Wei Shao
Zhaofeng Zhang
Sen Lin
Junkai Ren
Junshan Zhang
OffRL
28
20
0
09 Feb 2023
Practical Approaches for Fair Learning with Multitype and Multivariate
  Sensitive Attributes
Practical Approaches for Fair Learning with Multitype and Multivariate Sensitive Attributes
Tennison Liu
Alex J. Chan
B. V. Breugel
M. Schaar
FaML
25
2
0
11 Nov 2022
Environment Design for Inverse Reinforcement Learning
Environment Design for Inverse Reinforcement Learning
Thomas Kleine Buening
Victor Villin
Christos Dimitrakakis
32
1
0
26 Oct 2022
Teacher Forcing Recovers Reward Functions for Text Generation
Teacher Forcing Recovers Reward Functions for Text Generation
Yongchang Hao
Yuxin Liu
Lili Mou
OffRL
40
11
0
17 Oct 2022
Synthetic Model Combination: An Instance-wise Approach to Unsupervised
  Ensemble Learning
Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning
Alex J. Chan
M. Schaar
OOD
39
1
0
11 Oct 2022
Reward Learning using Structural Motifs in Inverse Reinforcement
  Learning
Reward Learning using Structural Motifs in Inverse Reinforcement Learning
Raeid Saqur
18
2
0
25 Sep 2022
Proximal Point Imitation Learning
Proximal Point Imitation Learning
Luca Viano
Angeliki Kamoutsi
Gergely Neu
Igor Krawczuk
V. Cevher
33
14
0
22 Sep 2022
Basis for Intentions: Efficient Inverse Reinforcement Learning using
  Past Experience
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Marwa Abdulhai
Natasha Jaques
Sergey Levine
OffRL
19
5
0
09 Aug 2022
Benchmarking Constraint Inference in Inverse Reinforcement Learning
Benchmarking Constraint Inference in Inverse Reinforcement Learning
Guiliang Liu
Yudong Luo
A. Gaurav
K. Rezaee
Pascal Poupart
41
22
0
20 Jun 2022
Model-based Offline Imitation Learning with Non-expert Data
Model-based Offline Imitation Learning with Non-expert Data
Jeongwon Park
Lin F. Yang
OffRL
32
1
0
11 Jun 2022
POETREE: Interpretable Policy Learning with Adaptive Decision Trees
POETREE: Interpretable Policy Learning with Adaptive Decision Trees
Alizée Pace
Alex J. Chan
M. Schaar
OffRL
19
17
0
15 Mar 2022
Inverse Online Learning: Understanding Non-Stationary and Reactionary
  Policies
Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies
Alex J. Chan
Alicia Curth
M. Schaar
CML
OffRL
19
8
0
14 Mar 2022
Inverse Optimal Control Adapted to the Noise Characteristics of the
  Human Sensorimotor System
Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System
M. Schultheis
Dominik Straub
Constantin Rothkopf
11
20
0
21 Oct 2021
The Medkit-Learn(ing) Environment: Medical Decision Modelling through
  Simulation
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
19
14
0
08 Jun 2021
Mitigating Covariate Shift in Imitation Learning via Offline Data
  Without Great Coverage
Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage
Jonathan D. Chang
Masatoshi Uehara
Dhruv Sreenivas
Rahul Kidambi
Wen Sun
OffRL
24
32
0
06 Jun 2021
1