Strictly Batch Imitation Learning by Energy-based Distribution Matching

25 June 2020

Papers citing "Strictly Batch Imitation Learning by Energy-based Distribution Matching"


| Title | Authors | Tags | | | | Date |
|---|---|---|---|---|---|---|
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | Hao Sun, M. Schaar | | 94 | 14 | 0 | 28 Jan 2025 |
| Generalized Robot Learning Framework | Jiahuan Yan, Zhouyang Hong, Yu Zhao, Yu Tian, Yunxin Liu, Travis Davies, Luhui Hu | | 40 | 0 | 0 | 18 Sep 2024 |
| QueST: Self-Supervised Skill Abstractions for Learning Continuous Control | Atharva Mete, Haotian Xue, Albert Wilcox, Yongxin Chen, Animesh Garg | SSL | 35 | 16 | 0 | 22 Jul 2024 |
| Walking the Values in Bayesian Inverse Reinforcement Learning | Ondrej Bajgar, Alessandro Abate, Konstantinos Gatsis, Michael A. Osborne | OffRL, BDL | 30 | 0 | 0 | 15 Jul 2024 |
| A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies | Md Mirajul Islam, Xi Yang, J. Hostetter, Adittya Soukarjya Saha, Min Chi | | 29 | 1 | 0 | 04 Jun 2024 |
| How to Leverage Diverse Demonstrations in Offline Imitation Learning | Sheng Yue, Jiani Liu, Xingyuan Hua, Ju Ren, Sen Lin, Junshan Zhang, Yaoxue Zhang | OffRL | 34 | 3 | 0 | 24 May 2024 |
| Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond | Hao Sun | OffRL | 34 | 21 | 0 | 09 Oct 2023 |
| Curricular Subgoals for Inverse Reinforcement Learning | Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, Yunfu Liu, Mingli Song | | 21 | 1 | 0 | 14 Jun 2023 |
| An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions | Xi Yang, Ge Gao, Min Chi | OffRL | 29 | 2 | 0 | 15 May 2023 |
| When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Siliang Zeng, Chenliang Li, Alfredo García, Min-Fong Hong | OffRL | 34 | 13 | 0 | 15 Feb 2023 |
| CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning | Sheng Yue, Guan-Bo Wang, Wei Shao, Zhaofeng Zhang, Sen Lin, Junkai Ren, Junshan Zhang | OffRL | 28 | 20 | 0 | 09 Feb 2023 |
| Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Haoran Xu, Xianyuan Zhan, Honglei Yin, Huiling Qin | OffRL | 26 | 66 | 0 | 20 Jul 2022 |
| Model-based Offline Imitation Learning with Non-expert Data | Jeongwon Park, Lin F. Yang | OffRL | 32 | 1 | 0 | 11 Jun 2022 |
| Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation | Maximilian Igl, Daewoo Kim, Alex Kuefler, Paul Mougin, Punit Shah, K. Shiarlis, Drago Anguelov, Mark Palatucci, Brandyn White, Shimon Whiteson | | 35 | 64 | 0 | 06 May 2022 |
| Continuous Control with Action Quantization from Demonstrations | Robert Dadashi, Léonard Hussenot, Damien Vincent, Sertan Girgin, Anton Raichuk, M. Geist, Olivier Pietquin | OffRL | 33 | 23 | 0 | 19 Oct 2021 |
| A Critique of Strictly Batch Imitation Learning | Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu | OffRL | 14 | 4 | 0 | 05 Oct 2021 |
| IQ-Learn: Inverse soft-Q Learning for Imitation | Divyansh Garg, Shuvam Chakraborty, Chris Cundy, Jiaming Song, Matthieu Geist, Stefano Ermon | | 45 | 178 | 0 | 23 Jun 2021 |
| Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap | Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Steven Wu | | 14 | 73 | 0 | 04 Mar 2021 |
| Scalable Bayesian Inverse Reinforcement Learning | Alex J. Chan, M. Schaar | OffRL, BDL | 16 | 66 | 0 | 12 Feb 2021 |
| How to Train Your Energy-Based Model for Regression | Fredrik K. Gustafsson, Martin Danelljan, Radu Timofte, Thomas B. Schon | | 43 | 42 | 0 | 04 May 2020 |