ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.14154
  4. Cited By
Strictly Batch Imitation Learning by Energy-based Distribution Matching

Strictly Batch Imitation Learning by Energy-based Distribution Matching

25 June 2020
Daniel Jarrett
Ioana Bica
M. Schaar
    OffRL
ArXivPDFHTML

Papers citing "Strictly Batch Imitation Learning by Energy-based Distribution Matching"

20 / 20 papers shown
Title
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
94
14
0
28 Jan 2025
Generalized Robot Learning Framework
Generalized Robot Learning Framework
Jiahuan Yan
Zhouyang Hong
Yu Zhao
Yu Tian
Yunxin Liu
Travis Davies
Luhui Hu
40
0
0
18 Sep 2024
QueST: Self-Supervised Skill Abstractions for Learning Continuous
  Control
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control
Atharva Mete
Haotian Xue
Albert Wilcox
Yongxin Chen
Animesh Garg
SSL
35
16
0
22 Jul 2024
Walking the Values in Bayesian Inverse Reinforcement Learning
Walking the Values in Bayesian Inverse Reinforcement Learning
Ondrej Bajgar
Alessandro Abate
Konstantinos Gatsis
Michael A. Osborne
OffRL
BDL
30
0
0
15 Jul 2024
A Generalized Apprenticeship Learning Framework for Modeling
  Heterogeneous Student Pedagogical Strategies
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Md Mirajul Islam
Xi Yang
J. Hostetter
Adittya Soukarjya Saha
Min Chi
29
1
0
04 Jun 2024
How to Leverage Diverse Demonstrations in Offline Imitation Learning
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue
Jiani Liu
Xingyuan Hua
Ju Ren
Sen Lin
Junshan Zhang
Yaoxue Zhang
OffRL
34
3
0
24 May 2024
Reinforcement Learning in the Era of LLMs: What is Essential? What is
  needed? An RL Perspective on RLHF, Prompting, and Beyond
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Curricular Subgoals for Inverse Reinforcement Learning
Curricular Subgoals for Inverse Reinforcement Learning
Shunyu Liu
Yunpeng Qing
Shuqi Xu
Hongyan Wu
Jiangtao Zhang
Jingyuan Cong
Tianhao Chen
Yunfu Liu
Mingli Song
21
1
0
14 Jun 2023
An Offline Time-aware Apprenticeship Learning Framework for Evolving
  Reward Functions
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions
Xi Yang
Ge Gao
Min Chi
OffRL
29
2
0
15 May 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood
  Framework for Offline Inverse Reinforcement Learning
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
34
13
0
15 Feb 2023
CLARE: Conservative Model-Based Reward Learning for Offline Inverse
  Reinforcement Learning
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning
Sheng Yue
Guan-Bo Wang
Wei Shao
Zhaofeng Zhang
Sen Lin
Junkai Ren
Junshan Zhang
OffRL
28
20
0
09 Feb 2023
Discriminator-Weighted Offline Imitation Learning from Suboptimal
  Demonstrations
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
26
66
0
20 Jul 2022
Model-based Offline Imitation Learning with Non-expert Data
Model-based Offline Imitation Learning with Non-expert Data
Jeongwon Park
Lin F. Yang
OffRL
32
1
0
11 Jun 2022
Symphony: Learning Realistic and Diverse Agents for Autonomous Driving
  Simulation
Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Maximilian Igl
Daewoo Kim
Alex Kuefler
Paul Mougin
Punit Shah
K. Shiarlis
Drago Anguelov
Mark Palatucci
Brandyn White
Shimon Whiteson
35
64
0
06 May 2022
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
A Critique of Strictly Batch Imitation Learning
A Critique of Strictly Batch Imitation Learning
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
OffRL
14
4
0
05 Oct 2021
IQ-Learn: Inverse soft-Q Learning for Imitation
IQ-Learn: Inverse soft-Q Learning for Imitation
Divyansh Garg
Shuvam Chakraborty
Chris Cundy
Jiaming Song
Matthieu Geist
Stefano Ermon
45
178
0
23 Jun 2021
Of Moments and Matching: A Game-Theoretic Framework for Closing the
  Imitation Gap
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Steven Wu
14
73
0
04 Mar 2021
Scalable Bayesian Inverse Reinforcement Learning
Scalable Bayesian Inverse Reinforcement Learning
Alex J. Chan
M. Schaar
OffRL
BDL
16
66
0
12 Feb 2021
How to Train Your Energy-Based Model for Regression
How to Train Your Energy-Based Model for Regression
Fredrik K. Gustafsson
Martin Danelljan
Radu Timofte
Thomas B. Schon
43
42
0
04 May 2020
1