1809.02925
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
9 September 2018
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
Papers citing
"Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning"
50 / 81 papers shown
Title
IL-SOAR: Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
90
0
0
27 Feb 2025
RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation
Minwoo Kim
Geunsik Bae
Jinwoo Lee
Woojae Shin
Changseung Kim
Myong-Yol Choi
Heejung Shin
H. Oh
86
0
0
04 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
94
14
0
28 Jan 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
39
0
0
22 Jan 2025
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
72
0
0
04 Jan 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
OffRL
53
0
0
03 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
Yangchun Zhang
Wang Zhou
Yirui Zhou
55
0
0
31 Dec 2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
50
3
0
17 Oct 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
47
1
0
09 Jun 2024
Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation
Shojiro Yamabe
Kazuto Fukuchi
Jun Sakuma
AAML
68
0
0
06 Jun 2024
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Md Mirajul Islam
Xi Yang
J. Hostetter
Adittya Soukarjya Saha
Min Chi
29
1
0
04 Jun 2024
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai
Hsiang-Chun Wang
Ping-Chun Hsieh
Yu-Chiang Frank Wang
Min-Hung Chen
Shao-Hua Sun
41
8
0
25 May 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
32
0
0
22 Feb 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
26
0
0
18 Jan 2024
Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning
Alihan Huyuk
Daniel Jarrett
M. Schaar
19
21
0
28 Oct 2023
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Daniel Jarrett
Alihan Huyuk
M. Schaar
AI4CE
22
27
0
28 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
34
7
0
09 Oct 2023
See to Touch: Learning Tactile Dexterity through Visual Incentives
Irmak Güzey
Yinlong Dai
Ben Evans
Soumith Chintala
Lerrel Pinto
36
31
0
21 Sep 2023
Multi-Level Compositional Reasoning for Interactive Instruction Following
Suvaansh Bhambri
Byeonghwi Kim
Jonghyun Choi
LM&Ro
41
11
0
18 Aug 2023
Curricular Subgoals for Inverse Reinforcement Learning
Shunyu Liu
Yunpeng Qing
Shuqi Xu
Hongyan Wu
Jiangtao Zhang
Jingyuan Cong
Tianhao Chen
Yunfu Liu
Mingli Song
34
1
0
14 Jun 2023
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
36
11
0
25 May 2023
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions
Xi Yang
Ge Gao
Min Chi
OffRL
32
2
0
15 May 2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Geoffrey Cideron
B. Tabanpour
Sebastian Curi
Sertan Girgin
Léonard Hussenot
Gabriel Dulac-Arnold
M. Geist
Olivier Pietquin
Robert Dadashi
OOD
84
2
0
02 May 2023
Learning Representative Trajectories of Dynamical Systems via Domain-Adaptive Imitation
Edgardo Solano-Carrillo
Jannis Stoppe
21
0
0
19 Apr 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
36
3
0
07 Apr 2023
Optimal Transport for Offline Imitation Learning
Yicheng Luo
Zhengyao Jiang
Samuel N. Cohen
Edward Grefenstette
M. Deisenroth
OffRL
43
26
0
24 Mar 2023
Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations
Siddhant Haldar
Jyothish Pari
A. Rai
Lerrel Pinto
35
66
0
02 Mar 2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning
Archit Sharma
Ahmed M. Ahmed
Rehaan Ahmad
Chelsea Finn
SSL
59
17
0
02 Mar 2023
Diffusion Model-Augmented Behavioral Cloning
Shangcheng Chen
Hsiang-Chun Wang
Ming-Hao Hsu
Chun-Mao Lai
Shao-Hua Sun
DiffM
60
31
0
26 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
40
23
0
20 Feb 2023
Visual Imitation Learning with Patch Rewards
Minghuan Liu
Tairan He
Weinan Zhang
Shuicheng Yan
Zhongwen Xu
SSL
22
13
0
02 Feb 2023
Theoretical Analysis of Offline Imitation With Supplementary Dataset
Ziniu Li
Tian Xu
Y. Yu
Zhixun Luo
OffRL
38
2
0
27 Jan 2023
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble
Chong Li
OffRL
32
0
0
07 Dec 2022
imitation: Clean Imitation Learning Implementations
Adam Gleave
Mohammad Taufeeque
Juan Rocamonde
Erik Jenner
Steven H. Wang
Sam Toyer
M. Ernestus
Nora Belrose
Scott Emmons
Stuart J. Russell
MLAU
21
30
0
22 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
39
27
0
16 Nov 2022
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
31
4
0
09 Nov 2022
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning
Eddy Hudson
Ishan Durugkar
Garrett A. Warnell
Peter Stone
OffRL
17
1
0
08 Nov 2022
Robust Imitation via Mirror Descent Inverse Reinforcement Learning
Dong-Sig Han
Hyunseok Kim
Hyun-Dong Lee
Je-hwan Ryu
Byoung-Tak Zhang
30
2
0
20 Oct 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
33
21
0
18 Oct 2022
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
40
42
0
04 Oct 2022
TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification
Min-Yu Wu
Fangwei Zhong
Yulong Xia
Hao Dong
OOD
38
17
0
02 Sep 2022
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Marwa Abdulhai
Natasha Jaques
Sergey Levine
OffRL
24
5
0
09 Aug 2022
Exploring the trade off between human driving imitation and safety for traffic simulation
Yann Koeberle
S. Sabatini
D. Tsishkou
C. Sabourin
33
4
0
09 Aug 2022
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
61
62
0
30 Jun 2022
Auto-Encoding Adversarial Imitation Learning
Kaifeng Zhang
Rui Zhao
Ziming Zhang
Yang Gao
29
1
0
22 Jun 2022
Model-Based Imitation Learning Using Entropy Regularization of Model and Policy
E. Uchibe
28
3
0
21 Jun 2022
Imitation Learning via Differentiable Physics
Siwei Chen
Xiao Ma
Zhongwen Xu
PINN
AI4CE
24
4
0
10 Jun 2022
Receding Horizon Inverse Reinforcement Learning
Yiqing Xu
Wei Gao
David Hsu
24
14
0
09 Jun 2022
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Adam Gleave
Sam Toyer
29
13
0
22 Mar 2022
Vision-Based Manipulators Need to Also See from Their Hands
Kyle Hsu
Moo Jin Kim
Rafael Rafailov
Jiajun Wu
Chelsea Finn
37
45
0
15 Mar 2022