Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.02925
Cited By
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
9 September 2018
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning"
50 / 71 papers shown
Title
RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation
Minwoo Kim
Geunsik Bae
Jinwoo Lee
Woojae Shin
Changseung Kim
Myong-Yol Choi
Heejung Shin
H. Oh
81
0
0
04 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
94
14
0
28 Jan 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
37
0
0
22 Jan 2025
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
67
0
0
04 Jan 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
OffRL
53
0
0
03 Jan 2025
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
42
3
0
17 Oct 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
47
1
0
09 Jun 2024
Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation
Shojiro Yamabe
Kazuto Fukuchi
Jun Sakuma
AAML
63
0
0
06 Jun 2024
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Md Mirajul Islam
Xi Yang
J. Hostetter
Adittya Soukarjya Saha
Min Chi
29
1
0
04 Jun 2024
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai
Hsiang-Chun Wang
Ping-Chun Hsieh
Yu-Chiang Frank Wang
Min-Hung Chen
Shao-Hua Sun
37
8
0
25 May 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
32
0
0
22 Feb 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
26
0
0
18 Jan 2024
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
29
7
0
09 Oct 2023
See to Touch: Learning Tactile Dexterity through Visual Incentives
Irmak Güzey
Yinlong Dai
Ben Evans
Soumith Chintala
Lerrel Pinto
26
31
0
21 Sep 2023
Multi-Level Compositional Reasoning for Interactive Instruction Following
Suvaansh Bhambri
Byeonghwi Kim
Jonghyun Choi
LM&Ro
41
11
0
18 Aug 2023
Curricular Subgoals for Inverse Reinforcement Learning
Shunyu Liu
Yunpeng Qing
Shuqi Xu
Hongyan Wu
Jiangtao Zhang
Jingyuan Cong
Tianhao Chen
Yunfu Liu
Mingli Song
23
1
0
14 Jun 2023
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
32
11
0
25 May 2023
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions
Xi Yang
Ge Gao
Min Chi
OffRL
29
2
0
15 May 2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Geoffrey Cideron
B. Tabanpour
Sebastian Curi
Sertan Girgin
Léonard Hussenot
Gabriel Dulac-Arnold
M. Geist
Olivier Pietquin
Robert Dadashi
OOD
84
2
0
02 May 2023
Learning Representative Trajectories of Dynamical Systems via Domain-Adaptive Imitation
Edgardo Solano-Carrillo
Jannis Stoppe
15
0
0
19 Apr 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
31
3
0
07 Apr 2023
Optimal Transport for Offline Imitation Learning
Yicheng Luo
Zhengyao Jiang
Samuel N. Cohen
Edward Grefenstette
M. Deisenroth
OffRL
43
26
0
24 Mar 2023
Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations
Siddhant Haldar
Jyothish Pari
A. Rai
Lerrel Pinto
24
66
0
02 Mar 2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning
Archit Sharma
Ahmed M. Ahmed
Rehaan Ahmad
Chelsea Finn
SSL
54
17
0
02 Mar 2023
Diffusion Model-Augmented Behavioral Cloning
Shangcheng Chen
Hsiang-Chun Wang
Ming-Hao Hsu
Chun-Mao Lai
Shao-Hua Sun
DiffM
55
31
0
26 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
35
23
0
20 Feb 2023
Visual Imitation Learning with Patch Rewards
Minghuan Liu
Tairan He
Weinan Zhang
Shuicheng Yan
Zhongwen Xu
SSL
22
13
0
02 Feb 2023
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble
Chong Li
OffRL
32
0
0
07 Dec 2022
imitation: Clean Imitation Learning Implementations
Adam Gleave
Mohammad Taufeeque
Juan Rocamonde
Erik Jenner
Steven H. Wang
Sam Toyer
M. Ernestus
Nora Belrose
Scott Emmons
Stuart J. Russell
MLAU
16
30
0
22 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
37
26
0
16 Nov 2022
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
17
4
0
09 Nov 2022
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning
Eddy Hudson
Ishan Durugkar
Garrett A. Warnell
Peter Stone
OffRL
15
1
0
08 Nov 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
31
21
0
18 Oct 2022
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
34
42
0
04 Oct 2022
TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification
Min-Yu Wu
Fangwei Zhong
Yulong Xia
Hao Dong
OOD
35
17
0
02 Sep 2022
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience
Marwa Abdulhai
Natasha Jaques
Sergey Levine
OffRL
24
5
0
09 Aug 2022
Exploring the trade off between human driving imitation and safety for traffic simulation
Yann Koeberle
S. Sabatini
D. Tsishkou
C. Sabourin
30
4
0
09 Aug 2022
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
48
62
0
30 Jun 2022
Auto-Encoding Adversarial Imitation Learning
Kaifeng Zhang
Rui Zhao
Ziming Zhang
Yang Gao
19
1
0
22 Jun 2022
Model-Based Imitation Learning Using Entropy Regularization of Model and Policy
E. Uchibe
23
3
0
21 Jun 2022
Imitation Learning via Differentiable Physics
Siwei Chen
Xiao Ma
Zhongwen Xu
PINN
AI4CE
24
4
0
10 Jun 2022
Receding Horizon Inverse Reinforcement Learning
Yiqing Xu
Wei Gao
David Hsu
24
14
0
09 Jun 2022
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Adam Gleave
Sam Toyer
21
13
0
22 Mar 2022
Vision-Based Manipulators Need to Also See from Their Hands
Kyle Hsu
Moo Jin Kim
Rafael Rafailov
Jiajun Wu
Chelsea Finn
34
44
0
15 Mar 2022
Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations
Hao Shen
Weikang Wan
He Wang
SSL
33
24
0
04 Mar 2022
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Yecheng Jason Ma
Andrew Shen
Dinesh Jayaraman
Osbert Bastani
OffRL
23
32
0
04 Feb 2022
Parallelized and Randomized Adversarial Imitation Learning for Safety-Critical Self-Driving Vehicles
Won Joon Yun
Myungjae Shin
Soyi Jung
S. Kwon
Joongheon Kim
22
5
0
26 Dec 2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Sabela Ramos
Sertan Girgin
Léonard Hussenot
Damien Vincent
Hanna Yakubovich
...
Piotr Stańczyk
Raphaël Marinier
Jeremiah Harmsen
Olivier Pietquin
Nikola Momchev
OffRL
35
23
0
04 Nov 2021
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
29
9
0
04 Aug 2021
1
2
Next