Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning

9 September 2018

Ilya Kostrikov

Kumar Krishna Agrawal

Papers citing "Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning"

Showing 50 of 71 citing papers.

RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation Minwoo Kim Geunsik Bae Jinwoo Lee Woojae Shin Changseung Kim Myong-Yol Choi Heejung Shin H. Oh 81 0 0 04 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning Hao Sun M. Schaar 94 14 0 28 Jan 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration Yirui Zhou Xiaowei Liu Xiaofeng Zhang Yangchun Zhang 37 0 0 22 Jan 2025
SR-Reward: Taking The Path More Traveled Seyed Mahdi Basiri Azad Zahra Padar Gabriel Kalweit Joschka Boedecker OffRL 67 0 0 04 Jan 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning Utsav Singh Souradip Chakraborty Wesley A Suttle Brian M. Sadler Vinay P. Namboodiri Amrit Singh Bedi OffRL 53 0 0 03 Jan 2025
Diffusing States and Matching Scores: A New Framework for Imitation Learning Runzhe Wu Yiding Chen Gokul Swamy Kianté Brantley Wen Sun DiffM 42 3 0 17 Oct 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning Utsav Singh Pramit Bhattacharyya Vinay P. Namboodiri LM&Ro 47 1 0 09 Jun 2024
Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation Shojiro Yamabe Kazuto Fukuchi Jun Sakuma AAML 63 0 0 06 Jun 2024
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies Md Mirajul Islam Xi Yang J. Hostetter Adittya Soukarjya Saha Min Chi 29 1 0 04 Jun 2024
Diffusion-Reward Adversarial Imitation Learning Chun-Mao Lai Hsiang-Chun Wang Ping-Chun Hsieh Yu-Chiang Frank Wang Min-Hung Chen Shao-Hua Sun 37 8 0 25 May 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay Catherine Weaver Chen Tang Ce Hao Kenta Kawamoto Masayoshi Tomizuka Wei Zhan OffRL 32 0 0 22 Feb 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon Hee-Jun Ahn Seong-Woong Shim Byung-Jun Lee 26 0 0 18 Jan 2024
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning Fan Luo Tian Xu Xingchen Cao Yang Yu OffRL 29 7 0 09 Oct 2023
See to Touch: Learning Tactile Dexterity through Visual Incentives Irmak Güzey Yinlong Dai Ben Evans Soumith Chintala Lerrel Pinto 26 31 0 21 Sep 2023
Multi-Level Compositional Reasoning for Interactive Instruction Following Suvaansh Bhambri Byeonghwi Kim Jonghyun Choi LM&Ro 41 11 0 18 Aug 2023
Curricular Subgoals for Inverse Reinforcement Learning Shunyu Liu Yunpeng Qing Shuqi Xu Hongyan Wu Jiangtao Zhang Jingyuan Cong Tianhao Chen Yunfu Liu Mingli Song 23 1 0 14 Jun 2023
Coherent Soft Imitation Learning Joe Watson Sandy H. Huang Nicholas Heess 32 11 0 25 May 2023
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions Xi Yang Ge Gao Min Chi OffRL 29 2 0 15 May 2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning Geoffrey Cideron B. Tabanpour Sebastian Curi Sertan Girgin Léonard Hussenot Gabriel Dulac-Arnold M. Geist Olivier Pietquin Robert Dadashi OOD 84 2 0 02 May 2023
Learning Representative Trajectories of Dynamical Systems via Domain-Adaptive Imitation Edgardo Solano-Carrillo Jannis Stoppe 15 0 0 19 Apr 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction Utsav Singh Vinay P. Namboodiri 31 3 0 07 Apr 2023
Optimal Transport for Offline Imitation Learning Yicheng Luo Zhengyao Jiang Samuel N. Cohen Edward Grefenstette M. Deisenroth OffRL 43 26 0 24 Mar 2023
Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations Siddhant Haldar Jyothish Pari A. Rai Lerrel Pinto 24 66 0 02 Mar 2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning Archit Sharma Ahmed M. Ahmed Rehaan Ahmad Chelsea Finn SSL 54 17 0 02 Mar 2023
Diffusion Model-Augmented Behavioral Cloning Shangcheng Chen Hsiang-Chun Wang Ming-Hao Hsu Chun-Mao Lai Shao-Hua Sun DiffM 55 31 0 26 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot Tao Huang Kai-xiang Chen Bin Li Yunhui Liu Qingxu Dou 35 23 0 20 Feb 2023
Visual Imitation Learning with Patch Rewards Minghuan Liu Tairan He Weinan Zhang Shuicheng Yan Zhongwen Xu SSL 22 13 0 02 Feb 2023
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble Chong Li OffRL 32 0 0 07 Dec 2022
imitation: Clean Imitation Learning Implementations Adam Gleave Mohammad Taufeeque Juan Rocamonde Erik Jenner Steven H. Wang Sam Toyer M. Ernestus Nora Belrose Scott Emmons Stuart J. Russell MLAU 16 30 0 22 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans Minttu Alakuijala Gabriel Dulac-Arnold Julien Mairal Jean Ponce Cordelia Schmid OffRL 37 26 0 16 Nov 2022
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration Alexandre Chenu Olivier Serris Olivier Sigaud Nicolas Perrin-Gilbert 17 4 0 09 Nov 2022
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning Eddy Hudson Ishan Durugkar Garrett A. Warnell Peter Stone OffRL 15 1 0 08 Nov 2022
Planning for Sample Efficient Imitation Learning Zhao-Heng Yin Weirui Ye Qifeng Chen Yang Gao OffRL 31 21 0 18 Oct 2022
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees Siliang Zeng Chenliang Li Alfredo García Min-Fong Hong 34 42 0 04 Oct 2022
TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification Min-Yu Wu Fangwei Zhong Yulong Xia Hao Dong OOD 35 17 0 02 Sep 2022
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience Marwa Abdulhai Natasha Jaques Sergey Levine OffRL 24 5 0 09 Aug 2022
Exploring the trade off between human driving imitation and safety for traffic simulation Yann Koeberle S. Sabatini D. Tsishkou C. Sabourin 30 4 0 09 Aug 2022
Watch and Match: Supercharging Imitation with Regularized Optimal Transport Siddhant Haldar Vaibhav Mathur Denis Yarats Lerrel Pinto 48 62 0 30 Jun 2022
Auto-Encoding Adversarial Imitation Learning Kaifeng Zhang Rui Zhao Ziming Zhang Yang Gao 19 1 0 22 Jun 2022
Model-Based Imitation Learning Using Entropy Regularization of Model and Policy E. Uchibe 23 3 0 21 Jun 2022
Imitation Learning via Differentiable Physics Siwei Chen Xiao Ma Zhongwen Xu PINN AI4CE 24 4 0 10 Jun 2022
Receding Horizon Inverse Reinforcement Learning Yiqing Xu Wei Gao David Hsu 24 14 0 09 Jun 2022
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning Adam Gleave Sam Toyer 21 13 0 22 Mar 2022
Vision-Based Manipulators Need to Also See from Their Hands Kyle Hsu Moo Jin Kim Rafael Rafailov Jiajun Wu Chelsea Finn 34 44 0 15 Mar 2022
Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations Hao Shen Weikang Wan He Wang SSL 33 24 0 04 Mar 2022
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching Yecheng Jason Ma Andrew Shen Dinesh Jayaraman Osbert Bastani OffRL 23 32 0 04 Feb 2022
Parallelized and Randomized Adversarial Imitation Learning for Safety-Critical Self-Driving Vehicles Won Joon Yun Myungjae Shin Soyi Jung S. Kwon Joongheon Kim 22 5 0 26 Dec 2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning Sabela Ramos Sertan Girgin Léonard Hussenot Damien Vincent Hanna Yakubovich ... Piotr Stańczyk Raphaël Marinier Jeremiah Harmsen Olivier Pietquin Nikola Momchev OffRL 35 23 0 04 Nov 2021
Continuous Control with Action Quantization from Demonstrations Robert Dadashi Léonard Hussenot Damien Vincent Sertan Girgin Anton Raichuk M. Geist Olivier Pietquin OffRL 33 23 0 19 Oct 2021
A Pragmatic Look at Deep Imitation Learning Kai Arulkumaran D. Lillrank 29 9 0 04 Aug 2021