2207.10050
Cited By
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
20 July 2022
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
Papers citing
"Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
50 / 56 papers shown
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
22
0
0
17 Apr 2025
Robust Offline Imitation Learning Through State-level Trajectory Stitching
Shuze Wang
Yunpeng Mei
Hongjie Cao
Yetian Yuan
Gang Wang
Jian Sun
Jie Chen
OffRL
34
0
0
28 Mar 2025
Curating Demonstrations using Online Experience
Annie S. Chen
Alec M. Lessing
Yuejiang Liu
Chelsea Finn
55
0
0
05 Mar 2025
Imitation Learning from Suboptimal Demonstrations via Meta-Learning An Action Ranker
Jiangdong Fan
Hongcai He
Paul Weng
Hui Xu
Jie Shao
21
0
0
31 Dec 2024
Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation
Krzysztof Ociepa
Łukasz Flis
Krzysztof Wróbel
Adrian Gwoździej
Remigiusz Kinas
15
0
0
24 Oct 2024
UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations
Huy Hoang
Tien Mai
Pradeep Varakantham
OffRL
19
0
0
10 Oct 2024
Robust Offline Imitation Learning from Diverse Auxiliary Data
Udita Ghosh
Dripta S. Raychaudhuri
Jiachen Li
Konstantinos Karydis
A. Roy-Chowdhury
OffRL
19
0
0
04 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
18
0
0
02 Oct 2024
Towards Effective Utilization of Mixed-Quality Demonstrations in Robotic Manipulation via Segment-Level Selection and Optimization
Jingjing Chen
Hongjie Fang
Hao-Shu Fang
Cewu Lu
34
2
0
30 Sep 2024
Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
Ashutosh Nayyar
OffRL
14
0
0
17 Aug 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
25
5
0
29 Jul 2024
Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin
Pieter Abbeel
OffRL
24
3
0
22 Jul 2024
Offline Reinforcement Learning with Imputed Rewards
Carlo Romeo
Andrew D. Bagdanov
OffRL
31
0
0
15 Jul 2024
Offline Imitation Learning with Model-based Reverse Augmentation
Jie-Jing Shao
Hao-Sen Shi
Lan-Zhe Guo
Yu-Feng Li
OffRL
30
5
0
18 Jun 2024
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
29
0
0
05 Jun 2024
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu
Yang Li
Yixing Lan
Hao Gao
Wei Pan
Xin Xu
OffRL
21
0
0
30 May 2024
Instruction-Guided Visual Masking
Jinliang Zheng
Jianxiong Li
Si Cheng
Yinan Zheng
Jiaming Li
Jihao Liu
Yu Liu
Jingjing Liu
Xianyuan Zhan
24
5
0
30 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
28
3
0
25 May 2024
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue
Jiani Liu
Xingyuan Hua
Ju Ren
Sen Lin
Junshan Zhang
Yaoxue Zhang
OffRL
19
2
0
24 May 2024
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
Huy Hoang
Tien Mai
Pradeep Varakantham
OffRL
28
2
0
20 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
11
9
0
06 Feb 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
12
1
0
02 Feb 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
14
10
0
01 Feb 2024
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
8
0
0
02 Nov 2023
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
19
5
0
02 Nov 2023
MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations
Ajay Mandlekar
Soroush Nasiriany
Bowen Wen
Iretiayo Akinola
Yashraj S. Narang
Linxi Fan
Yuke Zhu
Dieter Fox
LM&Ro
71
96
0
26 Oct 2023
Imitation Learning from Purified Demonstration
Yunke Wang
Minjing Dong
Bo Du
Chang Xu
18
1
0
11 Oct 2023
Offline Imitation Learning with Variational Counterfactual Reasoning
Bowei He
Zexu Sun
Jinxin Liu
Shuai Zhang
Xu Chen
Chen-li Ma
OffRL
18
7
0
07 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
J. Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
23
19
0
06 Oct 2023
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Guan-Bo Wang
Sijie Cheng
Xianyuan Zhan
Xiangang Li
Sen Song
Yang Liu
ALM
8
227
0
20 Sep 2023
Conditional Kernel Imitation Learning for Continuous State Environments
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
A. Nayyar
15
0
0
24 Aug 2023
Contrastive Example-Based Control
Kyle Hatch
Benjamin Eysenbach
Rafael Rafailov
Tianhe Yu
Ruslan Salakhutdinov
Sergey Levine
Chelsea Finn
OffRL
20
3
0
24 Jul 2023
CEIL: Generalized Contextual Imitation Learning
Jinxin Liu
Li He
Yachen Kang
Zifeng Zhuang
Donglin Wang
Huazhe Xu
10
18
0
26 Jun 2023
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
Jinxin Liu
Lipeng Zu
Li He
Donglin Wang
OffRL
25
8
0
23 Jun 2023
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting
Zhang-Wei Hong
Pulkit Agrawal
Rémi Tachet des Combes
Romain Laroche
OffRL
16
11
0
22 Jun 2023
Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations
Tianxiang Zhao
Wenchao Yu
Suhang Wang
Lucy Wang
Xiang Zhang
Yuncong Chen
Yanchi Liu
Wei Cheng
Haifeng Chen
17
8
0
13 Jun 2023
Survival Instinct in Offline Reinforcement Learning
Anqi Li
Dipendra Kumar Misra
Andrey Kolobov
Ching-An Cheng
OffRL
8
15
0
05 Jun 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffM
OffRL
11
88
0
29 May 2023
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
19
10
0
25 May 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Ya-Qin Zhang
OffRL
OnRL
19
19
0
25 May 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
13
16
0
30 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
28
71
0
28 Mar 2023
A Survey of Demonstration Learning
André Rosa de Sousa Porfírio Correia
Luís A. Alexandre
OffRL
15
15
0
20 Mar 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
29
10
0
03 Mar 2023
Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
Yunke Wang
Bo Du
Chang Xu
8
8
0
13 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya-Qin Zhang
OffRL
23
19
0
03 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
11
62
0
02 Feb 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
11
7
0
27 Jan 2023
Theoretical Analysis of Offline Imitation With Supplementary Dataset
Ziniu Li
Tian Xu
Y. Yu
Zhixun Luo
OffRL
14
2
0
27 Jan 2023
Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment
Huihan Liu
Soroush Nasiriany
Lance Zhang
Zhiyao Bao
Yuke Zhu
22
52
0
15 Nov 2022