Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.13834
Cited By
Self-Imitation Learning by Planning
25 March 2021
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Self-Imitation Learning by Planning"
50 / 53 papers shown
Title
SIL-RRT*: Learning Sampling Distribution through Self Imitation Learning
Xuzhe Dang
Stefan Edelkamp
66
0
0
26 Nov 2024
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
21
2
0
05 Dec 2023
End-to-end Autonomous Driving: Challenges and Frontiers
Li Chen
Peng Wu
Kashyap Chitta
Bernhard Jaeger
Andreas Geiger
Hongyang Li
3DV
40
263
0
29 Jun 2023
Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning
Sha Luo
Lambert Schomaker
14
9
0
11 Jun 2023
Adaptive Policy Learning to Additional Tasks
Wenjian Hao
Zehui Lu
Zihao Liang
Tianyu Zhou
Shaoshuai Mou
14
0
0
24 May 2023
Imitating Graph-Based Planning with Goal-Conditioned Policies
Junsup Kim
Younggyo Seo
Sungsoo Ahn
Kyunghwan Son
Jinwoo Shin
19
9
0
20 Mar 2023
Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space
Yuhang He
Irving Fang
Yiming Li
Rushin Shah
Chen Feng
26
8
0
16 Mar 2023
Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Yanjiang Guo
Jingyue Gao
Zheng Wu
Chengming Shi
Jianyu Chen
OffRL
16
4
0
03 Dec 2022
Towards Improving Exploration in Self-Imitation Learning using Intrinsic Motivation
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
SSL
25
6
0
30 Nov 2022
Emerging Threats in Deep Learning-Based Autonomous Driving: A Comprehensive Survey
Huiyun Cao
Wenlong Zou
Yinkun Wang
Ting Song
Mengjun Liu
AAML
49
4
0
19 Oct 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
19
22
0
24 Jun 2022
A Parametric Class of Approximate Gradient Updates for Policy Optimization
Ramki Gummadi
Saurabh Kumar
Junfeng Wen
Dale Schuurmans
19
0
0
17 Jun 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
25
0
0
20 May 2022
Robust Action Gap Increasing with Clipped Advantage Learning
Zhe Zhang
Yaozhong Gan
Xiaoyang Tan
10
2
0
20 Mar 2022
Evolutionary Action Selection for Gradient-based Policy Learning
Yan Ma
T. Liu
Bingsheng Wei
Yi Liu
Kang Xu
Wei Li
19
8
0
12 Jan 2022
STIR
2
^2
2
: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks
Jesús Bujalance Martín
Fabien Moutarde
OffRL
25
2
0
11 Jan 2022
Learning to Guide and to Be Guided in the Architect-Builder Problem
Paul Barde
Tristan Karch
Derek Nowrouzezahrai
Clément Moulin-Frier
C. Pal
Pierre-Yves Oudeyer
35
4
0
14 Dec 2021
Task2Sim : Towards Effective Pre-training and Transfer from Synthetic Data
Samarth Mishra
Rameswar Panda
Cheng Perng Phoo
Chun-Fu Chen
Leonid Karlinsky
Kate Saenko
Venkatesh Saligrama
Rogerio Feris
26
33
0
30 Nov 2021
Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation
Isabella Liu
Shagun Uppal
Gaurav Sukhatme
Joseph J. Lim
Péter Englert
Youngwoon Lee
11
12
0
11 Nov 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
17
5
0
27 Oct 2021
Solving Challenging Control Problems Using Two-Staged Deep Reinforcement Learning
Nitish Sontakke
Sehoon Ha
25
1
0
27 Sep 2021
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
11
1
0
19 Sep 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
105
3
0
13 Jul 2021
Imitation Learning: Progress, Taxonomies and Challenges
Boyuan Zheng
Sunny Verma
Jianlong Zhou
Ivor Tsang
Fang Chen
17
85
0
23 Jun 2021
Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han
Zhizhou Ren
Zuofan Wu
Yuanshuo Zhou
Jian-wei Peng
OffRL
11
29
0
22 Jun 2021
Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence Optimization
Taisuke Kobayashi
25
13
0
27 May 2021
Co-Imitation Learning without Expert Demonstration
Kun-Peng Ning
Hu Xu
Kun Zhu
Sheng-Jun Huang
OffRL
13
3
0
27 Mar 2021
Regularized Softmax Deep Multi-Agent
Q
Q
Q
-Learning
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
32
31
0
22 Mar 2021
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
Wenzhao Lian
Chang Su
Mel Vecerík
Ning Ye
S. Schaal
Jonathan Scholz
OffRL
19
60
0
21 Mar 2021
Bayesian Distributional Policy Gradients
Luchen Li
A. Faisal
BDL
OffRL
15
9
0
20 Mar 2021
MVGrasp: Real-Time Multi-View 3D Object Grasping in Highly Cluttered Environments
H. Kasaei
M. Kasaei
3DPC
22
38
0
19 Mar 2021
Generalizable Episodic Memory for Deep Reinforcement Learning
Haotian Hu
Jianing Ye
Guangxiang Zhu
Zhizhou Ren
Chongjie Zhang
OffRL
17
39
0
11 Mar 2021
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
F. Memarian
Wonjoon Goo
Rudolf Lioutikov
S. Niekum
Ufuk Topcu
OffRL
20
46
0
08 Mar 2021
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Mincheol Kim
S. Niekum
A. Deshpande
17
4
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
12
0
0
16 Feb 2021
Episodic Self-Imitation Learning with Hindsight
Tianhong Dai
Hengyan Liu
Anil Anthony Bharath
13
11
0
26 Nov 2020
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment
Wilka Carvalho
Anthony Liang
Kimin Lee
Sungryull Sohn
Honglak Lee
Richard L. Lewis
Satinder Singh
OffRL
13
9
0
28 Oct 2020
Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards
Zhixin Chen
Mengxiang Lin
13
5
0
14 Oct 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
14
10
0
29 Sep 2020
Maximizing BCI Human Feedback using Active Learning
Zizhao Wang
Junyao Shi
Iretiayo Akinola
Peter K. Allen
22
8
0
11 Aug 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
337
1,955
0
04 May 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
11
4
0
01 Feb 2020
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Ruiyi Zhang
Changyou Chen
Zhe Gan
Zheng Wen
Wenlin Wang
Lawrence Carin
31
5
0
20 Jan 2020
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
24
34
0
23 Dec 2019
Uncertainty-sensitive Learning and Planning with Ensembles
Piotr Milo's
Lukasz Kuciñski
K. Czechowski
Piotr Kozakowski
Maciek Klimek
OffRL
15
8
0
19 Dec 2019
Policy Continuation with Hindsight Inverse Dynamics
Hao Sun
Zhizhong Li
Xiaotong Liu
Dahua Lin
Bolei Zhou
14
38
0
30 Oct 2019
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
14
15
0
18 May 2019
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
17
4
0
03 Apr 2019
Hindsight Generative Adversarial Imitation Learning
N. Liu
Tao Lu
Yinghao Cai
Boyao Li
Shuo Wang
19
6
0
19 Mar 2019
Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning
Keita Ota
Devesh K. Jha
Tomoaki Oiki
Mamoru Miura
Takashi Nammoto
D. Nikovski
T. Mariyama
20
26
0
13 Mar 2019
1
2
Next