ResearchTrend.AI
Self-Imitation Learning by Planning

25 March 2021
Junhyuk Oh
Yijie Guo
Satinder Singh
    SSL

Papers citing "Self-Imitation Learning by Planning"

50 / 53 papers shown
SIL-RRT*: Learning Sampling Distribution through Self Imitation Learning
Xuzhe Dang
Stefan Edelkamp
66
0
0
26 Nov 2024
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
21
2
0
05 Dec 2023
End-to-end Autonomous Driving: Challenges and Frontiers
Li Chen
Peng Wu
Kashyap Chitta
Bernhard Jaeger
Andreas Geiger
Hongyang Li
3DV
40
263
0
29 Jun 2023
Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning
Sha Luo
Lambert Schomaker
14
9
0
11 Jun 2023
Adaptive Policy Learning to Additional Tasks
Wenjian Hao
Zehui Lu
Zihao Liang
Tianyu Zhou
Shaoshuai Mou
14
0
0
24 May 2023
Imitating Graph-Based Planning with Goal-Conditioned Policies
Junsup Kim
Younggyo Seo
Sungsoo Ahn
Kyunghwan Son
Jinwoo Shin
19
9
0
20 Mar 2023
Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space
Yuhang He
Irving Fang
Yiming Li
Rushin Shah
Chen Feng
26
8
0
16 Mar 2023
Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Yanjiang Guo
Jingyue Gao
Zheng Wu
Chengming Shi
Jianyu Chen
OffRL
16
4
0
03 Dec 2022
Towards Improving Exploration in Self-Imitation Learning using Intrinsic Motivation
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
SSL
25
6
0
30 Nov 2022
Emerging Threats in Deep Learning-Based Autonomous Driving: A Comprehensive Survey
Huiyun Cao
Wenlong Zou
Yinkun Wang
Ting Song
Mengjun Liu
AAML
49
4
0
19 Oct 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
19
22
0
24 Jun 2022
A Parametric Class of Approximate Gradient Updates for Policy Optimization
Ramki Gummadi
Saurabh Kumar
Junfeng Wen
Dale Schuurmans
19
0
0
17 Jun 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
25
0
0
20 May 2022
Robust Action Gap Increasing with Clipped Advantage Learning
Zhe Zhang
Yaozhong Gan
Xiaoyang Tan
10
2
0
20 Mar 2022
Evolutionary Action Selection for Gradient-based Policy Learning
Yan Ma
T. Liu
Bingsheng Wei
Yi Liu
Kang Xu
Wei Li
19
8
0
12 Jan 2022
STIR$^2$: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks
Jesús Bujalance Martín
Fabien Moutarde
OffRL
25
2
0
11 Jan 2022
Learning to Guide and to Be Guided in the Architect-Builder Problem
Paul Barde
Tristan Karch
Derek Nowrouzezahrai
Clément Moulin-Frier
C. Pal
Pierre-Yves Oudeyer
35
4
0
14 Dec 2021
Task2Sim : Towards Effective Pre-training and Transfer from Synthetic Data
Samarth Mishra
Rameswar Panda
Cheng Perng Phoo
Chun-Fu Chen
Leonid Karlinsky
Kate Saenko
Venkatesh Saligrama
Rogerio Feris
26
33
0
30 Nov 2021
Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation
Isabella Liu
Shagun Uppal
Gaurav Sukhatme
Joseph J. Lim
Péter Englert
Youngwoon Lee
11
12
0
11 Nov 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
17
5
0
27 Oct 2021
Solving Challenging Control Problems Using Two-Staged Deep Reinforcement Learning
Nitish Sontakke
Sehoon Ha
25
1
0
27 Sep 2021
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
11
1
0
19 Sep 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
105
3
0
13 Jul 2021
Imitation Learning: Progress, Taxonomies and Challenges
Boyuan Zheng
Sunny Verma
Jianlong Zhou
Ivor Tsang
Fang Chen
17
85
0
23 Jun 2021
Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han
Zhizhou Ren
Zuofan Wu
Yuanshuo Zhou
Jian-wei Peng
OffRL
11
29
0
22 Jun 2021
Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence Optimization
Taisuke Kobayashi
25
13
0
27 May 2021
Co-Imitation Learning without Expert Demonstration
Kun-Peng Ning
Hu Xu
Kun Zhu
Sheng-Jun Huang
OffRL
13
3
0
27 Mar 2021
Regularized Softmax Deep Multi-Agent $Q$-Learning
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
32
31
0
22 Mar 2021
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
Wenzhao Lian
Chang Su
Mel Vecerík
Ning Ye
S. Schaal
Jonathan Scholz
OffRL
19
60
0
21 Mar 2021
Bayesian Distributional Policy Gradients
Luchen Li
A. Faisal
BDL
OffRL
15
9
0
20 Mar 2021
MVGrasp: Real-Time Multi-View 3D Object Grasping in Highly Cluttered Environments
H. Kasaei
M. Kasaei
3DPC
22
38
0
19 Mar 2021
Generalizable Episodic Memory for Deep Reinforcement Learning
Haotian Hu
Jianing Ye
Guangxiang Zhu
Zhizhou Ren
Chongjie Zhang
OffRL
17
39
0
11 Mar 2021
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
F. Memarian
Wonjoon Goo
Rudolf Lioutikov
S. Niekum
Ufuk Topcu
OffRL
20
46
0
08 Mar 2021
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Mincheol Kim
S. Niekum
A. Deshpande
17
4
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
12
0
0
16 Feb 2021
Episodic Self-Imitation Learning with Hindsight
Tianhong Dai
Hengyan Liu
Anil Anthony Bharath
13
11
0
26 Nov 2020
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment
Wilka Carvalho
Anthony Liang
Kimin Lee
Sungryull Sohn
Honglak Lee
Richard L. Lewis
Satinder Singh
OffRL
13
9
0
28 Oct 2020
Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards
Zhixin Chen
Mengxiang Lin
13
5
0
14 Oct 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
14
10
0
29 Sep 2020
Maximizing BCI Human Feedback using Active Learning
Zizhao Wang
Junyao Shi
Iretiayo Akinola
Peter K. Allen
22
8
0
11 Aug 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
337
1,955
0
04 May 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
11
4
0
01 Feb 2020
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Ruiyi Zhang
Changyou Chen
Zhe Gan
Zheng Wen
Wenlin Wang
Lawrence Carin
31
5
0
20 Jan 2020
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
24
34
0
23 Dec 2019
Uncertainty-sensitive Learning and Planning with Ensembles
Piotr Miłoś
Łukasz Kuciński
K. Czechowski
Piotr Kozakowski
Maciek Klimek
OffRL
15
8
0
19 Dec 2019
Policy Continuation with Hindsight Inverse Dynamics
Hao Sun
Zhizhong Li
Xiaotong Liu
Dahua Lin
Bolei Zhou
14
38
0
30 Oct 2019
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
14
15
0
18 May 2019
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
17
4
0
03 Apr 2019
Hindsight Generative Adversarial Imitation Learning
N. Liu
Tao Lu
Yinghao Cai
Boyao Li
Shuo Wang
19
6
0
19 Mar 2019
Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning
Keita Ota
Devesh K. Jha
Tomoaki Oiki
Mamoru Miura
Takashi Nammoto
D. Nikovski
T. Mariyama
20
26
0
13 Mar 2019