Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.07442
Cited By
Self-Imitation Learning via Generalized Lower Bound Q-learning
12 June 2020
Yunhao Tang
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Self-Imitation Learning via Generalized Lower Bound Q-learning"
16 / 16 papers shown
Title
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks
Yijie Guo
Bingjie Tang
Iretiayo Akinola
Dieter Fox
Abhishek Gupta
Yashraj S. Narang
44
0
0
06 Mar 2025
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
Y. Fu
Yuanheng Zhu
Haoran Li
Zijie Zhao
Jiajun Chai
Dongbin Zhao
37
0
0
06 Nov 2024
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
48
2
0
26 Jun 2024
Efficient Offline Reinforcement Learning: The Critic is Critical
Adam Jelley
Trevor A. McInroe
Sam Devlin
Amos Storkey
OffRL
37
1
0
19 Jun 2024
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
25
1
0
10 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
21
2
0
05 Dec 2023
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble
Chuan Li
OffRL
24
0
0
07 Dec 2022
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
Yijie Guo
Qiucheng Wu
Honglak Lee
OffRL
11
5
0
19 Jul 2022
Self-Imitation Learning from Demonstrations
Georgiy Pshikhachev
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
17
6
0
21 Mar 2022
Offline Reinforcement Learning with Value-based Episodic Memory
Xiaoteng Ma
Yiqin Yang
Haotian Hu
Qihan Liu
Jun Yang
Chongjie Zhang
Qianchuan Zhao
Bin Liang
OffRL
22
42
0
19 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
Self-Imitation Advantage Learning
Johan Ferret
Olivier Pietquin
M. Geist
64
20
0
22 Dec 2020
Episodic Self-Imitation Learning with Hindsight
Tianhong Dai
Hengyan Liu
Anil Anthony Bharath
13
11
0
26 Nov 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
19
10
0
29 Sep 2020
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning
Sabrina Hoppe
Marc Toussaint
OffRL
11
7
0
15 Jul 2020
Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Yunhao Tang
A. Kucukelbir
OffRL
14
16
0
13 Jun 2020
1