ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.07442
  4. Cited By
Self-Imitation Learning via Generalized Lower Bound Q-learning

Self-Imitation Learning via Generalized Lower Bound Q-learning

12 June 2020
Yunhao Tang
    SSL
ArXivPDFHTML

Papers citing "Self-Imitation Learning via Generalized Lower Bound Q-learning"

16 / 16 papers shown
Title
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks
Yijie Guo
Bingjie Tang
Iretiayo Akinola
Dieter Fox
Abhishek Gupta
Yashraj S. Narang
44
0
0
06 Mar 2025
CPIG: Leveraging Consistency Policy with Intention Guidance for
  Multi-agent Exploration
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
Y. Fu
Yuanheng Zhu
Haoran Li
Zijie Zhao
Jiajun Chai
Dongbin Zhao
37
0
0
06 Nov 2024
Boosting Soft Q-Learning by Bounding
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
48
2
0
26 Jun 2024
Efficient Offline Reinforcement Learning: The Critic is Critical
Efficient Offline Reinforcement Learning: The Critic is Critical
Adam Jelley
Trevor A. McInroe
Sam Devlin
Amos Storkey
OffRL
39
1
0
19 Jun 2024
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a
  High Replay Ratio and Regularization
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
25
1
0
10 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
23
2
0
05 Dec 2023
Accelerating Self-Imitation Learning from Demonstrations via Policy
  Constraints and Q-Ensemble
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble
Chong Li
OffRL
24
0
0
07 Dec 2022
Learning Action Translator for Meta Reinforcement Learning on
  Sparse-Reward Tasks
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
Yijie Guo
Qiucheng Wu
Honglak Lee
OffRL
11
5
0
19 Jul 2022
Self-Imitation Learning from Demonstrations
Self-Imitation Learning from Demonstrations
Georgiy Pshikhachev
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
17
6
0
21 Mar 2022
Offline Reinforcement Learning with Value-based Episodic Memory
Offline Reinforcement Learning with Value-based Episodic Memory
Xiaoteng Ma
Yiqin Yang
Haotian Hu
Qihan Liu
Jun Yang
Chongjie Zhang
Qianchuan Zhao
Bin Liang
OffRL
24
42
0
19 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
Self-Imitation Advantage Learning
Self-Imitation Advantage Learning
Johan Ferret
Olivier Pietquin
M. Geist
66
20
0
22 Dec 2020
Episodic Self-Imitation Learning with Hindsight
Episodic Self-Imitation Learning with Hindsight
Tianhong Dai
Hengyan Liu
Anil Anthony Bharath
13
11
0
26 Nov 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the
  Current Policy
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
19
10
0
29 Sep 2020
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep
  Reinforcement Learning
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning
Sabrina Hoppe
Marc Toussaint
OffRL
13
7
0
15 Jul 2020
Hindsight Expectation Maximization for Goal-conditioned Reinforcement
  Learning
Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Yunhao Tang
A. Kucukelbir
OffRL
16
16
0
13 Jun 2020
1