ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.12919
  4. Cited By
First return, then explore

First return, then explore

27 April 2020
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
ArXivPDFHTML

Papers citing "First return, then explore"

50 / 52 papers shown
Title
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
36
0
0
09 Apr 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
95
2
0
24 Mar 2025
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Nguyen H K. Do
Truc Nguyen
Malik Hassanaly
Raed Alharbi
Jung Taek Seo
My T. Thai
49
0
0
09 Mar 2025
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Cong Lu
Shengran Hu
Jeff Clune
LLMAG
34
9
0
24 May 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
41
4
0
29 Feb 2024
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
67
5
0
13 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
19
1
0
12 Dec 2023
Backward Learning for Goal-Conditioned Policies
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
22
1
0
08 Dec 2023
EduGym: An Environment and Notebook Suite for Reinforcement Learning
  Education
EduGym: An Environment and Notebook Suite for Reinforcement Learning Education
Thomas M. Moerland
Matthias Muller-Brockhausen
Zhao Yang
Andrius Bernatavicius
Koen Ponse
Tom Kouwenhoven
Andreas Sauter
Michiel van der Meer
Bram M. Renting
Aske Plaat
OffRL
19
0
0
17 Nov 2023
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic
  Forgetting in Curiosity
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity
Jaedong Hwang
Zhang-Wei Hong
Eric Chen
Akhilan Boopathy
Pulkit Agrawal
Ila Fiete
CLL
23
5
0
26 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
28
34
0
13 Oct 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
29
7
0
14 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
23
9
0
29 May 2023
Augmenting Autotelic Agents with Large Language Models
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAG
LM&Ro
26
22
0
21 May 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement
  Learning
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li
Dan Qiao
Baoxiang Wang
Xiangfeng Wang
Bo Jin
H. Zha
18
5
0
18 May 2023
MIMEx: Intrinsic Rewards from Masked Input Modeling
MIMEx: Intrinsic Rewards from Masked Input Modeling
Toru Lin
Allan Jabri
OffRL
16
6
0
15 May 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
20
6
0
14 Apr 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation
  Using Scene Object Spectrum Grounding
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
17
19
0
07 Mar 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement
  Learning
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
12
4
0
21 Feb 2023
Sample Efficient Deep Reinforcement Learning via Local Planning
Sample Efficient Deep Reinforcement Learning via Local Planning
Dong Yin
S. Thiagarajan
N. Lazić
Nived Rajaraman
Botao Hao
Csaba Szepesvári
11
4
0
29 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
14
18
0
26 Jan 2023
Near-optimal Policy Identification in Active Reinforcement Learning
Near-optimal Policy Identification in Active Reinforcement Learning
Xiang Li
Viraj Mehta
Johannes Kirschner
I. Char
W. Neiswanger
J. Schneider
Andreas Krause
Ilija Bogunovic
OffRL
35
6
0
19 Dec 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
13
1
0
28 Nov 2022
Continuous Episodic Control
Continuous Episodic Control
Zhao Yang
Thomas M. Moerland
Mike Preuss
Aske Plaat
OffRL
9
3
0
28 Nov 2022
ActMAD: Activation Matching to Align Distributions for
  Test-Time-Training
ActMAD: Activation Matching to Align Distributions for Test-Time-Training
M. Jehanzeb Mirza
Pol Jané Soneira
W. Lin
Mateusz Koziñski
Horst Possegger
Horst Bischof
VLM
TTA
24
24
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
6
4
0
22 Nov 2022
Redeeming Intrinsic Rewards via Constrained Optimization
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
J. Pajarinen
Pulkit Agrawal
OnRL
20
23
0
14 Nov 2022
Quality-diversity in dissimilarity spaces
Quality-diversity in dissimilarity spaces
Steve Huntsman
20
1
0
14 Nov 2022
Leveraging Sequentiality in Reinforcement Learning from a Single
  Demonstration
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
6
4
0
09 Nov 2022
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal
  Conditioning
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning
Caroline Wang
Garrett A. Warnell
Peter Stone
32
3
0
26 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
53
8
0
23 Oct 2022
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement
  Learning
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning
Firas Jarboui
Ahmed Akakzia
6
0
0
26 Sep 2022
An information-theoretic perspective on intrinsic motivation in
  reinforcement learning: a survey
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
24
35
0
19 Sep 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
12
22
0
24 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
67
0
16 Jun 2022
Social Network Structure Shapes Innovation: Experience-sharing in RL
  with SAPIENS
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS
Eleni Nisioti
Matéo Mahaut
Pierre-Yves Oudeyer
Ida Momennejad
Clément Moulin-Frier
11
9
0
10 Jun 2022
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated
  and Musculoskeletal Systems
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems
Pierre Schumacher
D. Haeufle
Dieter Buchler
S. Schmitt
Georg Martius
15
29
0
30 May 2022
Reinforcement Learning for Branch-and-Bound Optimisation using
  Retrospective Trajectories
Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories
Christopher W. F. Parsonson
Alexandre Laterre
Thomas D. Barrett
14
19
0
28 May 2022
Divide & Conquer Imitation Learning
Divide & Conquer Imitation Learning
Alexandre Chenu
Nicolas Perrin-Gilbert
Olivier Sigaud
8
5
0
15 Apr 2022
Learning Design and Construction with Varying-Sized Materials via
  Prioritized Memory Resets
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
25
4
0
12 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained
  Representations
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
13
67
0
08 Apr 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
14
107
0
05 Apr 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in
  Intrinsic Motivation
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation
Zhao Yang
Thomas M. Moerland
Mike Preuss
Aske Plaat
10
1
0
29 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
12
9
0
24 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
31
9
0
23 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
26
7
0
16 Feb 2022
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing
  of Software
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software
Chuan-Yung Tsai
Graham W. Taylor
6
2
0
29 Jan 2022
Provable Hierarchy-Based Meta-Reinforcement Learning
Provable Hierarchy-Based Meta-Reinforcement Learning
Kurtland Chua
Qi Lei
Jason D. Lee
16
5
0
18 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
28
91
0
14 Sep 2021
12
Next