ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05054
  4. Cited By
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement
  Learning Algorithms
v1v2v3v4v5 (latest)

GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms

14 February 2018
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
ArXiv (abs)PDFHTMLGithub (35★)

Papers citing "GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms"

50 / 81 papers shown
Multi-Agent Guided Policy Optimization
Multi-Agent Guided Policy Optimization
Yueheng Li
Guangming Xie
Zongqing Lu
160
1
0
24 Jul 2025
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Miguel Arana-Catania
Weisi Guo
CML
254
0
0
13 May 2025
Beyond the Boundaries of Proximal Policy Optimization
Beyond the Boundaries of Proximal Policy Optimization
Charlie B. Tan
Edan Toledo
Benjamin Ellis
Jakob Foerster
Ferenc Huszár
210
1
0
01 Nov 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
338
4
0
07 Oct 2024
World Models with Hints of Large Language Models for Goal Achieving
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&RoLRMAI4CE
271
5
0
11 Jun 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through
  Exploiting State-Action Space Structure
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
188
2
0
01 May 2024
SPO: Sequential Monte Carlo Policy Optimisation
SPO: Sequential Monte Carlo Policy OptimisationNeural Information Processing Systems (NeurIPS), 2024
Clément Bonnet
Edan Toledo
Donal Byrne
Paul Duckworth
Alexandre Laterre
303
3
0
12 Feb 2024
Evolution Guided Generative Flow Networks
Evolution Guided Generative Flow Networks
Zarif Ikram
Ling Pan
Dianbo Liu
390
1
0
03 Feb 2024
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
Antonin Raffin
Olivier Sigaud
Jens Kober
Alin Albu-Schäffer
João Silvério
F. Stulp
184
4
0
09 Oct 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple ReuseInformation Sciences (Inf. Sci.), 2023
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
191
15
0
29 May 2023
Supplementing Gradient-Based Reinforcement Learning with Simple
  Evolutionary Ideas
Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas
H. Khadilkar
120
0
0
10 May 2023
Evolving Populations of Diverse RL Agents with MAP-Elites
Evolving Populations of Diverse RL Agents with MAP-ElitesInternational Conference on Learning Representations (ICLR), 2023
Thomas Pierrot
Arthur Flajolet
279
11
0
09 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Evolutionary Reinforcement Learning: A SurveyIntelligent Computing (IC), 2023
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
456
77
0
07 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
173
0
0
28 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Improving Deep Policy Gradients with Value Function SearchInternational Conference on Learning Representations (ICLR), 2023
Enrico Marchesini
Chris Amato
153
17
0
20 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Guiding Pretraining in Reinforcement Learning with Large Language ModelsInternational Conference on Machine Learning (ICML), 2023
Yuqing Du
Olivia Watkins
Zihan Wang
Cédric Colas
Trevor Darrell
Pieter Abbeel
Abhishek Gupta
Jacob Andreas
LM&Ro
315
230
0
13 Feb 2023
Elastic Step DQN: A novel multi-step algorithm to alleviate
  overestimation in Deep QNetworks
Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworksNeurocomputing (Neurocomputing), 2022
Adrian Ly
Richard Dazeley
Peter Vamplew
Francisco Cruz
Sunil Aryal
215
24
0
07 Oct 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative
  MARL
Towards a Standardised Performance Evaluation Protocol for Cooperative MARLNeural Information Processing Systems (NeurIPS), 2022
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
216
62
0
21 Sep 2022
A Transferable and Automatic Tuning of Deep Reinforcement Learning for
  Cost Effective Phishing Detection
A Transferable and Automatic Tuning of Deep Reinforcement Learning for Cost Effective Phishing Detection
Orel Lavie
A. Shabtai
Gilad Katz
AAMLOffRL
261
1
0
19 Sep 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on
  Exploration and Performance
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
326
30
0
08 Jun 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge
  Using Language
Asking for Knowledge: Training RL Agents to Query External Knowledge Using LanguageInternational Conference on Machine Learning (ICML), 2022
Iou-Jen Liu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
Alex Schwing
RALM
248
13
0
12 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Exploration in Deep Reinforcement Learning: A SurveyInformation Fusion (Inf. Fusion), 2022
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
309
493
0
02 May 2022
A Comparative Study of Deep Reinforcement Learning-based Transferable
  Energy Management Strategies for Hybrid Electric Vehicles
A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles
Jingyi Xu
Zirui Li
Li Gao
Junyi Ma
Qi Liu
Yanan Zhao
106
14
0
22 Feb 2022
Robots Learn Increasingly Complex Tasks with Intrinsic Motivation and
  Automatic Curriculum Learning
Robots Learn Increasingly Complex Tasks with Intrinsic Motivation and Automatic Curriculum Learning
S. Nguyen
Nicolas Duminy
A. Manoury
D. Duhaut
Cédric Buche
108
9
0
11 Feb 2022
Approximating Gradients for Differentiable Quality Diversity in
  Reinforcement Learning
Approximating Gradients for Differentiable Quality Diversity in Reinforcement LearningAnnual Conference on Genetic and Evolutionary Computation (GECCO), 2022
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
261
59
0
08 Feb 2022
Evolutionary Action Selection for Gradient-based Policy Learning
Evolutionary Action Selection for Gradient-based Policy LearningInternational Conference on Neural Information Processing (ICONIP), 2022
Yan Ma
T. Liu
Bingsheng Wei
Yi Liu
Kang Xu
Wei Li
354
12
0
12 Jan 2022
Multi-Stage Episodic Control for Strategic Exploration in Text Games
Multi-Stage Episodic Control for Strategic Exploration in Text GamesInternational Conference on Learning Representations (ICLR), 2022
Jens Tuyls
Shunyu Yao
Sham Kakade
Karthik Narasimhan
294
29
0
04 Jan 2022
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Discovering and Exploiting Sparse Rewards in a Learned Behavior SpaceEvolutionary Computation (Evol. Comput.), 2021
Giuseppe Paolo
Alexandre Coninx
Alban Laflaquière
Stéphane Doncieux
151
6
0
02 Nov 2021
A Survey of Exploration Methods in Reinforcement Learning
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
296
99
0
01 Sep 2021
Semantic Tracklets: An Object-Centric Representation for Visual
  Multi-Agent Reinforcement Learning
Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Iou-Jen Liu
Zhongzheng Ren
Raymond A. Yeh
Alex Schwing
230
20
0
06 Aug 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Cooperative Exploration for Multi-Agent Deep Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Iou-Jen Liu
Unnat Jain
Raymond A. Yeh
Alex Schwing
263
124
0
23 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Offline Meta-Reinforcement Learning with Online Self-SupervisionInternational Conference on Machine Learning (ICML), 2021
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
361
75
0
08 Jul 2021
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep
  Reinforcement Learning
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2021
Jinxin Liu
Xuetao Zhang
Qiangxing Tian
Ruihao Zhang
262
25
0
11 Apr 2021
Selection-Expansion: A Unifying Framework for Motion-Planning and
  Diversity Search Algorithms
Selection-Expansion: A Unifying Framework for Motion-Planning and Diversity Search AlgorithmsInternational Conference on Artificial Neural Networks (ICANN), 2021
Alexandre Chenu
Nicolas Perrin-Gilbert
Stéphane Doncieux
Olivier Sigaud
128
1
0
10 Apr 2021
Derivative-Free Reinforcement Learning: A Review
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
252
47
0
10 Feb 2021
Sparse Reward Exploration via Novelty Search and Emitters
Sparse Reward Exploration via Novelty Search and EmittersAnnual Conference on Genetic and Evolutionary Computation (GECCO), 2021
Giuseppe Paolo
Alexandre Coninx
Stéphane Doncieux
Alban Laflaquière
204
19
0
05 Feb 2021
Locally Persistent Exploration in Continuous Control Tasks with Sparse
  Rewards
Locally Persistent Exploration in Continuous Control Tasks with Sparse RewardsInternational Conference on Machine Learning (ICML), 2020
Susan Amin
Maziar Gomrokchi
Hossein Aboutalebi
Harsh Satija
Doina Precup
172
17
0
26 Dec 2020
High-Throughput Synchronous Deep RL
High-Throughput Synchronous Deep RLNeural Information Processing Systems (NeurIPS), 2020
Iou-Jen Liu
Raymond A. Yeh
Alex Schwing
OffRL
223
13
0
17 Dec 2020
Autotelic Agents with Intrinsically Motivated Goal-Conditioned
  Reinforcement Learning: a Short Survey
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short SurveyJournal of Artificial Intelligence Research (JAIR), 2020
Cédric Colas
Tristan Karch
Olivier Sigaud
Pierre-Yves Oudeyer
844
120
0
17 Dec 2020
BeBold: Exploration Beyond the Boundary of Explored Regions
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
191
43
0
15 Dec 2020
Path Design and Resource Management for NOMA enhanced Indoor Intelligent
  Robots
Path Design and Resource Management for NOMA enhanced Indoor Intelligent RobotsIEEE Transactions on Wireless Communications (TWC), 2020
Ruikang Zhong
Xiao-Yang Liu
Yuanwei Liu
Yue Chen
Xianbin Wang
188
15
0
23 Nov 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep
  Reinforcement Learning Research
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning ResearchInternational Conference on Machine Learning (ICML), 2020
J. Obando-Ceron
Pablo Samuel Castro
OffRL
270
119
0
20 Nov 2020
Hierarchically Organized Latent Modules for Exploratory Search in
  Morphogenetic Systems
Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic Systems
Mayalen Etcheverry
Clément Moulin-Frier
Pierre-Yves Oudeyer
234
27
0
02 Jul 2020
Continual Learning: Tackling Catastrophic Forgetting in Deep Neural
  Networks with Replay Processes
Continual Learning: Tackling Catastrophic Forgetting in Deep Neural Networks with Replay Processes
Timothée Lesort
CLL
363
24
0
01 Jul 2020
Diversity Policy Gradient for Sample Efficient Quality-Diversity
  Optimization
Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization
Thomas Pierrot
Valentin Macé
Félix Chalumeau
Arthur Flajolet
Geoffrey Cideron
Karim Beguir
Antoine Cully
Olivier Sigaud
Nicolas Perrin-Gilbert
316
74
0
15 Jun 2020
PBCS : Efficient Exploration and Exploitation Using a Synergy between
  Reinforcement Learning and Motion Planning
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion PlanningInternational Conference on Artificial Neural Networks (ICANN), 2020
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
149
19
0
24 Apr 2020
Scaling MAP-Elites to Deep Neuroevolution
Scaling MAP-Elites to Deep NeuroevolutionAnnual Conference on Genetic and Evolutionary Computation (GECCO), 2020
Cédric Colas
Joost Huizinga
Vashisht Madhavan
Jeff Clune
253
94
0
03 Mar 2020
Off-Policy Deep Reinforcement Learning with Analogous Disentangled
  Exploration
Off-Policy Deep Reinforcement Learning with Analogous Disentangled ExplorationAdaptive Agents and Multi-Agent Systems (AAMAS), 2020
Hoang Trung-Dung
Yitao Liang
Karen Ullrich
OffRL
147
4
0
25 Feb 2020
Accelerating Reinforcement Learning with a
  Directional-Gaussian-Smoothing Evolution Strategy
Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution StrategyElectronic Research Archive (ERA), 2020
Jiaxing Zhang
Hoang Tran
Guannan Zhang
126
12
0
21 Feb 2020
The problem with DDPG: understanding failures in deterministic
  environments with sparse rewards
The problem with DDPG: understanding failures in deterministic environments with sparse rewardsInternational Conference on Artificial Neural Networks (ICANN), 2019
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
115
71
0
26 Nov 2019
12
Next