ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.04717
  4. Cited By
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
v1v2v3 (latest)

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
    OffRL
ArXiv (abs)PDFHTML

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 467 papers shown
Title
Curious Representation Learning for Embodied Intelligence
Curious Representation Learning for Embodied IntelligenceIEEE International Conference on Computer Vision (ICCV), 2021
Yilun Du
Chuang Gan
Phillip Isola
SSLLM&Ro
326
46
0
03 May 2021
SocialAI 0.1: Towards a Benchmark to Stimulate Research on
  Socio-Cognitive Abilities in Deep Reinforcement Learning Agents
SocialAI 0.1: Towards a Benchmark to Stimulate Research on Socio-Cognitive Abilities in Deep Reinforcement Learning Agents
Grgur Kovač
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
ALMLM&Ro
203
2
0
27 Apr 2021
Ask & Explore: Grounded Question Answering for Curiosity-Driven
  Exploration
Ask & Explore: Grounded Question Answering for Curiosity-Driven Exploration
Jivat Neet Kaur
Yiding Jiang
Paul Pu Liang
LRM
88
2
0
24 Apr 2021
Curiosity-Driven Exploration via Latent Bayesian Surprise
Curiosity-Driven Exploration via Latent Bayesian SurpriseAAAI Conference on Artificial Intelligence (AAAI), 2021
Pietro Mazzaglia
Ozan Çatal
Tim Verbelen
Bart Dhoedt
216
41
0
15 Apr 2021
Rule-Based Reinforcement Learning for Efficient Robot Navigation with
  Space Reduction
Rule-Based Reinforcement Learning for Efficient Robot Navigation with Space ReductionIEEE/ASME transactions on mechatronics (IEEE/ASME Trans. Mechatronics), 2021
Yuanyang Zhu
Zhi Wang
Chunlin Chen
D. Dong
142
43
0
15 Apr 2021
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy
  Behavior Representation for Deep Reinforcement Learning
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement Learning
Ammar Fayad
M. Ibrahim
BDL
132
3
0
09 Apr 2021
Bayesian Distributional Policy Gradients
Bayesian Distributional Policy GradientsAAAI Conference on Artificial Intelligence (AAAI), 2021
Luchen Li
A. Faisal
BDLOffRL
174
11
0
20 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with
  Curiosity Contrastive Forward Dynamics Model
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics ModelIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
116
19
0
15 Mar 2021
Discovering Diverse Multi-Agent Strategic Behavior via Reward
  Randomization
Discovering Diverse Multi-Agent Strategic Behavior via Reward RandomizationInternational Conference on Learning Representations (ICLR), 2021
Zhen-Yu Tang
Chao Yu
Boyuan Chen
Huazhe Xu
Xiaolong Wang
Fei Fang
S. Du
Yu Wang
Yi Wu
201
61
0
08 Mar 2021
Reinforcement Learning, Bit by Bit
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
448
76
0
06 Mar 2021
Reinforcement Learning for Adaptive Mesh Refinement
Reinforcement Learning for Adaptive Mesh RefinementInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Jiachen Yang
T. Dzanic
Brenden K. Petersen
Junpei Kudo
K. Mittal
...
T. Zhao
H. Zha
T. Kolev
Robert W. Anderson
Daniel Faissol
AI4CE
135
57
0
01 Mar 2021
A Probabilistic Interpretation of Self-Paced Learning with Applications
  to Reinforcement Learning
A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement LearningJournal of machine learning research (JMLR), 2021
Pascal Klink
Hany Abdulsamad
Boris Belousov
Carlo DÉramo
Jan Peters
Joni Pajarinen
208
29
0
25 Feb 2021
No-Regret Reinforcement Learning with Heavy-Tailed Rewards
No-Regret Reinforcement Learning with Heavy-Tailed RewardsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Vincent Zhuang
Yanan Sui
822
12
0
25 Feb 2021
State Entropy Maximization with Random Encoders for Efficient
  Exploration
State Entropy Maximization with Random Encoders for Efficient ExplorationInternational Conference on Machine Learning (ICML), 2021
Younggyo Seo
Lili Chen
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
223
146
0
18 Feb 2021
DEUP: Direct Epistemic Uncertainty Prediction
DEUP: Direct Epistemic Uncertainty Prediction
Salem Lahlou
Moksh Jain
Hadi Nekoei
V. Butoi
Paul Bertin
Jarrid Rector-Brooks
Maksym Korablyov
Yoshua Bengio
PERUQLMUQCVUD
565
108
0
16 Feb 2021
Policy Augmentation: An Exploration Strategy for Faster Convergence of
  Deep Reinforcement Learning Algorithms
Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning AlgorithmsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
A. Mahyari
90
1
0
10 Feb 2021
Modeling the Interaction between Agents in Cooperative Multi-Agent
  Reinforcement Learning
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2021
Xiaoteng Ma
Yiqin Yang
Chenghao Li
Yiwen Lu
Qianchuan Zhao
Yang Jun
163
15
0
10 Feb 2021
Sparse Reward Exploration via Novelty Search and Emitters
Sparse Reward Exploration via Novelty Search and EmittersAnnual Conference on Genetic and Evolutionary Computation (GECCO), 2021
Giuseppe Paolo
Alexandre Coninx
Stéphane Doncieux
Alban Laflaquière
204
19
0
05 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient
  Reinforcement Learning
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Dong Wang
Martin Riedmiller
OffRL
180
17
0
23 Jan 2021
Rank the Episodes: A Simple Approach for Exploration in
  Procedurally-Generated Environments
Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated EnvironmentsInternational Conference on Learning Representations (ICLR), 2021
Daochen Zha
Wenye Ma
Lei Yuan
Helen Zhou
Ji Liu
184
47
0
20 Jan 2021
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning
  for Decentralized Traffic Signal Control
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal ControlIEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
Liwen Zhu
Peixi Peng
Zongqing Lu
Xiangqian Wang
Yonghong Tian
369
25
0
04 Jan 2021
Improved Sample Complexity for Incremental Autonomous Exploration in
  MDPs
Improved Sample Complexity for Incremental Autonomous Exploration in MDPsNeural Information Processing Systems (NeurIPS), 2020
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
163
13
0
29 Dec 2020
Self-Imitation Advantage Learning
Self-Imitation Advantage LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2020
Johan Ferret
Olivier Pietquin
Matthieu Geist
243
21
0
22 Dec 2020
Curiosity in exploring chemical space: Intrinsic rewards for deep
  molecular reinforcement learning
Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning
Luca Thiede
Mario Krenn
AkshatKumar Nigam
Alán Aspuru-Guzik
204
33
0
17 Dec 2020
Policy Manifold Search for Improving Diversity-based Neuroevolution
Policy Manifold Search for Improving Diversity-based Neuroevolution
Nemanja Rakićević
Antoine Cully
Petar Kormushev
107
0
0
15 Dec 2020
General Policies, Serializations, and Planning Width
General Policies, Serializations, and Planning Width
Blai Bonet
Hector Geffner
152
3
0
15 Dec 2020
Model-based Reinforcement Learning for Continuous Control with Posterior
  Sampling
Model-based Reinforcement Learning for Continuous Control with Posterior SamplingInternational Conference on Machine Learning (ICML), 2020
Ying Fan
Yifei Ming
291
21
0
20 Nov 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep
  Reinforcement Learning Research
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning ResearchInternational Conference on Machine Learning (ICML), 2020
J. Obando-Ceron
Pablo Samuel Castro
OffRL
260
118
0
20 Nov 2020
Leveraging the Variance of Return Sequences for Exploration Policy
Leveraging the Variance of Return Sequences for Exploration Policy
Zerong Xi
G. Sukthankar
114
0
0
17 Nov 2020
ACDER: Augmented Curiosity-Driven Experience Replay
ACDER: Augmented Curiosity-Driven Experience ReplayIEEE International Conference on Robotics and Automation (ICRA), 2020
Boyao Li
Tao Lu
Jiayi Li
N. Lu
Yinghao Cai
Shuo Wang
130
20
0
16 Nov 2020
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the
  Hessian
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the HessianNeural Information Processing Systems (NeurIPS), 2020
Jack Parker-Holder
Luke Metz
Cinjon Resnick
Hengyuan Hu
Adam Lerer
Alistair Letcher
A. Peysakhovich
Aldo Pacchiano
Jakob N. Foerster
165
25
0
12 Nov 2020
Perturbation-based exploration methods in deep reinforcement learning
Perturbation-based exploration methods in deep reinforcement learning
Sneha Aenugu
84
0
0
10 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via
  Reset-Games
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
193
34
0
10 Nov 2020
How do Offline Measures for Exploration in Reinforcement Learning
  behave?
How do Offline Measures for Exploration in Reinforcement Learning behave?
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
OffRL
92
2
0
29 Oct 2020
TAMPC: A Controller for Escaping Traps in Novel Environments
TAMPC: A Controller for Escaping Traps in Novel EnvironmentsIEEE Robotics and Automation Letters (RA-L), 2020
Sheng Zhong
Zhenyuan Zhang
Nima Fazeli
Dmitry Berenson
210
8
0
23 Oct 2020
Batch Exploration with Examples for Scalable Robotic Reinforcement
  Learning
Batch Exploration with Examples for Scalable Robotic Reinforcement Learning
Annie S. Chen
H. Nam
Suraj Nair
Chelsea Finn
OffRL
150
27
0
22 Oct 2020
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced
  Reinforcement Learning
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning
Shen Ren
Qianxiao Li
Liye Zhang
Zheng Qin
Bo Yang
46
0
0
22 Oct 2020
An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse
  Rewards
An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse Rewards
Siyu Dai
Wenyuan Xu
Andreas G. Hofmann
B. Williams
201
11
0
15 Oct 2020
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement
  Learning
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Tarun Gupta
Anuj Mahajan
Bei Peng
Wendelin Bohmer
Shimon Whiteson
OffRL
249
58
0
06 Oct 2020
Latent World Models For Intrinsically Motivated Exploration
Latent World Models For Intrinsically Motivated ExplorationNeural Information Processing Systems (NeurIPS), 2020
Aleksandr Ermolov
Andrii Zadaianchuk
310
24
0
05 Oct 2020
Exploration in Approximate Hyper-State Space for Meta Reinforcement
  Learning
Exploration in Approximate Hyper-State Space for Meta Reinforcement LearningInternational Conference on Machine Learning (ICML), 2020
L. Zintgraf
Leo Feng
Cong Lu
Maximilian Igl
Kristian Hartikainen
Katja Hofmann
Shimon Whiteson
349
43
0
02 Oct 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep
  Q-Network
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network
Xing Wang
A. Vinel
108
0
0
29 Sep 2020
Novelty Search in Representational Space for Sample Efficient
  Exploration
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
235
48
0
28 Sep 2020
SEMI: Self-supervised Exploration via Multisensory Incongruity
SEMI: Self-supervised Exploration via Multisensory IncongruityIEEE International Conference on Robotics and Automation (ICRA), 2020
Jianren Wang
Ziwen Zhuang
Hang Zhao
SSL
151
1
0
26 Sep 2020
Revisiting Design Choices in Proximal Policy Optimization
Revisiting Design Choices in Proximal Policy Optimization
Chloe Ching-Yun Hsu
Celestine Mendler-Dünner
Moritz Hardt
249
60
0
23 Sep 2020
Evolutionary Reinforcement Learning via Cooperative Coevolutionary
  Negatively Correlated Search
Evolutionary Reinforcement Learning via Cooperative Coevolutionary Negatively Correlated SearchSwarm and Evolutionary Computation (Swarm Evol. Comput.), 2020
Hu Zhang
Peng Yang
Yang Yu
Mingjiang Li
Shengcai Liu
204
23
0
08 Sep 2020
REMAX: Relational Representation for Multi-Agent Exploration
REMAX: Relational Representation for Multi-Agent ExplorationAdaptive Agents and Multi-Agent Systems (AAMAS), 2020
Heechang Ryu
Hayong Shin
Jinkyoo Park
167
4
0
12 Aug 2020
GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep
  Reinforcement Learning
GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep Reinforcement LearningIEEE Transactions on Cognitive and Developmental Systems (TCDS), 2020
Grgur Kovač
A. Laversanne-Finot
Pierre-Yves Oudeyer
226
12
0
10 Aug 2020
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events
Noisy Agents: Self-supervised Exploration by Predicting Auditory EventsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2020
Chuang Gan
Xiaoyu Chen
Phillip Isola
Antonio Torralba
J. Tenenbaum
136
7
0
27 Jul 2020
Explore and Explain: Self-supervised Navigation and Recounting
Explore and Explain: Self-supervised Navigation and RecountingInternational Conference on Pattern Recognition (ICPR), 2020
Roberto Bigazzi
Federico Landi
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
EgoVLM&Ro
162
18
0
14 Jul 2020
Previous
123...1056789
Next