ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.04717
  4. Cited By
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
v1v2v3 (latest)

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
    OffRL
ArXiv (abs)PDFHTML

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 466 papers shown
Title
Quasimetric Value Functions with Dense Rewards
Quasimetric Value Functions with Dense Rewards
Khadichabonu Valieva
Bikramjit Banerjee
OffRL
204
3
0
13 Sep 2024
Directed Exploration in Reinforcement Learning from Linear Temporal Logic
Directed Exploration in Reinforcement Learning from Linear Temporal Logic
Marco Bagatella
Andreas Krause
Georg Martius
OffRL
235
3
0
18 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from
  Contrastive RL without Rewards, Demonstrations, or Subgoals
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or SubgoalsInternational Conference on Learning Representations (ICLR), 2024
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
347
8
0
11 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
545
17
0
06 Aug 2024
Image-Based Deep Reinforcement Learning with Intrinsically Motivated
  Stimuli: On the Execution of Complex Robotic Tasks
Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks
David Valencia
Henry Williams
Yuning Xing
Trevor Gee
Minas V. Liarokapis
Bruce A. MacDonald
127
4
0
31 Jul 2024
Boosting Efficiency in Task-Agnostic Exploration through Causal
  Knowledge
Boosting Efficiency in Task-Agnostic Exploration through Causal Knowledge
Yupei Yang
Erdun Gao
Shikui Tu
Lei Xu
CML
171
2
0
30 Jul 2024
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement
  Learning
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
Minjae Cho
Chuangchuang Sun
OffRL
217
0
0
17 Jul 2024
Variable-Agnostic Causal Exploration for Reinforcement Learning
Variable-Agnostic Causal Exploration for Reinforcement Learning
Minh Hoang Nguyen
Hung Le
Svetha Venkatesh
CML
175
3
0
17 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
424
11
0
09 Jul 2024
Preference-Guided Reinforcement Learning for Efficient Exploration
Preference-Guided Reinforcement Learning for Efficient Exploration
Guojian Wang
Faguo Wu
Xinyuan Li
Tianyuan Chen
Xiao Zhang
Tianyuan Chen
Xuyang Chen
189
0
0
09 Jul 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
291
10
0
29 Jun 2024
Safety through feedback in Constrained RL
Safety through feedback in Constrained RL
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
OffRL
319
2
0
28 Jun 2024
Beyond Optimism: Exploration With Partially Observable Rewards
Beyond Optimism: Exploration With Partially Observable Rewards
Simone Parisi
Alireza Kazemipour
Michael Bowling
OffRL
161
6
0
20 Jun 2024
WoCoCo: Learning Whole-Body Humanoid Control with Sequential Contacts
WoCoCo: Learning Whole-Body Humanoid Control with Sequential ContactsConference on Robot Learning (CoRL), 2024
Chong Zhang
Wenli Xiao
Tairan He
Guanya Shi
287
76
0
10 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
155
4
0
30 May 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
372
5
0
29 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Exclusively Penalized Q-learning for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
245
3
0
23 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
231
1
0
06 May 2024
Generative Active Learning for the Search of Small-molecule Protein
  Binders
Generative Active Learning for the Search of Small-molecule Protein Binders
Maksym Korablyov
Cheng-Hao Liu
Moksh Jain
A. V. D. Sloot
Eric Jolicoeur
...
Marwin H. S. Segler
Michael M. Bronstein
A. Marinier
Mike Tyers
Yoshua Bengio
154
8
0
02 May 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through
  Exploiting State-Action Space Structure
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
168
2
0
01 May 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned
  Reinforcement Learning
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
165
0
0
19 Apr 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
Shan Xie
178
0
0
03 Apr 2024
VDSC: Enhancing Exploration Timing with Value Discrepancy and State
  Counts
VDSC: Enhancing Exploration Timing with Value Discrepancy and State Counts
Marius Captari
Remo Sasso
M. Sabatelli
67
0
0
26 Mar 2024
Efficient Episodic Memory Utilization of Cooperative Multi-Agent
  Reinforcement Learning
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na
Yunkyeong Seo
IL-Chul Moon
214
10
0
02 Mar 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy
  Regularization
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
251
16
0
22 Feb 2024
Decentralized Lifelong Path Planning for Multiple Ackerman Car-Like
  Robots
Decentralized Lifelong Path Planning for Multiple Ackerman Car-Like Robots
Teng Guo
Jingjin Yu
189
3
0
19 Feb 2024
Just Cluster It: An Approach for Exploration in High-Dimensions using
  Clustering and Pre-Trained Representations
Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained RepresentationsInternational Conference on Machine Learning (ICML), 2024
Stefan Sylvius Wagner
Stefan Harmeling
138
3
0
05 Feb 2024
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty
  Sharing
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang
Haobin Jiang
Zongqing Lu
189
8
0
03 Feb 2024
To the Max: Reinventing Reward in Reinforcement Learning
To the Max: Reinventing Reward in Reinforcement Learning
Grigorii Veviurko
Wendelin Bohmer
Mathijs de Weerdt
162
9
0
02 Feb 2024
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy
  Learning
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning
Xuecheng Niu
Akinori Ito
Takashi Nose
192
3
0
31 Jan 2024
DittoGym: Learning to Control Soft Shape-Shifting Robots
DittoGym: Learning to Control Soft Shape-Shifting RobotsInternational Conference on Learning Representations (ICLR), 2024
Suning Huang
Boyuan Chen
Huazhe Xu
Vincent Sitzmann
241
8
0
24 Jan 2024
Exploration and Anti-Exploration with Distributional Random Network
  Distillation
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
350
27
0
18 Jan 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language
  Model Critique in Text Generation
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRLALM
236
15
0
14 Jan 2024
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
...
Yicheng Luo
Jianye Hao
Youssef Attia El Hili
Haitham Bou-Ammar
Jun Wang
177
26
0
22 Dec 2023
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a
  High Replay Ratio and Regularization
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
217
1
0
10 Dec 2023
Regularity as Intrinsic Reward for Free Play
Regularity as Intrinsic Reward for Free PlayNeural Information Processing Systems (NeurIPS), 2023
Cansu Sancaktar
J. Piater
Georg Martius
176
7
0
03 Dec 2023
On-Policy Policy Gradient Reinforcement Learning Without On-Policy
  Sampling
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
Nicholas Corrado
Josiah P. Hanna
OffRL
121
5
0
14 Nov 2023
General Policies, Subgoal Structure, and Planning Width
General Policies, Subgoal Structure, and Planning WidthJournal of Artificial Intelligence Research (JAIR), 2023
Blai Bonet
Hector Geffner
111
4
0
09 Nov 2023
Accelerating Exploration with Unlabeled Prior Data
Accelerating Exploration with Unlabeled Prior Data
Qiyang Li
Jason Zhang
Dibya Ghosh
Amy Zhang
Sergey Levine
OffRLOnRL
279
15
0
09 Nov 2023
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio
  Minimization
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio MinimizationInternational Conference on Learning Representations (ICLR), 2023
Guowei Xu
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Zhecheng Yuan
...
Shuzhen Li
Yanjie Ze
Hal Daumé
Furong Huang
Huazhe Xu
226
41
0
30 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Improving Intrinsic Exploration by Creating Stationary ObjectivesInternational Conference on Learning Representations (ICLR), 2023
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
335
4
0
27 Oct 2023
Understanding when Dynamics-Invariant Data Augmentations Benefit
  Model-Free Reinforcement Learning Updates
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning UpdatesInternational Conference on Learning Representations (ICLR), 2023
Nicholas Corrado
Josiah P. Hanna
227
6
0
26 Oct 2023
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic
  Forgetting in Curiosity
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity
Jaedong Hwang
Zhang-Wei Hong
Eric Chen
Akhilan Boopathy
Pulkit Agrawal
Ila Fiete
CLL
143
5
0
26 Oct 2023
Reward Shaping for Happier Autonomous Cyber Security Agents
Reward Shaping for Happier Autonomous Cyber Security Agents
Elizabeth Bates
V. Mavroudis
Chris Hicks
166
19
0
20 Oct 2023
Provable Benefits of Multi-task RL under Non-Markovian Decision Making
  Processes
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes
Ruiquan Huang
Yuan Cheng
Jing Yang
Vincent Tan
Yingbin Liang
155
0
0
20 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware AbstractionInternational Conference on Learning Representations (ICLR), 2023
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
328
63
0
13 Oct 2023
ELDEN: Exploration via Local Dependencies
ELDEN: Exploration via Local DependenciesNeural Information Processing Systems (NeurIPS), 2023
Jiaheng Hu
Zizhao Wang
Peter Stone
Roberto Martin-Martin
187
11
0
12 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRLOnRL
258
4
0
12 Oct 2023
Generative Intrinsic Optimization: Intrinsic Control with Model Learning
Generative Intrinsic Optimization: Intrinsic Control with Model Learning
Jianfei Ma
210
0
0
12 Oct 2023
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Gregory Palmer
Chris Parry
Daniel J.B. Harrold
Chris Willis
AI4CE
224
1
0
11 Oct 2023
Previous
12345...8910
Next