Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1611.04717
Cited By
v1
v2
v3 (latest)
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"
50 / 466 papers shown
Title
Periodic Skill Discovery
Jonghae Park
Daesol Cho
Jusuk Lee
D. Shim
Inkyu Jang
H. J. Kim
152
0
0
05 Nov 2025
Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings
Seyed Mahdi Basiri Azad
Joschka Boedecker
OffRL
OnRL
233
0
0
28 Oct 2025
Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
Xuan Zhang
Ruixiao Li
Zhijian Zhou
Long Li
Yulei Qin
Ke Li
Xing Sun
Xiaoyu Tan
Chao Qu
Yuan Qi
LRM
148
0
0
18 Oct 2025
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls
Dylan J. Foster
A. Krishnamurthy
Jordan T. Ash
89
0
0
13 Oct 2025
BuilderBench -- A benchmark for generalist agents
Raj Ghugare
Catherine Ji
Kathryn Wantlin
Jin Schofield
Benjamin Eysenbach
96
1
0
07 Oct 2025
Q-Learning with Shift-Aware Upper Confidence Bound in Non-Stationary Reinforcement Learning
H. Bui
Felix Parker
Kimia Ghobadi
Anqi Liu
OffRL
OOD
64
0
0
03 Oct 2025
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
...
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
283
0
0
26 Sep 2025
Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning
Gawon Lee
Daesol Cho
H. J. Kim
139
0
0
25 Sep 2025
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models
Runpeng Dai
Linfeng Song
Haolin Liu
Zhenwen Liang
Dian Yu
...
Zhaopeng Tu
R. Liu
Tong Zheng
Hongtu Zhu
Dong Yu
LRM
124
7
0
11 Sep 2025
What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
OffRL
91
0
0
04 Sep 2025
Uncertainty-driven Adaptive Exploration
Leonidas Bakopoulos
Georgios Chalkiadakis
116
0
0
03 Sep 2025
Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning
Ang Li
Zhihang Yuan
Yang Zhang
Shouda Liu
Yisen Wang
100
3
0
29 Aug 2025
Value Function Initialization for Knowledge Transfer and Jump-start in Deep Reinforcement Learning
Soumia Mehimeh
OffRL
OnRL
126
0
0
12 Aug 2025
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Glen Berseth
OffRL
117
1
0
02 Aug 2025
Exploitation Is All You Need... for Exploration
Micah Rentschler
Jesse Roberts
64
0
0
02 Aug 2025
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear--Quadratic Reinforcement Learning Problems
Yilie Huang
Xun Yu Zhou
OffRL
153
1
0
01 Jul 2025
Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design
Hampus Gummesson Svensson
Ola Engkvist
J. Janet
C. Tyrchan
M. Chehreghani
OffRL
285
0
0
26 Jun 2025
Reward Models in Deep Reinforcement Learning: A Survey
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Rui Yu
Shenghua Wan
Yucen Wang
Chen-Xiao Gao
Le Gan
Zongzhang Zhang
De-Chuan Zhan
OffRL
130
6
0
18 Jun 2025
Uncertainty Prioritized Experience Replay
Rodrigo Carrasco-Davis
Sebastian Lee
Claudia Clopath
Will Dabney
170
1
0
10 Jun 2025
Scalable and Cost-Efficient de Novo Template-Based Molecular Generation
Piotr Gaiñski
Oussama Boussif
Andrei Rekesh
Dmytro Shevchuk
Ali Parviz
Mike Tyers
Robert A. Batey
Michał Koziarski
84
3
0
10 Jun 2025
Reinforcement Learning via Implicit Imitation Guidance
Perry Dong
Alec M. Lessing
Annie S. Chen
Chelsea Finn
OffRL
119
3
0
09 Jun 2025
A Generative Physics-Informed Reinforcement Learning-Based Approach for Construction of Representative Drive Cycle
Amirreza Yasami
Mohammadali Tofigh
Mahdi Shahbakhti
Charles Robert Koch
55
0
0
09 Jun 2025
SCAR: Shapley Credit Assignment for More Efficient RLHF
Meng Cao
Shuyuan Zhang
Xiao-Wen Chang
Doina Precup
305
4
0
26 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
273
4
0
26 May 2025
Counter-Inferential Behavior in Natural and Artificial Cognitive Systems
Serge Dolgikh
190
0
0
19 May 2025
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
270
1
0
16 May 2025
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Miguel Arana-Catania
Weisi Guo
CML
226
0
0
13 May 2025
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Shuai Han
Mehdi Dastani
Shihan Wang
232
0
0
13 May 2025
Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration
Andreas Kontogiannis
Konstantinos Papathanasiou
Yi Shen
Giorgos Stamou
Michael M. Zavlanos
G. Vouros
279
0
0
08 May 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
318
3
0
09 Apr 2025
Exploration-Driven Generative Interactive Environments
Computer Vision and Pattern Recognition (CVPR), 2025
N. Savov
Naser Kazemi
Mohammad Mahdi
Danda Pani Paudel
Xi Wang
Luc Van Gool
VGen
3DV
215
5
0
03 Apr 2025
World Model Agents with Change-Based Intrinsic Motivation
Jeremias Ferrao
Rafael Cunha
OffRL
MoE
225
1
0
26 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
359
2
0
24 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
233
0
0
23 Mar 2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Cansu Sancaktar
Christian Gumbsch
Antonios Tragoudaras
Pavel Kolev
Georg Martius
LM&Ro
VLM
647
3
0
03 Mar 2025
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
Toru Lin
Kartik Sachdev
Linxi Fan
Jitendra Malik
Yuke Zhu
302
43
0
27 Feb 2025
Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands
IEEE International Conference on Robotics and Automation (ICRA), 2025
Huaxing Huang
Wenhao Cui
Tonghe Zhang
Shengtao Li
Jinchao Han
...
Chenxu Hu
Ning Yan
Jiahao Chen
Shipu Zhang
Zheyuan Jiang
261
2
0
26 Feb 2025
The impact of intrinsic rewards on exploration in Reinforcement Learning
Aya Kayal
Eduardo Pignatelli
Laura Toni
164
5
0
20 Jan 2025
PIMAEX: Multi-Agent Exploration through Peer Incentivization
International Conference on Agents and Artificial Intelligence (ICAART), 2025
Michael Kolle
Johannes Tochtermann
Julian Schonberger
Gerhard Stenzel
Philipp Altmann
Claudia Linnhoff-Popien
149
0
0
03 Jan 2025
β
β
β
-DQN: Improving Deep Q-Learning By Evolving the Behavior
Adaptive Agents and Multi-Agent Systems (AAMAS), 2025
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
286
3
0
01 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
IEEE Access (IEEE Access), 2024
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
258
46
0
31 Dec 2024
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Adaptive Agents and Multi-Agent Systems (AAMAS), 2024
Xingrui Yu
Zhenglin Wan
David Mark Bossens
Yueming Lyu
Qing Guo
Ivor W. Tsang
988
3
0
11 Nov 2024
Deterministic Exploration via Stationary Bellman Error Maximization
Sebastian Griesbach
Carlo DÉramo
180
0
0
31 Oct 2024
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation
Conference on Robot Learning (CoRL), 2024
Zihan Zhou
Animesh Garg
Dieter Fox
Caelan Reed Garrett
Ajay Mandlekar
297
11
0
23 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
626
4
0
23 Oct 2024
GUIDE: Real-Time Human-Shaped Agents
Neural Information Processing Systems (NeurIPS), 2024
Lingyu Zhang
Zhengran Ji
Nicholas R Waytowich
Boyuan Chen
172
6
0
19 Oct 2024
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach
Neural Information Processing Systems (NeurIPS), 2024
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
141
2
0
17 Oct 2024
Automated Rewards via LLM-Generated Progress Functions
Vishnu Sarukkai
Brennan Shacklett
Zander Majercik
Kush S. Bhatia
Christopher Ré
Kayvon Fatahalian
229
3
0
11 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
293
4
0
07 Oct 2024
PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through Pre-trained Network Distillation
Mohammadamin Davoodabadi
Negin Hashemi Dijujin
M. Baghshah
119
0
0
02 Oct 2024
1
2
3
4
...
8
9
10
Next