ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.04717
  4. Cited By
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
v1v2v3 (latest)

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
    OffRL
ArXiv (abs)PDFHTML

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 467 papers shown
Extending NGU to Multi-Agent RL: A Preliminary Study
Extending NGU to Multi-Agent RL: A Preliminary Study
Juan Hernandez
Diego Fernández
Manuel Cifuentes
Denis Parra
Rodrigo Toro Icarte
52
0
0
01 Dec 2025
Periodic Skill Discovery
Periodic Skill Discovery
Jonghae Park
Daesol Cho
Jusuk Lee
D. Shim
Inkyu Jang
H. J. Kim
318
0
0
05 Nov 2025
Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings
Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings
Seyed Mahdi Basiri Azad
Joschka Boedecker
OffRLOnRL
297
0
0
28 Oct 2025
Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards
Xuan Zhang
Ruixiao Li
Zhijian Zhou
Long Li
Yulei Qin
Ke Li
Xing Sun
Xiaoyu Tan
Chao Qu
Yuan Qi
LRM
181
0
0
18 Oct 2025
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls
Dylan J. Foster
A. Krishnamurthy
Jordan T. Ash
134
1
0
13 Oct 2025
BuilderBench -- A benchmark for generalist agents
BuilderBench -- A benchmark for generalist agents
Raj Ghugare
Catherine Ji
Kathryn Wantlin
Jin Schofield
Benjamin Eysenbach
134
1
0
07 Oct 2025
Q-Learning with Shift-Aware Upper Confidence Bound in Non-Stationary Reinforcement Learning
Q-Learning with Shift-Aware Upper Confidence Bound in Non-Stationary Reinforcement Learning
H. Bui
Felix Parker
Kimia Ghobadi
Anqi Liu
OffRLOOD
89
0
0
03 Oct 2025
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
...
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
398
1
0
26 Sep 2025
Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning
Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning
Gawon Lee
Daesol Cho
H. J. Kim
195
0
0
25 Sep 2025
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models
Runpeng Dai
Linfeng Song
Haolin Liu
Zhenwen Liang
Dian Yu
...
Zhaopeng Tu
R. Liu
Tong Zheng
Hongtu Zhu
Dong Yu
LRM
172
10
0
11 Sep 2025
What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
OffRL
143
0
0
04 Sep 2025
Uncertainty-driven Adaptive Exploration
Uncertainty-driven Adaptive Exploration
Leonidas Bakopoulos
Georgios Chalkiadakis
183
0
0
03 Sep 2025
Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning
Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning
Ang Li
Zhihang Yuan
Yang Zhang
Shouda Liu
Yisen Wang
126
4
0
29 Aug 2025
Value Function Initialization for Knowledge Transfer and Jump-start in Deep Reinforcement Learning
Value Function Initialization for Knowledge Transfer and Jump-start in Deep Reinforcement Learning
Soumia Mehimeh
OffRLOnRL
162
0
0
12 Aug 2025
Exploitation Is All You Need... for Exploration
Exploitation Is All You Need... for Exploration
Micah Rentschler
Jesse Roberts
101
0
0
02 Aug 2025
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Glen Berseth
OffRL
150
1
0
02 Aug 2025
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear--Quadratic Reinforcement Learning Problems
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear--Quadratic Reinforcement Learning Problems
Yilie Huang
Xun Yu Zhou
OffRL
177
1
0
01 Jul 2025
Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design
Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design
Hampus Gummesson Svensson
Ola Engkvist
J. Janet
C. Tyrchan
M. Chehreghani
OffRL
329
0
0
26 Jun 2025
Reward Models in Deep Reinforcement Learning: A Survey
Reward Models in Deep Reinforcement Learning: A SurveyInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Rui Yu
Shenghua Wan
Yucen Wang
Chen-Xiao Gao
Le Gan
Zongzhang Zhang
De-Chuan Zhan
OffRL
158
6
0
18 Jun 2025
Uncertainty Prioritized Experience Replay
Rodrigo Carrasco-Davis
Sebastian Lee
Claudia Clopath
Will Dabney
214
1
0
10 Jun 2025
Scalable and Cost-Efficient de Novo Template-Based Molecular Generation
Scalable and Cost-Efficient de Novo Template-Based Molecular Generation
Piotr Gaiñski
Oussama Boussif
Andrei Rekesh
Dmytro Shevchuk
Ali Parviz
Mike Tyers
Robert A. Batey
Michał Koziarski
145
3
0
10 Jun 2025
Reinforcement Learning via Implicit Imitation Guidance
Reinforcement Learning via Implicit Imitation Guidance
Perry Dong
Alec M. Lessing
Annie S. Chen
Chelsea Finn
OffRL
134
3
0
09 Jun 2025
A Generative Physics-Informed Reinforcement Learning-Based Approach for Construction of Representative Drive Cycle
A Generative Physics-Informed Reinforcement Learning-Based Approach for Construction of Representative Drive Cycle
Amirreza Yasami
Mohammadali Tofigh
Mahdi Shahbakhti
Charles Robert Koch
83
0
0
09 Jun 2025
SCAR: Shapley Credit Assignment for More Efficient RLHF
SCAR: Shapley Credit Assignment for More Efficient RLHF
Meng Cao
Shuyuan Zhang
Xiao-Wen Chang
Doina Precup
362
4
0
26 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
302
4
0
26 May 2025
Counter-Inferential Behavior in Natural and Artificial Cognitive Systems
Counter-Inferential Behavior in Natural and Artificial Cognitive Systems
Serge Dolgikh
252
0
0
19 May 2025
Exploration by Random Distribution Distillation
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
310
1
0
16 May 2025
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Shuai Han
Mehdi Dastani
Shihan Wang
261
0
0
13 May 2025
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Miguel Arana-Catania
Weisi Guo
CML
254
0
0
13 May 2025
Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration
Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration
Andreas Kontogiannis
Konstantinos Papathanasiou
Yi Shen
Giorgos Stamou
Michael M. Zavlanos
G. Vouros
337
1
0
08 May 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
363
3
0
09 Apr 2025
Exploration-Driven Generative Interactive Environments
Exploration-Driven Generative Interactive EnvironmentsComputer Vision and Pattern Recognition (CVPR), 2025
N. Savov
Naser Kazemi
Mohammad Mahdi
Danda Pani Paudel
Xi Wang
Luc Van Gool
VGen3DV
267
5
0
03 Apr 2025
World Model Agents with Change-Based Intrinsic Motivation
World Model Agents with Change-Based Intrinsic Motivation
Jeremias Ferrao
Rafael Cunha
OffRLMoE
292
1
0
26 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
396
2
0
24 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
290
0
0
23 Mar 2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Cansu Sancaktar
Christian Gumbsch
Antonios Tragoudaras
Pavel Kolev
Georg Martius
LM&RoVLM
756
4
0
03 Mar 2025
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
Toru Lin
Kartik Sachdev
Linxi Fan
Jitendra Malik
Yuke Zhu
381
46
0
27 Feb 2025
Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands
Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing CommandsIEEE International Conference on Robotics and Automation (ICRA), 2025
Huaxing Huang
Wenhao Cui
Tonghe Zhang
Shengtao Li
Jinchao Han
...
Chenxu Hu
Ning Yan
Jiahao Chen
Shipu Zhang
Zheyuan Jiang
309
2
0
26 Feb 2025
The impact of intrinsic rewards on exploration in Reinforcement Learning
The impact of intrinsic rewards on exploration in Reinforcement Learning
Aya Kayal
Eduardo Pignatelli
Laura Toni
205
5
0
20 Jan 2025
PIMAEX: Multi-Agent Exploration through Peer IncentivizationInternational Conference on Agents and Artificial Intelligence (ICAART), 2025
Michael Kolle
Johannes Tochtermann
Julian Schonberger
Gerhard Stenzel
Philipp Altmann
Claudia Linnhoff-Popien
205
0
0
03 Jan 2025
$β$-DQN: Improving Deep Q-Learning By Evolving the Behavior
βββ-DQN: Improving Deep Q-Learning By Evolving the BehaviorAdaptive Agents and Multi-Agent Systems (AAMAS), 2025
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
374
3
0
01 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning ApplicationsIEEE Access (IEEE Access), 2024
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
312
52
0
31 Dec 2024
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive ExplorationAdaptive Agents and Multi-Agent Systems (AAMAS), 2024
Xingrui Yu
Zhenglin Wan
David Mark Bossens
Yueming Lyu
Qing Guo
Ivor W. Tsang
1.1K
3
0
11 Nov 2024
Deterministic Exploration via Stationary Bellman Error Maximization
Deterministic Exploration via Stationary Bellman Error Maximization
Sebastian Griesbach
Carlo DÉramo
227
0
0
31 Oct 2024
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for
  Long-Horizon Manipulation
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon ManipulationConference on Robot Learning (CoRL), 2024
Zihan Zhou
Animesh Garg
Dieter Fox
Caelan Reed Garrett
Ajay Mandlekar
338
12
0
23 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSLOffRLOnRL
756
6
0
23 Oct 2024
GUIDE: Real-Time Human-Shaped Agents
GUIDE: Real-Time Human-Shaped AgentsNeural Information Processing Systems (NeurIPS), 2024
Lingyu Zhang
Zhengran Ji
Nicholas R Waytowich
Boyuan Chen
211
7
0
19 Oct 2024
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive
  Approach
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive ApproachNeural Information Processing Systems (NeurIPS), 2024
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
159
2
0
17 Oct 2024
Automated Rewards via LLM-Generated Progress Functions
Automated Rewards via LLM-Generated Progress Functions
Vishnu Sarukkai
Brennan Shacklett
Zander Majercik
Kush S. Bhatia
Christopher Ré
Kayvon Fatahalian
270
3
0
11 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
339
4
0
07 Oct 2024
1234...8910
Next