#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016

Pieter Abbeel

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 467 papers shown

Extending NGU to Multi-Agent RL: A Preliminary Study

01 Dec 2025

Periodic Skill Discovery

318

05 Nov 2025

Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings

Seyed Mahdi Basiri Azad

Joschka Boedecker

OffRL OnRL

297

28 Oct 2025

Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards

181

18 Oct 2025

Representation-Based Exploration for Language Models: From Test-Time to Post-Training

134

13 Oct 2025

BuilderBench -- A benchmark for generalist agents

134

07 Oct 2025

Q-Learning with Shift-Aware Upper Confidence Bound in Non-Stationary Reinforcement Learning

03 Oct 2025

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

...

398

26 Sep 2025

Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning

Gawon Lee

Daesol Cho

H. J. Kim

195

25 Sep 2025

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

...

172

11 Sep 2025

What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?

143

04 Sep 2025

Uncertainty-driven Adaptive Exploration

Leonidas Bakopoulos

Georgios Chalkiadakis

183

03 Sep 2025

Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning

126

29 Aug 2025

Value Function Initialization for Knowledge Transfer and Jump-start in Deep Reinforcement Learning

Soumia Mehimeh

OffRL OnRL

162

12 Aug 2025

Exploitation Is All You Need... for Exploration

Micah Rentschler

Jesse Roberts

101

02 Aug 2025

Is Exploration or Optimization the Problem for Deep Reinforcement Learning?

Glen Berseth

OffRL

150

02 Aug 2025

Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear-Quadratic Reinforcement Learning Problems

Yilie Huang

Xun Yu Zhou

OffRL

177

01 Jul 2025

Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design

Hampus Gummesson Svensson

329

26 Jun 2025

Reward Models in Deep Reinforcement Learning: A Survey

International Joint Conference on Artificial Intelligence (IJCAI), 2024

158

18 Jun 2025

Uncertainty Prioritized Experience Replay

Rodrigo Carrasco-Davis

Sebastian Lee

Claudia Clopath

Will Dabney

214

10 Jun 2025

Scalable and Cost-Efficient de Novo Template-Based Molecular Generation

145

10 Jun 2025

Reinforcement Learning via Implicit Imitation Guidance

134

09 Jun 2025

A Generative Physics-Informed Reinforcement Learning-Based Approach for Construction of Representative Drive Cycle

09 Jun 2025

SCAR: Shapley Credit Assignment for More Efficient RLHF

362

26 May 2025

DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

302

26 May 2025

Counter-Inferential Behavior in Natural and Artificial Cognitive Systems

Serge Dolgikh

252

19 May 2025

Exploration by Random Distribution Distillation

310

16 May 2025

Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning

Shuai Han

Mehdi Dastani

Shihan Wang

261

13 May 2025

Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges

Miguel Arana-Catania

Weisi Guo

CML

254

13 May 2025

Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration

Andreas Kontogiannis

Konstantinos Papathanasiou

337

08 May 2025

Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments

Licheng Luo

Mingyu Cai

363

09 Apr 2025

Exploration-Driven Generative Interactive Environments

Computer Vision and Pattern Recognition (CVPR), 2025

267

03 Apr 2025

World Model Agents with Change-Based Intrinsic Motivation

Jeremias Ferrao

Rafael Cunha

OffRL MoE

292

26 Mar 2025

Adventurer: Exploration with BiGAN for Deep Reinforcement Learning

Yongshuai Liu

Xin Liu

GAN

396

24 Mar 2025

KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies

290

23 Mar 2025

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

756

03 Mar 2025

Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids

381

27 Feb 2025

Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands

IEEE International Conference on Robotics and Automation (ICRA), 2025

...

309

26 Feb 2025

The impact of intrinsic rewards on exploration in Reinforcement Learning

Aya Kayal

Eduardo Pignatelli

Laura Toni

205

20 Jan 2025

PIMAEX: Multi-Agent Exploration through Peer Incentivization

International Conference on Agents and Artificial Intelligence (ICAART), 2025

Claudia Linnhoff-Popien

205

03 Jan 2025

β-DQN: Improving Deep Q-Learning By Evolving the Behavior

Adaptive Agents and Multi-Agent Systems (AAMAS), 2025

374

01 Jan 2025

Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications

IEEE Access, 2024

312

31 Dec 2024

Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration

Adaptive Agents and Multi-Agent Systems (AAMAS), 2024

1.1K

11 Nov 2024

Deterministic Exploration via Stationary Bellman Error Maximization

Sebastian Griesbach

Carlo D'Eramo

227

31 Oct 2024

SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation

Conference on Robot Learning (CoRL), 2024

338

23 Oct 2024

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

756

23 Oct 2024

GUIDE: Real-Time Human-Shaped Agents

Neural Information Processing Systems (NeurIPS), 2024

211

19 Oct 2024

Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach

Neural Information Processing Systems (NeurIPS), 2024

Riccardo Poiani

Nicole Nobili

Alberto Maria Metelli

Marcello Restelli

159

17 Oct 2024

Automated Rewards via LLM-Generated Progress Functions

270

11 Oct 2024

ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control

339

07 Oct 2024