v1v2v3 (latest)

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016

Pieter Abbeel

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 467 papers shown

Learning Abstract Models for Strategic Exploration and Fast Reward Transfer

180

12 Jul 2020

Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate

Mirco Mutti

Lorenzo Pratissoli

Marcello Restelli

223

09 Jul 2020

See, Hear, Explore: Curiosity via Audio-Visual AssociationNeural Information Processing Systems (NeurIPS), 2020

Victoria Dean

Shubham Tulsiani

Abhinav Gupta

256

07 Jul 2020

Guided Exploration with Proximal Policy Optimization using a Single Demonstration

Gabriele Libardi

Gianni De Fabritiis

173

07 Jul 2020

Regularly Updated Deterministic Policy Gradient Algorithm

01 Jul 2020

The NetHack Learning EnvironmentNeural Information Processing Systems (NeurIPS), 2020

470

209

24 Jun 2020

Show me the Way: Intrinsic Motivation from DemonstrationsAdaptive Agents and Multi-Agent Systems (AAMAS), 2020

Léonard Hussenot

Robert Dadashi

Matthieu Geist

Olivier Pietquin

233

23 Jun 2020

Ecological Reinforcement Learning

Abhishek Gupta

198

22 Jun 2020

Towards Tractable Optimism in Model-Based Reinforcement Learning

144

21 Jun 2020

On Reward-Free Reinforcement Learning with Linear Function Approximation

214

113

19 Jun 2020

NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration

110

19 Jun 2020

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

463

246

18 Jun 2020

Non-local Policy Optimization via Diversity-regularized Collaborative Exploration

Zhenghao Peng

Hao Sun

Bolei Zhou

172

14 Jun 2020

Adaptive Reward-Free ExplorationInternational Conference on Algorithmic Learning Theory (ALT), 2020

Pierre Ménard

Anders Jonsson

146

11 Jun 2020

Temporally-Extended ε-Greedy Exploration

Will Dabney

Georg Ostrovski

André Barreto

163

02 Jun 2020

Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient ExplorationInternational Conference on Machine Learning (ICML), 2020

Seungyul Han

Y. Sung

215

02 Jun 2020

Novel Policy Seeking with Constrained Optimization

307

21 May 2020

Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning

195

19 May 2020

TOMA: Topological Map Abstraction for Reinforcement Learning

Zhao-Heng Yin

Wu-Jun Li

11 May 2020

Exploring Exploration: Comparing Children with RL Agents in Unified Environments

Sandy Huang

145

06 May 2020

First return, then exploreNature (Nature), 2020

Jeff Clune

710

409

27 Apr 2020

Self-Paced Deep Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2020

Jan Peters

366

24 Apr 2020

PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion PlanningInternational Conference on Artificial Neural Networks (ICANN), 2020

Guillaume Matheron

Nicolas Perrin

Olivier Sigaud

149

24 Apr 2020

Flexible and Efficient Long-Range Planning Through Curious Exploration

139

22 Apr 2020

Zero-Shot Learning of Text Adventure Games with Sentence-Level Semantics

Xusen Yin

Jonathan May

189

06 Apr 2020

Agent57: Outperforming the Atari Human BenchmarkInternational Conference on Machine Learning (ICML), 2020

Adria Puigdomenech Badia

Bilal Piot

274

567

30 Mar 2020

Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning

393

15 Mar 2020

Option Discovery in the Absence of Rewards with Manifold AnalysisInternational Conference on Machine Learning (ICML), 2020

Amitay Bar

Ronen Talmon

Ron Meir

129

12 Mar 2020

Meta-learning curiosity algorithmsInternational Conference on Learning Representations (ICLR), 2020

Ferran Alet

Martin Schneider

Tomas Lozano-Perez

L. Kaelbling

240

11 Mar 2020

Exploring Unknown States with Action Balance

Yan Song

Yingfeng Chen

Yujing Hu

Changjie Fan

131

10 Mar 2020

RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated EnvironmentsInternational Conference on Learning Representations (ICLR), 2020

Roberta Raileanu

Tim Rocktaschel

268

193

27 Feb 2020

Optimistic Exploration even with a Pessimistic InitialisationInternational Conference on Learning Representations (ICLR), 2020

129

26 Feb 2020

Online Learning in Contextual Bandits using Gated Linear NetworksNeural Information Processing Systems (NeurIPS), 2020

Eren Sezener

Marcus Hutter

David Budden

Jianan Wang

J. Veness

165

21 Feb 2020

Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution StrategyElectronic Research Archive (ERA), 2020

Jiaxing Zhang

Hoang Tran

Guannan Zhang

129

21 Feb 2020

TempLe: Learning Template of Transitions for Sample Efficient Multi-task RLAAAI Conference on Artificial Intelligence (AAAI), 2020

Yanchao Sun

Xiangyu Yin

Furong Huang

OffRL

213

16 Feb 2020

Explore, Discover and Learn: Unsupervised Discovery of State-Covering SkillsInternational Conference on Machine Learning (ICML), 2020

478

167

10 Feb 2020

An Exploration of Embodied Visual ExplorationInternational Journal of Computer Vision (IJCV), 2020

Santhosh Kumar Ramakrishnan

Dinesh Jayaraman

Kristen Grauman

LM&Ro

335

107

07 Jan 2020

Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning

Jan Peters

01 Jan 2020

A Survey of Deep Reinforcement Learning in Video Games

Youssef Attia El Hili

340

224

23 Dec 2019

Marginalized State Distribution Entropy Regularization in Policy Optimization

Riashat Islam

Zafarali Ahmed

Doina Precup

116

11 Dec 2019

Optimism in Reinforcement Learning with Generalized Linear Function ApproximationInternational Conference on Learning Representations (ICLR), 2019

276

144

09 Dec 2019

Bayesian Curiosity for Efficient Exploration in Reinforcement Learning

Tom Blau

Lionel Ott

Fabio Ramos

20 Nov 2019

Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

20 Nov 2019

Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy GradientIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2019

15 Nov 2019

Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement LearningInternational Conference on Machine Learning (ICML), 2019

198

156

13 Nov 2019

Multi-Path Policy OptimizationAdaptive Agents and Multi-Agent Systems (AAMAS), 2019

L. Pan

Qingpeng Cai

Longbo Huang

217

11 Nov 2019

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped RewardsNeural Information Processing Systems (NeurIPS), 2019

303

130

04 Nov 2019

Dynamic Subgoal-based Exploration via Bayesian Optimization

Yijia Wang

Matthias Poloczek

Daniel R. Jiang

401

21 Oct 2019

Zero-shot Policy Learning with Spatial Temporal RewardDecomposition on Contingency-aware Observation

193

17 Oct 2019

Parallel Exploration via Negatively Correlated Search

221

16 Oct 2019