#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
    OffRL

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 467 papers shown
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
Emmy Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Abigail Z. Jacobs
OffRL
180
6
0
12 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
223
21
0
09 Jul 2020
See, Hear, Explore: Curiosity via Audio-Visual Association
Neural Information Processing Systems (NeurIPS), 2020
Victoria Dean
Shubham Tulsiani
Abhinav Gupta
256
64
0
07 Jul 2020
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
Gabriele Libardi
Gianni De Fabritiis
173
28
0
07 Jul 2020
Regularly Updated Deterministic Policy Gradient Algorithm
Shuai Han
Wenbo Zhou
Shuai Lu
Jiayu Yu
88
26
0
01 Jul 2020
The NetHack Learning Environment
Neural Information Processing Systems (NeurIPS), 2020
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktaschel
470
209
0
24 Jun 2020
Show me the Way: Intrinsic Motivation from Demonstrations
Adaptive Agents and Multi-Agent Systems (AAMAS), 2020
Léonard Hussenot
Robert Dadashi
Matthieu Geist
Olivier Pietquin
233
9
0
23 Jun 2020
Ecological Reinforcement Learning
John D. Co-Reyes
Suvansh Sanjeev
Glen Berseth
Abhishek Gupta
Sergey Levine
OffRL
198
24
0
22 Jun 2020
Towards Tractable Optimism in Model-Based Reinforcement Learning
Aldo Pacchiano
Philip J. Ball
Jack Parker-Holder
K. Choromanski
Stephen J. Roberts
OffRL
144
12
0
21 Jun 2020
On Reward-Free Reinforcement Learning with Linear Function Approximation
Ruosong Wang
S. Du
Lin F. Yang
Ruslan Salakhutdinov
OffRL
214
113
0
19 Jun 2020
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration
Shuai Han
Wenbo Zhou
Jing Liu
Shuai Lu
110
34
0
19 Jun 2020
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
Alekh Agarwal
Sham Kakade
A. Krishnamurthy
Wen Sun
OffRL
463
246
0
18 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
172
20
0
14 Jun 2020
Adaptive Reward-Free Exploration
International Conference on Algorithmic Learning Theory (ALT), 2020
E. Kaufmann
Pierre Ménard
O. D. Domingues
Anders Jonsson
Edouard Leurent
Michal Valko
146
86
0
11 Jun 2020
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
163
37
0
02 Jun 2020
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
International Conference on Machine Learning (ICML), 2020
Seungyul Han
Y. Sung
215
31
0
02 Jun 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
307
15
0
21 May 2020
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning
Zhenhui Ye
Yining Chen
Guang-hua Song
Bowei Yang
Sheng Fan
OffRL
195
9
0
19 May 2020
TOMA: Topological Map Abstraction for Reinforcement Learning
Zhao-Heng Yin
Wu-Jun Li
86
3
0
11 May 2020
Exploring Exploration: Comparing Children with RL Agents in Unified Environments
Eliza Kosoy
Jasmine Collins
David M. Chan
Sandy Huang
Deepak Pathak
Pulkit Agrawal
John F. Canny
Alison Gopnik
Jessica B. Hamrick
145
17
0
06 May 2020
First return, then explore
Nature, 2020
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
710
409
0
27 Apr 2020
Self-Paced Deep Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2020
Pascal Klink
Carlo D'Eramo
Jan Peters
Joni Pajarinen
ODL
366
61
0
24 Apr 2020
PBCS: Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning
International Conference on Artificial Neural Networks (ICANN), 2020
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
149
19
0
24 Apr 2020
Flexible and Efficient Long-Range Planning Through Curious Exploration
Aidan Curtis
Minjian Xin
Dilip Arumugam
Kevin T. Feigelis
Daniel L. K. Yamins
139
6
0
22 Apr 2020
Zero-Shot Learning of Text Adventure Games with Sentence-Level Semantics
Xusen Yin
Jonathan May
189
3
0
06 Apr 2020
Agent57: Outperforming the Atari Human Benchmark
International Conference on Machine Learning (ICML), 2020
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
274
567
0
30 Mar 2020
Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning
Fei Feng
Ruosong Wang
W. Yin
S. Du
Lin F. Yang
OffRL
SSL
393
7
0
15 Mar 2020
Option Discovery in the Absence of Rewards with Manifold Analysis
International Conference on Machine Learning (ICML), 2020
Amitay Bar
Ronen Talmon
Ron Meir
129
6
0
12 Mar 2020
Meta-learning curiosity algorithms
International Conference on Learning Representations (ICLR), 2020
Ferran Alet
Martin Schneider
Tomas Lozano-Perez
L. Kaelbling
240
67
0
11 Mar 2020
Exploring Unknown States with Action Balance
Yan Song
Yingfeng Chen
Yujing Hu
Changjie Fan
131
6
0
10 Mar 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments
International Conference on Learning Representations (ICLR), 2020
Roberta Raileanu
Tim Rocktaschel
268
193
0
27 Feb 2020
Optimistic Exploration even with a Pessimistic Initialisation
International Conference on Learning Representations (ICLR), 2020
Tabish Rashid
Bei Peng
Wendelin Bohmer
Shimon Whiteson
OffRL
OnRL
129
49
0
26 Feb 2020
Online Learning in Contextual Bandits using Gated Linear Networks
Neural Information Processing Systems (NeurIPS), 2020
Eren Sezener
Marcus Hutter
David Budden
Jianan Wang
J. Veness
165
10
0
21 Feb 2020
Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution Strategy
Electronic Research Archive (ERA), 2020
Jiaxing Zhang
Hoang Tran
Guannan Zhang
129
12
0
21 Feb 2020
TempLe: Learning Template of Transitions for Sample Efficient Multi-task RL
AAAI Conference on Artificial Intelligence (AAAI), 2020
Yanchao Sun
Xiangyu Yin
Furong Huang
OffRL
213
17
0
16 Feb 2020
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
International Conference on Machine Learning (ICML), 2020
Victor Campos
Alexander R. Trott
Caiming Xiong
R. Socher
Xavier Giró-i-Nieto
Jordi Torres
OffRL
478
167
0
10 Feb 2020
An Exploration of Embodied Visual Exploration
International Journal of Computer Vision (IJCV), 2020
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
LM&Ro
335
107
0
07 Jan 2020
Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning
Simone Parisi
Davide Tateo
Maximilian Hensel
Carlo D'Eramo
Jan Peters
Joni Pajarinen
OffRL
95
11
0
01 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Youssef Attia El Hili
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
340
224
0
23 Dec 2019
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam
Zafarali Ahmed
Doina Precup
116
19
0
11 Dec 2019
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
International Conference on Learning Representations (ICLR), 2019
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
276
144
0
09 Dec 2019
Bayesian Curiosity for Efficient Exploration in Reinforcement Learning
Tom Blau
Lionel Ott
Fabio Ramos
89
9
0
20 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
85
2
0
20 Nov 2019
Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy Gradient
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019
K. Luck
Mel Vecerík
Simon Stepputtis
H. B. Amor
Jonathan Scholz
85
13
0
15 Nov 2019
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
International Conference on Machine Learning (ICML), 2019
Dipendra Kumar Misra
Mikael Henaff
A. Krishnamurthy
John Langford
198
156
0
13 Nov 2019
Multi-Path Policy Optimization
Adaptive Agents and Multi-Agent Systems (AAMAS), 2019
L. Pan
Qingpeng Cai
Longbo Huang
217
2
0
11 Nov 2019
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards
Neural Information Processing Systems (NeurIPS), 2019
Alexander R. Trott
Stephan Zheng
Caiming Xiong
R. Socher
303
130
0
04 Nov 2019
Dynamic Subgoal-based Exploration via Bayesian Optimization
Yijia Wang
Matthias Poloczek
Daniel R. Jiang
401
4
0
21 Oct 2019
Zero-shot Policy Learning with Spatial Temporal Reward Decomposition on Contingency-aware Observation
Huazhe Xu
Boyuan Chen
Yang Gao
Trevor Darrell
OffRL
193
2
0
17 Oct 2019
Parallel Exploration via Negatively Correlated Search
Peng Yang
Qi Yang
Shengcai Liu
Xin Yao
221
14
0
16 Oct 2019