Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1611.04717
Cited By
v1
v2
v3 (latest)
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"
50 / 466 papers shown
Title
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
International Conference on Machine Learning (ICML), 2022
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
189
6
0
18 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
110
9
0
09 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRL
OnRL
AI4CE
158
30
0
08 Nov 2022
Curiosity-Driven Multi-Agent Exploration with Mixed Objectives
Roben Delos Reyes
Kyunghwan Son
Jinhwan Jung
Wan Ju Kang
Yung Yi
151
5
0
29 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
279
1
0
20 Oct 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Neural Information Processing Systems (NeurIPS), 2022
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
189
91
0
18 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
156
9
0
17 Oct 2022
Exploration via Elliptical Episodic Bonuses
Neural Information Processing Systems (NeurIPS), 2022
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
235
52
0
11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
Neural Information Processing Systems (NeurIPS), 2022
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
113
11
0
11 Oct 2022
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
170
5
0
05 Oct 2022
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation
Julian Alverio
Boris Katz
Andrei Barbu
177
1
0
05 Oct 2022
Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications
American Control Conference (ACC), 2022
Mingyu Cai
Makai Mann
Zachary Serlin
Kevin J. Leahy
C. Vasile
302
14
0
03 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
IEEE Conference on Decision and Control (CDC), 2022
C. Banerjee
Zhiyong Chen
N. Noman
148
6
0
01 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
214
50
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
337
12
0
19 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL
OnRL
159
11
0
15 Sep 2022
Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning
Bang You
Jingming Xie
Youping Chen
Jan Peters
Oleg Arenz
98
3
0
12 Sep 2022
Go-Explore Complex 3D Game Environments for Automated Reachability Testing
IEEE Transactions on Games (IEEE Trans. Games), 2022
Cong Lu
Raluca Georgescu
J. Verwey
131
8
0
01 Sep 2022
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Ju-Seung Byun
Andrew Perrault
OffRL
283
1
0
28 Aug 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSL
OffRL
305
22
0
27 Aug 2022
A Review of Uncertainty for Deep Reinforcement Learning
Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022
Owen Lockwood
Mei Si
178
65
0
18 Aug 2022
Reinforcement learning with experience replay and adaptation of action dispersion
Pawel Wawrzyñski
Wojciech Masarczyk
M. Ostaszewski
75
1
0
30 Jul 2022
Annealed Training for Combinatorial Optimization on Graphs
Haoran Sun
E. Guha
H. Dai
164
25
0
23 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
212
16
0
12 Jul 2022
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning
International Conference on Agents and Artificial Intelligence (ICAART), 2022
Dogay Kamar
N. K. Üre
Gözde B. Ünal
77
1
0
28 Jun 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses
Annual Meeting of the Cognitive Science Society (CogSci), 2022
Eliza Kosoy
David M. Chan
Adrian Liu
Jasmine Collins
Bryanna Kaufmann
Sandy Han Huang
Jessica B. Hamrick
John F. Canny
Nan Rosemary Ke
Alison Gopnik
CML
AI4CE
169
19
0
16 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Neural Information Processing Systems (NeurIPS), 2022
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
222
84
0
16 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
282
27
0
08 Jun 2022
Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes
Tetsuro Morimura
Kazuhiro Ota
Kenshi Abe
Peinan Zhang
OffRL
255
0
0
02 Jun 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
81
2
0
31 May 2022
k-Means Maximum Entropy Exploration
Alexander Nedergaard
Matthew Cook
147
15
0
31 May 2022
Off-Beat Multi-Agent Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Wei Qiu
Weixun Wang
Rongpin Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
125
2
0
27 May 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
225
6
0
26 May 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Xinran Liang
Katherine Shu
Kimin Lee
Pieter Abbeel
116
71
0
24 May 2022
Concurrent Credit Assignment for Data-efficient Reinforcement Learning
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Emmanuel Daucé
72
2
0
24 May 2022
An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments
International Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2022
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
147
11
0
23 May 2022
Complex behavior from intrinsic motivation to occupy action-state path space
Nature Communications (Nat Commun), 2022
Jorge Ramírez-Ruiz
D. Grytskyy
Chiara Mastrogiuseppe
Yamen Habib
R. Moreno-Bote
216
14
0
20 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters
ACM Transactions on Graphics (TOG), 2022
Xue Bin Peng
Yunrong Guo
L. Halper
Sergey Levine
Sanja Fidler
162
15
0
04 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Information Fusion (Inf. Fusion), 2022
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
272
479
0
02 May 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Computer Vision and Pattern Recognition (CVPR), 2022
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
303
130
0
07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments
IEEE Robotics and Automation Letters (RA-L), 2022
Yafei Hu
Shaoshu Su
Chen Wang
John Keller
Sebastian Scherer
OffRL
177
15
0
07 Apr 2022
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects
Yujie Lu
Jianren Wang
Vikash Kumar
181
4
0
31 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
International Conference on Machine Learning (ICML), 2022
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
262
133
0
25 Mar 2022
Intrinsically-Motivated Reinforcement Learning: A Brief Introduction
Mingqi Yuan
172
2
0
03 Mar 2022
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
156
1
0
02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
139
11
0
24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Applied Sciences (Appl. Sci.), 2022
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
213
21
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
211
18
0
23 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and Computational Models
CLEaR (CLEaR), 2022
Eliza Kosoy
Adrian Liu
Jasmine Collins
David M. Chan
Jessica B. Hamrick
Nan Rosemary Ke
Sandy H Huang
Bryanna Kaufmann
John F. Canny
Alison Gopnik
CML
166
11
0
21 Feb 2022
Improving Intrinsic Exploration with Language Abstractions
Neural Information Processing Systems (NeurIPS), 2022
Jesse Mu
Victor Zhong
Roberta Raileanu
Minqi Jiang
Noah D. Goodman
Tim Rocktaschel
Edward Grefenstette
228
72
0
17 Feb 2022
Previous
1
2
3
4
5
...
8
9
10
Next