ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.04717
  4. Cited By
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
v1v2v3 (latest)

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
    OffRL
ArXiv (abs)PDFHTML

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 466 papers shown
Title
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Curiosity in Hindsight: Intrinsic Exploration in Stochastic EnvironmentsInternational Conference on Machine Learning (ICML), 2022
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
189
6
0
18 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLMOffRLLRM
110
9
0
09 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRLOnRLAI4CE
158
30
0
08 Nov 2022
Curiosity-Driven Multi-Agent Exploration with Mixed Objectives
Curiosity-Driven Multi-Agent Exploration with Mixed Objectives
Roben Delos Reyes
Kyunghwan Son
Jinhwan Jung
Wan Ju Kang
Yung Yi
151
5
0
29 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
279
1
0
20 Oct 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward
  Engineering on Sample Complexity
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample ComplexityNeural Information Processing Systems (NeurIPS), 2022
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
189
91
0
18 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human
  Preferences
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
156
9
0
17 Oct 2022
Exploration via Elliptical Episodic Bonuses
Exploration via Elliptical Episodic BonusesNeural Information Processing Systems (NeurIPS), 2022
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
235
52
0
11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
LECO: Learnable Episodic Count for Task-Specific Intrinsic RewardNeural Information Processing Systems (NeurIPS), 2022
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
113
11
0
11 Oct 2022
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Jeongyeol Kwon
Yonathan Efroni
Constantine Caramanis
Shie Mannor
170
5
0
05 Oct 2022
Query The Agent: Improving sample efficiency through epistemic
  uncertainty estimation
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation
Julian Alverio
Boris Katz
Andrei Barbu
177
1
0
05 Oct 2022
Learning Minimally-Violating Continuous Control for Infeasible Linear
  Temporal Logic Specifications
Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic SpecificationsAmerican Control Conference (ACC), 2022
Mingyu Cai
Makai Mann
Zachary Serlin
Kevin J. Leahy
C. Vasile
302
14
0
03 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing
  Plausible Novel States
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel StatesIEEE Conference on Decision and Control (CDC), 2022
C. Banerjee
Zhiyong Chen
N. Noman
148
6
0
01 Oct 2022
An information-theoretic perspective on intrinsic motivation in
  reinforcement learning: a survey
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
214
50
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in
  Reinforcement Learning
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
337
12
0
19 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with
  Linear Reward Shaping
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRLOnRL
159
11
0
15 Sep 2022
Self-supervised Sequential Information Bottleneck for Robust Exploration
  in Deep Reinforcement Learning
Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning
Bang You
Jingming Xie
Youping Chen
Jan Peters
Oleg Arenz
98
3
0
12 Sep 2022
Go-Explore Complex 3D Game Environments for Automated Reachability
  Testing
Go-Explore Complex 3D Game Environments for Automated Reachability TestingIEEE Transactions on Games (IEEE Trans. Games), 2022
Cong Lu
Raluca Georgescu
J. Verwey
131
8
0
01 Sep 2022
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Ju-Seung Byun
Andrew Perrault
OffRL
283
1
0
28 Aug 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A
  Review
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSLOffRL
305
22
0
27 Aug 2022
A Review of Uncertainty for Deep Reinforcement Learning
A Review of Uncertainty for Deep Reinforcement LearningArtificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022
Owen Lockwood
Mei Si
178
65
0
18 Aug 2022
Reinforcement learning with experience replay and adaptation of action
  dispersion
Reinforcement learning with experience replay and adaptation of action dispersion
Pawel Wawrzyñski
Wojciech Masarczyk
M. Ostaszewski
75
1
0
30 Jul 2022
Annealed Training for Combinatorial Optimization on Graphs
Annealed Training for Combinatorial Optimization on Graphs
Haoran Sun
E. Guha
H. Dai
164
25
0
23 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
212
16
0
12 Jul 2022
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement
  Learning
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement LearningInternational Conference on Agents and Artificial Intelligence (ICAART), 2022
Dogay Kamar
N. K. Üre
Gözde B. Ünal
77
1
0
28 Jun 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses
Towards Understanding How Machines Can Learn Causal OverhypothesesAnnual Meeting of the Cognitive Science Society (CogSci), 2022
Eliza Kosoy
David M. Chan
Adrian Liu
Jasmine Collins
Bryanna Kaufmann
Sandy Han Huang
Jessica B. Hamrick
John F. Canny
Nan Rosemary Ke
Alison Gopnik
CMLAI4CE
169
19
0
16 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped PredictionNeural Information Processing Systems (NeurIPS), 2022
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
222
84
0
16 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on
  Exploration and Performance
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
282
27
0
08 Jun 2022
Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov
  Decision Processes
Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes
Tetsuro Morimura
Kazuhiro Ota
Kenshi Abe
Peinan Zhang
OffRL
255
0
0
02 Jun 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
81
2
0
31 May 2022
k-Means Maximum Entropy Exploration
k-Means Maximum Entropy Exploration
Alexander Nedergaard
Matthew Cook
147
15
0
31 May 2022
Off-Beat Multi-Agent Reinforcement Learning
Off-Beat Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Wei Qiu
Weixun Wang
Rongpin Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
125
2
0
27 May 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement
  Learning
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
225
6
0
26 May 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement
  Learning
Reward Uncertainty for Exploration in Preference-based Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Xinran Liang
Katherine Shu
Kimin Lee
Pieter Abbeel
116
71
0
24 May 2022
Concurrent Credit Assignment for Data-efficient Reinforcement Learning
Concurrent Credit Assignment for Data-efficient Reinforcement LearningIEEE International Joint Conference on Neural Network (IJCNN), 2022
Emmanuel Daucé
72
2
0
24 May 2022
An Evaluation Study of Intrinsic Motivation Techniques applied to
  Reinforcement Learning over Hard Exploration Environments
An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration EnvironmentsInternational Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2022
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
147
11
0
23 May 2022
Complex behavior from intrinsic motivation to occupy action-state path
  space
Complex behavior from intrinsic motivation to occupy action-state path spaceNature Communications (Nat Commun), 2022
Jorge Ramírez-Ruiz
D. Grytskyy
Chiara Mastrogiuseppe
Yamen Habib
R. Moreno-Bote
216
14
0
20 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically
  Simulated Characters
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated CharactersACM Transactions on Graphics (TOG), 2022
Xue Bin Peng
Yunrong Guo
L. Halper
Sergey Levine
Sanja Fidler
162
15
0
04 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Exploration in Deep Reinforcement Learning: A SurveyInformation Fusion (Inf. Fusion), 2022
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
272
479
0
02 May 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human
  Demonstrations at Scale
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at ScaleComputer Vision and Pattern Recognition (CVPR), 2022
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
303
130
0
07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in
  Challenging Environments
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging EnvironmentsIEEE Robotics and Automation Letters (RA-L), 2022
Yafei Hu
Shaoshu Su
Chen Wang
John Keller
Sebastian Scherer
OffRL
177
15
0
07 Apr 2022
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects
Yujie Lu
Jianren Wang
Vikash Kumar
181
4
0
31 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from VideosInternational Conference on Machine Learning (ICML), 2022
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSLOnRL
262
133
0
25 Mar 2022
Intrinsically-Motivated Reinforcement Learning: A Brief Introduction
Intrinsically-Motivated Reinforcement Learning: A Brief Introduction
Mingqi Yuan
172
2
0
03 Mar 2022
Follow your Nose: Using General Value Functions for Directed Exploration
  in Reinforcement Learning
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
156
1
0
02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
139
11
0
24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for
  Mapless Navigation in Intralogistics
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in IntralogisticsApplied Sciences (Appl. Sci.), 2022
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
213
21
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
211
18
0
23 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and
  Computational Models
Learning Causal Overhypotheses through Exploration in Children and Computational ModelsCLEaR (CLEaR), 2022
Eliza Kosoy
Adrian Liu
Jasmine Collins
David M. Chan
Jessica B. Hamrick
Nan Rosemary Ke
Sandy H Huang
Bryanna Kaufmann
John F. Canny
Alison Gopnik
CML
166
11
0
21 Feb 2022
Improving Intrinsic Exploration with Language Abstractions
Improving Intrinsic Exploration with Language AbstractionsNeural Information Processing Systems (NeurIPS), 2022
Jesse Mu
Victor Zhong
Roberta Raileanu
Minqi Jiang
Noah D. Goodman
Tim Rocktaschel
Edward Grefenstette
228
72
0
17 Feb 2022
Previous
12345...8910
Next