Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01868
Cited By
Unifying Count-Based Exploration and Intrinsic Motivation
6 June 2016
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unifying Count-Based Exploration and Intrinsic Motivation"
50 / 350 papers shown
Title
Iteratively Learn Diverse Strategies with State Distance Information
Wei Fu
Weihua Du
Jingwei Li
Sunli Chen
Jingzhao Zhang
Yi Wu
56
3
0
23 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
38
34
0
13 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
36
1
0
12 Oct 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
Woojun Kim
Jeonghye Kim
Young-Jin Sung
28
5
0
05 Oct 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks
Wenke Huang
Filippos Christianos
Zhibin Li
44
8
0
28 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
49
9
0
18 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
47
0
0
08 Sep 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
35
5
0
22 Aug 2023
Neural Categorical Priors for Physics-Based Character Control
Qing Zhu
He Zhang
Mengting Lan
Lei Han
34
32
0
14 Aug 2023
Inferring Hierarchical Structure in Multi-Room Maze Environments
Daria de Tinguy
Toon Van de Maele
Tim Verbelen
Bart Dhoedt
13
0
0
23 Jun 2023
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions
Nishil Patel
Sebastian Lee
Stefano Sarao Mannelli
Sebastian Goldt
Adrew Saxe
OffRL
41
3
0
17 Jun 2023
A Cover Time Study of a non-Markovian Algorithm
Guanhua Fang
G. Samorodnitsky
Zhiqiang Xu
25
0
0
08 Jun 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
30
20
0
29 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
35
0
0
21 May 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li
Dan Qiao
Baoxiang Wang
Xiangfeng Wang
Bo Jin
H. Zha
46
5
0
18 May 2023
MIMEx: Intrinsic Rewards from Masked Input Modeling
Toru Lin
Allan Jabri
OffRL
33
6
0
15 May 2023
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
24
0
0
05 May 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
29
2
0
01 May 2023
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
30
3
0
30 Apr 2023
Affordances from Human Videos as a Versatile Representation for Robotics
Shikhar Bahl
Russell Mendonca
Lili Chen
Unnat Jain
Deepak Pathak
58
164
0
17 Apr 2023
Accelerating exploration and representation learning with offline pre-training
Bogdan Mazoure
Jake Bruce
Doina Precup
Rob Fergus
Ankit Anand
OffRL
41
5
0
31 Mar 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs
Yuan Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
41
8
0
20 Mar 2023
Fast Rates for Maximum Entropy Exploration
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Pierre Perrault
Yunhao Tang
Michal Valko
Pierre Menard
49
18
0
14 Mar 2023
Fast exploration and learning of latent graphs with aliased observations
Miguel Lazaro-Gredilla
Ishani Deshpande
Siva K. Swaminathan
Meet Dave
Dileep George
30
3
0
13 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
41
1
0
09 Mar 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Ji-Yun Oh
Joonkee Kim
Minchan Jeong
Se-Young Yun
38
1
0
03 Mar 2023
Human-Inspired Framework to Accelerate Reinforcement Learning
Ali Beikmohammadi
Sindri Magnússon
OffRL
29
4
0
28 Feb 2023
Failure-aware Policy Learning for Self-assessable Robotics Tasks
Kechun Xu
Runjian Chen
Shuqing Zhao
Zizhang Li
Hongxiang Yu
Ci Chen
Yue Wang
R. Xiong
20
1
0
25 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du
Olivia Watkins
Zihan Wang
Cédric Colas
Trevor Darrell
Pieter Abbeel
Abhishek Gupta
Jacob Andreas
LM&Ro
25
175
0
13 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Layered State Discovery for Incremental Autonomous Exploration
Liyu Chen
Andrea Tirinzoni
A. Lazaric
Matteo Pirotta
39
0
0
07 Feb 2023
Intrinsic Rewards from Self-Organizing Feature Maps for Exploration in Reinforcement Learning
Marius Lindegaard
Hjalmar Jacob Vinje
Odin Severinsen
30
2
0
06 Feb 2023
Hierarchically Composing Level Generators for the Creation of Complex Structures
Michael Beukman
Manuel A. Fokam
Marcel Kruger
Guy Axelrod
Muhammad Umair Nasir
Branden Ingram
Benjamin Rosman
Steven D. James
45
9
0
03 Feb 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization
D. Grytskyy
Jorge Ramírez-Ruiz
R. Moreno-Bote
22
3
0
02 Feb 2023
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
B. Kveton
A. Rangi
OffRL
68
0
0
01 Feb 2023
Sample Efficient Deep Reinforcement Learning via Local Planning
Dong Yin
S. Thiagarajan
N. Lazić
Nived Rajaraman
Botao Hao
Csaba Szepesvári
30
4
0
29 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
26
19
0
26 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
32
8
0
26 Jan 2023
PushWorld: A benchmark for manipulation planning with tools and movable obstacles
Ken Kansky
Skanda Vaidyanath
Scott Swingle
Xinghua Lou
Miguel Lazaro-Gredilla
Dileep George
31
4
0
24 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
Self-Motivated Multi-Agent Exploration
Shaowei Zhang
Jiahan Cao
Lei Yuan
Yang Yu
De-Chuan Zhan
52
5
0
05 Jan 2023
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
39
14
0
24 Dec 2022
Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have
Nadia M. Ady
R. Shariff
J. Günther
P. Pilarski
19
0
0
01 Dec 2022
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan
Xin Jin
Bo Li
Wenjun Zeng
33
1
0
28 Nov 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
31
1
0
28 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
31
22
0
23 Nov 2022
Efficient Exploration using Model-Based Quality-Diversity with Gradients
Bryan Lim
Manon Flageat
Antoine Cully
23
4
0
22 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
Joni Pajarinen
Pulkit Agrawal
OnRL
36
24
0
14 Nov 2022
Previous
1
2
3
4
5
6
7
Next