Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.12894
Cited By
Exploration by Random Network Distillation
30 October 2018
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploration by Random Network Distillation"
50 / 277 papers shown
Title
When should agents explore?
Miruna Pislar
David Szepesvari
Georg Ostrovski
Diana Borsa
Tom Schaul
40
22
0
26 Aug 2021
Imitation Learning by Reinforcement Learning
K. Ciosek
14
18
0
10 Aug 2021
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
29
9
0
04 Aug 2021
Strategically Efficient Exploration in Competitive Multi-agent Reinforcement Learning
R. Loftin
Aadirupa Saha
Sam Devlin
Katja Hofmann
30
5
0
30 Jul 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Iou-Jen Liu
Unnat Jain
Raymond A. Yeh
A. Schwing
36
104
0
23 Jul 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
108
3
0
13 Jul 2021
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
34
8
0
12 Jul 2021
Backprop-Free Reinforcement Learning with Active Neural Generative Coding
Alexander Ororbia
A. Mali
41
15
0
10 Jul 2021
Robust Out-of-Distribution Detection on Deep Probabilistic Generative Models
Jaemoo Choi
Changyeon Yoon
Jeongwoo Bae
Myung-joo Kang
OODD
30
4
0
15 Jun 2021
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
M. Geist
OffRL
34
51
0
11 Jun 2021
Learning Markov State Abstractions for Deep Reinforcement Learning
Cameron Allen
Neev Parikh
Omer Gottesman
George Konidaris
BDL
OffRL
29
35
0
08 Jun 2021
Exploration and preference satisfaction trade-off in reward-free learning
Noor Sajid
P. Tigas
Alexey Zakharov
Z. Fountas
Karl J. Friston
14
20
0
08 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
18
3
0
01 Jun 2021
A brain basis of dynamical intelligence for AI and computational neuroscience
J. Monaco
Kanaka Rajan
Grace M. Hwang
AI4CE
26
6
0
15 May 2021
Learning on a Budget via Teacher Imitation
Ercüment Ilhan
Jeremy Gow
Diego Perez-Liebana
OffRL
20
2
0
17 Apr 2021
Rapid Exploration for Open-World Navigation with Latent Goal Models
Dhruv Shah
Benjamin Eysenbach
G. Kahn
Nicholas Rhinehart
Sergey Levine
24
70
0
12 Apr 2021
BR-NS: an Archive-less Approach to Novelty Search
Achkan Salehi
Alexandre Coninx
Stéphane Doncieux
23
6
0
08 Apr 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
27
21
0
17 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
15
17
0
15 Mar 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
22
17
0
10 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
S. Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
33
25
0
24 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
79
76
0
13 Jan 2021
Bridging In- and Out-of-distribution Samples for Their Better Discriminability
Engkarat Techapanurak
Anh-Chuong Dang
Takayuki Okatani
OODD
25
3
0
07 Jan 2021
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
32
30
0
06 Jan 2021
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
33
40
0
15 Dec 2020
Meta Automatic Curriculum Learning
Rémy Portelas
Clément Romac
Katja Hofmann
Pierre-Yves Oudeyer
29
8
0
16 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with Interaction
Karl Schmeckpeper
Oleh Rybkin
Kostas Daniilidis
Sergey Levine
Chelsea Finn
OffRL
16
105
0
12 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
33
33
0
10 Nov 2020
TAMPC: A Controller for Escaping Traps in Novel Environments
Sheng Zhong
Zhenyuan Zhang
Nima Fazeli
Dmitry Berenson
15
7
0
23 Oct 2020
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning
Shen Ren
Qianxiao Li
Liye Zhang
Zheng Qin
Bo Yang
21
0
0
22 Oct 2020
Online Safety Assurance for Deep Reinforcement Learning
Noga H. Rotman
Michael Schapira
Aviv Tamar
OffRL
36
5
0
07 Oct 2020
Latent World Models For Intrinsically Motivated Exploration
Aleksandr Ermolov
N. Sebe
25
25
0
05 Oct 2020
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
30
43
0
28 Sep 2020
Robust and Generalizable Visual Representation Learning via Random Convolutions
Zhenlin Xu
Deyi Liu
Junlin Yang
Colin Raffel
Marc Niethammer
OOD
AAML
49
189
0
25 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
25
4
0
24 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
E. Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
19
5
0
12 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
11
19
0
09 Jul 2020
Information Theoretic Regret Bounds for Online Nonlinear Control
Sham Kakade
A. Krishnamurthy
Kendall Lowrey
Motoya Ohnishi
Wen Sun
31
117
0
22 Jun 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktaschel
Edward Grefenstette
38
125
0
22 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
13
18
0
14 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
M. Geist
Olivier Pietquin
18
124
0
08 Jun 2020
PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals
Henry Charlesworth
Giovanni Montana
OffRL
16
24
0
01 Jun 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
13
13
0
21 May 2020
Planning to Explore via Self-Supervised World Models
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
SSL
24
397
0
12 May 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
43
139
0
30 Apr 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
349
0
27 Apr 2020
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
13
509
0
30 Mar 2020
Automatic Curriculum Learning For Deep RL: A Short Survey
Rémy Portelas
Cédric Colas
Lilian Weng
Katja Hofmann
Pierre-Yves Oudeyer
ODL
19
167
0
10 Mar 2020
Scaling MAP-Elites to Deep Neuroevolution
Cédric Colas
Joost Huizinga
Vashisht Madhavan
Jeff Clune
33
86
0
03 Mar 2020
Previous
1
2
3
4
5
6
Next