Exploration by Random Network Distillation

30 October 2018

Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov

Papers citing "Exploration by Random Network Distillation"

Title
When should agents explore? Miruna Pislar David Szepesvari Georg Ostrovski Diana Borsa Tom Schaul 40 22 0 26 Aug 2021
Imitation Learning by Reinforcement Learning K. Ciosek 14 18 0 10 Aug 2021
A Pragmatic Look at Deep Imitation Learning Kai Arulkumaran D. Lillrank 29 9 0 04 Aug 2021
Strategically Efficient Exploration in Competitive Multi-agent Reinforcement Learning R. Loftin Aadirupa Saha Sam Devlin Katja Hofmann 30 5 0 30 Jul 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning Iou-Jen Liu Unnat Jain Raymond A. Yeh A. Schwing 36 104 0 23 Jul 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks Sungryull Sohn Sungtae Lee Jongwook Choi H. V. Seijen Mehdi Fatemi Honglak Lee 108 3 0 13 Jul 2021
Explore and Control with Adversarial Surprise Arnaud Fickinger Natasha Jaques Samyak Parajuli Michael Chang Nicholas Rhinehart Glen Berseth Stuart J. Russell Sergey Levine 34 8 0 12 Jul 2021
Backprop-Free Reinforcement Learning with Active Neural Generative Coding Alexander Ororbia A. Mali 41 15 0 10 Jul 2021
Robust Out-of-Distribution Detection on Deep Probabilistic Generative Models Jaemoo Choi Changyeon Yoon Jeongwoo Bae Myung-joo Kang OODD 30 4 0 15 Jun 2021
Offline Reinforcement Learning as Anti-Exploration Shideh Rezaeifar Robert Dadashi Nino Vieillard Léonard Hussenot Olivier Bachem Olivier Pietquin M. Geist OffRL 34 51 0 11 Jun 2021
Learning Markov State Abstractions for Deep Reinforcement Learning Cameron Allen Neev Parikh Omer Gottesman George Konidaris BDL OffRL 29 35 0 08 Jun 2021
Exploration and preference satisfaction trade-off in reward-free learning Noor Sajid P. Tigas Alexey Zakharov Z. Fountas Karl J. Friston 14 20 0 08 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning Oriol Corcoll Youssef Mohamed Raul Vicente 18 3 0 01 Jun 2021
A brain basis of dynamical intelligence for AI and computational neuroscience J. Monaco Kanaka Rajan Grace M. Hwang AI4CE 26 6 0 15 May 2021
Learning on a Budget via Teacher Imitation Ercüment Ilhan Jeremy Gow Diego Perez-Liebana OffRL 20 2 0 17 Apr 2021
Rapid Exploration for Open-World Navigation with Latent Goal Models Dhruv Shah Benjamin Eysenbach G. Kahn Nicholas Rhinehart Sergey Levine 24 70 0 12 Apr 2021
BR-NS: an Archive-less Approach to Novelty Search Achkan Salehi Alexandre Coninx Stéphane Doncieux 23 6 0 08 Apr 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL Clément Romac Rémy Portelas Katja Hofmann Pierre-Yves Oudeyer 27 21 0 17 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model Thanh Nguyen Tung M. Luu Thang Vu Chang D. Yoo 15 17 0 15 Mar 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning Dilip Arumugam Peter Henderson Pierre-Luc Bacon 22 17 0 10 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning Victor Campos Pablo Sprechmann S. Hansen André Barreto Steven Kapturowski Alex Vitvitskyi Adria Puigdomenech Badia Charles Blundell OffRL OnRL 33 25 0 24 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning William F. Whitney Michael Bloesch Jost Tobias Springenberg A. Abdolmaleki Kyunghyun Cho Martin Riedmiller OffRL 29 13 0 23 Jan 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation OpenAI Matthias Plappert Raul Sampedro Tao Xu Ilge Akkaya ... Hyeonwoo Noh Lilian Weng Qiming Yuan Casey Chu Wojciech Zaremba SSL 79 76 0 13 Jan 2021
Bridging In- and Out-of-distribution Samples for Their Better Discriminability Engkarat Techapanurak Anh-Chuong Dang Takayuki Okatani OODD 25 3 0 07 Jan 2021
Geometric Entropic Exploration Z. Guo M. G. Azar Alaa Saade S. Thakoor Bilal Piot Bernardo Avila-Pires Michal Valko Thomas Mesnard Tor Lattimore Rémi Munos 32 30 0 06 Jan 2021
BeBold: Exploration Beyond the Boundary of Explored Regions Tianjun Zhang Huazhe Xu Xiaolong Wang Yi Wu Kurt Keutzer Joseph E. Gonzalez Yuandong Tian 33 40 0 15 Dec 2020
Meta Automatic Curriculum Learning Rémy Portelas Clément Romac Katja Hofmann Pierre-Yves Oudeyer 29 8 0 16 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with Interaction Karl Schmeckpeper Oleh Rybkin Kostas Daniilidis Sergey Levine Chelsea Finn OffRL 16 105 0 12 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via Reset-Games Kelvin Xu Siddharth Verma Chelsea Finn Sergey Levine CLL 33 33 0 10 Nov 2020
TAMPC: A Controller for Escaping Traps in Novel Environments Sheng Zhong Zhenyuan Zhang Nima Fazeli Dmitry Berenson 15 7 0 23 Oct 2020
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning Shen Ren Qianxiao Li Liye Zhang Zheng Qin Bo Yang 21 0 0 22 Oct 2020
Online Safety Assurance for Deep Reinforcement Learning Noga H. Rotman Michael Schapira Aviv Tamar OffRL 36 5 0 07 Oct 2020
Latent World Models For Intrinsically Motivated Exploration Aleksandr Ermolov N. Sebe 25 25 0 05 Oct 2020
Novelty Search in Representational Space for Sample Efficient Exploration Ruo Yu Tao Vincent François-Lavet Joelle Pineau 30 43 0 28 Sep 2020
Robust and Generalizable Visual Representation Learning via Random Convolutions Zhenlin Xu Deyi Liu Junlin Yang Colin Raffel Marc Niethammer OOD AAML 49 189 0 25 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control Karush Suri Xiaolong Shi Konstantinos N. Plataniotis Y. Lawryshyn 25 4 0 24 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer E. Liu Ramtin Keramati Sudarshan Seshadri Kelvin Guu Panupong Pasupat Emma Brunskill Percy Liang OffRL 19 5 0 12 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate Mirco Mutti Lorenzo Pratissoli Marcello Restelli 11 19 0 09 Jul 2020
Information Theoretic Regret Bounds for Online Nonlinear Control Sham Kakade A. Krishnamurthy Kendall Lowrey Motoya Ohnishi Wen Sun 31 117 0 22 Jun 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals Andres Campero Roberta Raileanu Heinrich Küttler J. Tenenbaum Tim Rocktaschel Edward Grefenstette 38 125 0 22 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration Zhenghao Peng Hao Sun Bolei Zhou 13 18 0 14 Jun 2020
Primal Wasserstein Imitation Learning Robert Dadashi Léonard Hussenot M. Geist Olivier Pietquin 18 124 0 08 Jun 2020
PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals Henry Charlesworth Giovanni Montana OffRL 16 24 0 01 Jun 2020
Novel Policy Seeking with Constrained Optimization Hao Sun Zhenghao Peng Bo Dai Jian Guo Dahua Lin Bolei Zhou 13 13 0 21 May 2020
Planning to Explore via Self-Supervised World Models Ramanan Sekar Oleh Rybkin Kostas Daniilidis Pieter Abbeel Danijar Hafner Deepak Pathak SSL 24 397 0 12 May 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning Z. Guo Bernardo Avila-Pires Bilal Piot Jean-Bastien Grill Florent Altché Rémi Munos M. G. Azar BDL DRL SSL 43 139 0 30 Apr 2020
First return, then explore Adrien Ecoffet Joost Huizinga Joel Lehman Kenneth O. Stanley Jeff Clune 47 349 0 27 Apr 2020
Agent57: Outperforming the Atari Human Benchmark Adria Puigdomenech Badia Bilal Piot Steven Kapturowski Pablo Sprechmann Alex Vitvitskyi Daniel Guo Charles Blundell OffRL 13 509 0 30 Mar 2020
Automatic Curriculum Learning For Deep RL: A Short Survey Rémy Portelas Cédric Colas Lilian Weng Katja Hofmann Pierre-Yves Oudeyer ODL 19 167 0 10 Mar 2020
Scaling MAP-Elites to Deep Neuroevolution Cédric Colas Joost Huizinga Vashisht Madhavan Jeff Clune 33 86 0 03 Mar 2020