Planning to Explore via Self-Supervised World Models

12 May 2020

Pieter Abbeel

Papers citing "Planning to Explore via Self-Supervised World Models"

36 / 86 papers shown

Reward Uncertainty for Exploration in Preference-based Reinforcement Learning Xinran Liang Katherine Shu Kimin Lee Pieter Abbeel 16 58 0 24 May 2022
A Survey of Traversability Estimation for Mobile Robots Christos Sevastopoulos S. Konstantopoulos 38 34 0 22 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained Representations Allison C. Tam Neil C. Rabinowitz Andrew Kyle Lampinen Nicholas A. Roy Stephanie C. Y. Chan D. Strouse Jane X. Wang Andrea Banino Felix Hill LM&Ro 30 67 0 08 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos Younggyo Seo Kimin Lee Stephen James Pieter Abbeel SSL OnRL 18 116 0 25 Mar 2022
Temporal Difference Learning for Model Predictive Control Nicklas Hansen Xiaolong Wang H. Su PINN MU 36 220 0 09 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning Chenjia Bai Lingxiao Wang Zhuoran Yang Zhihong Deng Animesh Garg Peng Liu Zhaoran Wang OffRL 26 132 0 23 Feb 2022
TransDreamer: Reinforcement Learning with Transformer World Models Changgu Chen Yi-Fu Wu Jaesik Yoon Sungjin Ahn OffRL 32 90 0 19 Feb 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach Xuezhou Zhang Yuda Song Masatoshi Uehara Mengdi Wang Alekh Agarwal Wen Sun OffRL 24 57 0 31 Jan 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning Haichao Zhang Wei-ping Xu Haonan Yu 25 10 0 24 Jan 2022
Physical Derivatives: Computing policy gradients by physical forward-propagation Arash Mehrjou Ashkan Soleymani Stefan Bauer Bernhard Schölkopf 25 0 0 15 Jan 2022
Smooth Model Predictive Path Integral Control without Smoothing Taekyung Kim Gyuhyun Park K. Kwak Jihwan Bae Wonsuk Lee 24 38 0 18 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty Angelos Filos Eszter Vértes Zita Marinho Gregory Farquhar Diana Borsa A. Friesen Feryal M. P. Behbahani Tom Schaul André Barreto Simon Osindero 44 7 0 08 Dec 2021
Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics Ingmar Schubert Danny Driess Ozgur S. Oguz Marc Toussaint OffRL 20 1 0 15 Nov 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning Rujikorn Charakorn P. Manoonpong Nat Dilokthanakul 25 5 0 05 Nov 2021
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives Murtaza Dalal Deepak Pathak Ruslan Salakhutdinov 21 90 0 28 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching Pierre-Alexandre Kamienny Jean Tarbouriech Sylvain Lamprier A. Lazaric Ludovic Denoyer SSL 36 18 0 27 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task Chuang Gan Abhishek Bhandwaldar Antonio Torralba J. Tenenbaum Phillip Isola LRM 133 4 0 13 Oct 2021
Neural Algorithmic Reasoners are Implicit Planners Andreea Deac Petar Veličković Ognjen Milinković Pierre-Luc Bacon Jian Tang Mladen Nikolic OffRL 32 23 0 11 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning Benjamin Eysenbach Ruslan Salakhutdinov Sergey Levine SSL OffRL 53 31 0 06 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks Robert McCarthy Qiang Wang S. Redmond OffRL 27 15 0 05 Oct 2021
Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration Oliver Groth Markus Wulfmeier Giulia Vezzani Vibhavari Dasagi Tim Hertweck Roland Hafner N. Heess Martin Riedmiller LRM 35 20 0 17 Sep 2021
Benchmarking the Spectrum of Agent Capabilities Danijar Hafner ELM 22 126 0 14 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 30 92 0 14 Sep 2021
Backprop-Free Reinforcement Learning with Active Neural Generative Coding Alexander Ororbia A. Mali 28 15 0 10 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation Nicklas Hansen H. Su Xiaolong Wang OffRL 20 133 0 01 Jul 2021
Learning to Map for Active Semantic Goal Navigation G. Georgakis Bernadette Bucher Karl Schmeckpeper Siddharth Singh Kostas Daniilidis 25 73 0 29 Jun 2021
Behavior From the Void: Unsupervised Active Pre-Training Hao Liu Pieter Abbeel VLM SSL 34 195 0 08 Mar 2021
Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design Adam Foster Desi R. Ivanova Ilyas Malik Tom Rainforth 26 78 0 03 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning Victor Campos Pablo Sprechmann S. Hansen André Barreto Steven Kapturowski Alex Vitvitskyi Adria Puigdomenech Badia Charles Blundell OffRL OnRL 28 26 0 24 Feb 2021
Online Safety Assurance for Deep Reinforcement Learning Noga H. Rotman Michael Schapira Aviv Tamar OffRL 36 5 0 07 Oct 2020
Latent World Models For Intrinsically Motivated Exploration Aleksandr Ermolov N. Sebe 23 24 0 05 Oct 2020
Mastering Atari with Discrete World Models Danijar Hafner Timothy Lillicrap Mohammad Norouzi Jimmy Ba DRL 33 809 0 05 Oct 2020
Self-Supervised Policy Adaptation during Deployment Nicklas Hansen Rishabh Jangir Yu Sun Guillem Alenyà Pieter Abbeel Alexei A. Efros Lerrel Pinto Xiaolong Wang 30 159 0 08 Jul 2020
A Unifying Framework for Reinforcement Learning and Planning Thomas M. Moerland Joost Broekens Aske Plaat Catholijn M. Jonker OffRL 15 9 0 26 Jun 2020
Deep Dynamics Models for Learning Dexterous Manipulation Anusha Nagabandi K. Konolige Sergey Levine Vikash Kumar 148 407 0 25 Sep 2019
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles Balaji Lakshminarayanan Alexander Pritzel Charles Blundell UQCV BDL 273 5,660 0 05 Dec 2016