ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.05960
  4. Cited By
Planning to Explore via Self-Supervised World Models

Planning to Explore via Self-Supervised World Models

12 May 2020
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
    SSL
ArXivPDFHTML

Papers citing "Planning to Explore via Self-Supervised World Models"

36 / 86 papers shown
Title
Reward Uncertainty for Exploration in Preference-based Reinforcement
  Learning
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
Xinran Liang
Katherine Shu
Kimin Lee
Pieter Abbeel
16
58
0
24 May 2022
A Survey of Traversability Estimation for Mobile Robots
A Survey of Traversability Estimation for Mobile Robots
Christos Sevastopoulos
S. Konstantopoulos
38
34
0
22 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained
  Representations
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
30
67
0
08 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
116
0
25 Mar 2022
Temporal Difference Learning for Model Predictive Control
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
36
220
0
09 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement
  Learning
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
26
132
0
23 Feb 2022
TransDreamer: Reinforcement Learning with Transformer World Models
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
32
90
0
19 Feb 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free
  Representation Learning Approach
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Xuezhou Zhang
Yuda Song
Masatoshi Uehara
Mengdi Wang
Alekh Agarwal
Wen Sun
OffRL
24
57
0
31 Jan 2022
Generative Planning for Temporally Coordinated Exploration in
  Reinforcement Learning
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei-ping Xu
Haonan Yu
25
10
0
24 Jan 2022
Physical Derivatives: Computing policy gradients by physical
  forward-propagation
Physical Derivatives: Computing policy gradients by physical forward-propagation
Arash Mehrjou
Ashkan Soleymani
Stefan Bauer
Bernhard Schölkopf
25
0
0
15 Jan 2022
Smooth Model Predictive Path Integral Control without Smoothing
Smooth Model Predictive Path Integral Control without Smoothing
Taekyung Kim
Gyuhyun Park
K. Kwak
Jihwan Bae
Wonsuk Lee
24
38
0
18 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Learning to Execute: Efficient Learning of Universal Plan-Conditioned
  Policies in Robotics
Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics
Ingmar Schubert
Danny Driess
Ozgur S. Oguz
Marc Toussaint
OffRL
20
1
0
15 Nov 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Rujikorn Charakorn
P. Manoonpong
Nat Dilokthanakul
25
5
0
05 Nov 2021
Accelerating Robotic Reinforcement Learning via Parameterized Action
  Primitives
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives
Murtaza Dalal
Deepak Pathak
Ruslan Salakhutdinov
21
90
0
28 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State
  Covering and Goal Reaching
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
36
18
0
27 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task
OPEn: An Open-ended Physics Environment for Learning Without a Task
Chuang Gan
Abhishek Bhandwaldar
Antonio Torralba
J. Tenenbaum
Phillip Isola
LRM
133
4
0
13 Oct 2021
Neural Algorithmic Reasoners are Implicit Planners
Neural Algorithmic Reasoners are Implicit Planners
Andreea Deac
Petar Velivcković
Ognjen Milinković
Pierre-Luc Bacon
Jian Tang
Mladen Nikolic
OffRL
32
23
0
11 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning
The Information Geometry of Unsupervised Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
53
31
0
06 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for
  Sparse Reward Tasks
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
27
15
0
05 Oct 2021
Is Curiosity All You Need? On the Utility of Emergent Behaviours from
  Curious Exploration
Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Oliver Groth
Markus Wulfmeier
Giulia Vezzani
Vibhavari Dasagi
Tim Hertweck
Roland Hafner
N. Heess
Martin Riedmiller
LRM
35
20
0
17 Sep 2021
Benchmarking the Spectrum of Agent Capabilities
Benchmarking the Spectrum of Agent Capabilities
Danijar Hafner
ELM
22
126
0
14 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
30
92
0
14 Sep 2021
Backprop-Free Reinforcement Learning with Active Neural Generative
  Coding
Backprop-Free Reinforcement Learning with Active Neural Generative Coding
Alexander Ororbia
A. Mali
28
15
0
10 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under
  Data Augmentation
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
20
133
0
01 Jul 2021
Learning to Map for Active Semantic Goal Navigation
Learning to Map for Active Semantic Goal Navigation
G. Georgakis
Bernadette Bucher
Karl Schmeckpeper
Siddharth Singh
Kostas Daniilidis
25
73
0
29 Jun 2021
Behavior From the Void: Unsupervised Active Pre-Training
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
34
195
0
08 Mar 2021
Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design
Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design
Adam Foster
Desi R. Ivanova
Ilyas Malik
Tom Rainforth
26
78
0
03 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
S. Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
28
26
0
24 Feb 2021
Online Safety Assurance for Deep Reinforcement Learning
Online Safety Assurance for Deep Reinforcement Learning
Noga H. Rotman
Michael Schapira
Aviv Tamar
OffRL
36
5
0
07 Oct 2020
Latent World Models For Intrinsically Motivated Exploration
Latent World Models For Intrinsically Motivated Exploration
Aleksandr Ermolov
N. Sebe
23
24
0
05 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
33
809
0
05 Oct 2020
Self-Supervised Policy Adaptation during Deployment
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
30
159
0
08 Jul 2020
A Unifying Framework for Reinforcement Learning and Planning
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
15
9
0
26 Jun 2020
Deep Dynamics Models for Learning Dexterous Manipulation
Deep Dynamics Models for Learning Dexterous Manipulation
Anusha Nagabandi
K. Konolige
Sergey Levine
Vikash Kumar
148
407
0
25 Sep 2019
Simple and Scalable Predictive Uncertainty Estimation using Deep
  Ensembles
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
273
5,660
0
05 Dec 2016
Previous
12