Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.01495
Cited By
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hindsight Experience Replay"
50 / 1,245 papers shown
Title
Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning
Wenhao Ding
Haohong Lin
Yue Liu
Ding Zhao
LRM
28
37
0
19 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
32
31
0
13 Jul 2022
Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning
Jan Achterhold
Markus Krimmel
Joerg Stueckler
40
9
0
11 Jul 2022
Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents
Laetitia Teodorescu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
LLMAG
35
0
0
08 Jul 2022
Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse Management
Julen Cestero
M. Quartulli
Alberto Maria Metelli
Marcello Restelli
OffRL
26
6
0
08 Jul 2022
A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative Object
Shengjie Wang
Yu-wen Cao
Xiang Zheng
Tao Zhang
35
16
0
06 Jul 2022
DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation
Yan Zhao
Ruihai Wu
Zhehuan Chen
Yourong Zhang
Qingnan Fan
Kaichun Mo
Hao Dong
31
14
0
05 Jul 2022
Goal-Conditioned Generators of Deep Policies
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
47
8
0
04 Jul 2022
USHER: Unbiased Sampling for Hindsight Experience Replay
Liam Schramm
Yunfu Deng
Edgar Granados
Abdeslam Boularias
19
4
0
03 Jul 2022
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
63
62
0
30 Jun 2022
Dext-Gen: Dexterous Grasping in Sparse Reward Environments with Full Orientation Control
Martin Schuck
Jan Brüdigam
A. Capone
Stefan Sosnowski
Sandra Hirche
19
1
0
28 Jun 2022
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems
Joe Eappen
Suresh Jagannathan
19
3
0
28 Jun 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
44
22
0
24 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
52
288
0
23 Jun 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
42
4
0
23 Jun 2022
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation
Cansu Sancaktar
Sebastian Blaes
Georg Martius
LM&Ro
28
25
0
22 Jun 2022
Learning Neuro-Symbolic Skills for Bilevel Planning
Tom Silver
Ashay Athalye
J. Tenenbaum
Tomas Lozano-Perez
L. Kaelbling
39
59
0
21 Jun 2022
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer
Jeewon Jeon
Woojun Kim
Whiyoung Jung
Young-Jin Sung
29
35
0
20 Jun 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
53
101
0
19 Jun 2022
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
Brandon Trabucco
Mariano Phielipp
Glen Berseth
34
28
0
17 Jun 2022
Generalised Policy Improvement with Geometric Policy Composition
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
22
7
0
17 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
27
69
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
42
141
0
15 Jun 2022
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning
Nicolas Castanet
Sylvain Lamprier
Olivier Sigaud
25
2
0
14 Jun 2022
Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments
Hugo Caselles-Dupré
Olivier Sigaud
Mohamed Chetouani
9
3
0
09 Jun 2022
Deep Hierarchical Planning from Pixels
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
49
93
0
08 Jun 2022
Discrete State-Action Abstraction via the Successor Representation
A. Attali
Pedro Cisneros-Velarde
M. Morales
Nancy M. Amato
OffRL
36
1
0
07 Jun 2022
Imitating Past Successes can be Very Suboptimal
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
44
16
0
07 Jun 2022
Introspective Experience Replay: Look Back When Surprised
Ramnath Kumar
Dheeraj M. Nagaraj
OffRL
18
2
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via
f
f
f
-Advantage Regression
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
25
53
0
07 Jun 2022
Achieving Goals using Reward Shaping and Curriculum Learning
M. Anca
Jonathan D. Thomas
Dabal Pedamonti
M. Studley
Mark Hansen
12
1
0
06 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
41
25
0
02 Jun 2022
When does return-conditioned supervised learning work for offline reinforcement learning?
David Brandfonbrener
A. Bietti
Jacob Buckman
Romain Laroche
Joan Bruna
OffRL
30
60
0
02 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygó'zd'z
Damian Stachura
Piotr Pikekos
Yuhuai Wu
Lukasz Kuciñski
Piotr Milo's
LRM
23
9
0
01 Jun 2022
Human-AI Shared Control via Policy Dissection
Quanyi Li
Zhenghao Peng
Haibin Wu
Lan Feng
Bolei Zhou
28
13
0
31 May 2022
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems
Pierre Schumacher
Daniel Haeufle
Le Chen
Syn Schmitt
Georg Martius
36
31
0
30 May 2022
Autoformalization with Large Language Models
Yuhuai Wu
Albert Q. Jiang
Wenda Li
M. Rabe
Charles Staats
M. Jamnik
Christian Szegedy
AI4CE
119
161
0
25 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Vladimir Egorov
A. Shpilman
28
22
0
25 May 2022
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
64
57
0
24 May 2022
Task Relabelling for Multi-task Transfer using Successor Features
Martin Balla
Diego Perez-Liebana
19
1
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
204
637
0
20 May 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
35
0
0
20 May 2022
Transformer with Memory Replay
R. Liu
Barzan Mozafari
OffRL
70
4
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
20
14
0
19 May 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
58
29
0
17 May 2022
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments
Jakob Thumm
Matthias Althoff
63
34
0
12 May 2022
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
Archit Sharma
Rehaan Ahmad
Chelsea Finn
OOD
OffRL
34
21
0
11 May 2022
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods
Qing Li
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
OffRL
20
2
0
08 May 2022
Diverse Imitation Learning via Self-Organizing Generative Models
Arash Vahabpour
Tianyi Wang
Qiujing Lu
Omead Brandon Pooladzandi
V. Roychowdhury
SSL
28
1
0
06 May 2022
State Representation Learning for Goal-Conditioned Reinforcement Learning
Lorenzo Steccanella
Anders Jonsson
SSL
OffRL
37
4
0
04 May 2022
Previous
1
2
3
...
10
11
12
...
23
24
25
Next