1707.01495
Cited By
v1
v2
v3 (latest)
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing "Hindsight Experience Replay"
50 / 1,340 papers shown
A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative Object
Aerospace Science and Technology (AST), 2022
Shengjie Wang
Yu-wen Cao
Xiang Zheng
Tao Zhang
225
27
0
06 Jul 2022
DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation
International Conference on Learning Representations (ICLR), 2022
Yan Zhao
Kai Cheng
Zhehuan Chen
Yourong Zhang
Qingnan Fan
Kaichun Mo
Hao Dong
469
24
0
05 Jul 2022
Goal-Conditioned Generators of Deep Policies
AAAI Conference on Artificial Intelligence (AAAI), 2022
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
165
10
0
04 Jul 2022
USHER: Unbiased Sampling for Hindsight Experience Replay
Conference on Robot Learning (CoRL), 2022
Liam Schramm
Yunfu Deng
Edgar Granados
Abdeslam Boularias
90
6
0
03 Jul 2022
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Conference on Robot Learning (CoRL), 2022
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
307
85
0
30 Jun 2022
Dext-Gen: Dexterous Grasping in Sparse Reward Environments with Full Orientation Control
Martin Schuck
Jan Brüdigam
A. Capone
Roland Toth
Sandra Hirche
229
1
0
28 Jun 2022
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems
Joe Eappen
Suresh Jagannathan
166
4
0
28 Jun 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
213
24
0
24 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Neural Information Processing Systems (NeurIPS), 2022
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
499
368
0
23 Jun 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
236
4
0
23 Jun 2022
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation
Neural Information Processing Systems (NeurIPS), 2022
Cansu Sancaktar
Sebastian Blaes
Georg Martius
LM&Ro
406
35
0
22 Jun 2022
Learning Neuro-Symbolic Skills for Bilevel Planning
Conference on Robot Learning (CoRL), 2022
Tom Silver
Ashay Athalye
J. Tenenbaum
Tomas Lozano-Perez
L. Kaelbling
279
83
0
21 Jun 2022
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer
International Conference on Machine Learning (ICML), 2022
Jeewon Jeon
Woojun Kim
Whiyoung Jung
Young-Jin Sung
185
49
0
20 Jun 2022
A Survey on Model-based Reinforcement Learning
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
347
153
0
19 Jun 2022
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
International Conference on Machine Learning (ICML), 2022
Brandon Trabucco
Mariano Phielipp
Glen Berseth
177
35
0
17 Jun 2022
Generalised Policy Improvement with Geometric Policy Composition
International Conference on Machine Learning (ICML), 2022
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
186
10
0
17 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Neural Information Processing Systems (NeurIPS), 2022
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
284
87
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
392
213
0
15 Jun 2022
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Nicolas Castanet
Sylvain Lamprier
Olivier Sigaud
263
4
0
14 Jun 2022
Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments
Neural Information Processing Systems (NeurIPS), 2022
Hugo Caselles-Dupré
Olivier Sigaud
Mohamed Chetouani
219
3
0
09 Jun 2022
Deep Hierarchical Planning from Pixels
Neural Information Processing Systems (NeurIPS), 2022
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
228
119
0
08 Jun 2022
Discrete State-Action Abstraction via the Successor Representation
A. Attali
Pedro Cisneros-Velarde
M. Morales
Nancy M. Amato
OffRL
189
1
0
07 Jun 2022
Imitating Past Successes can be Very Suboptimal
Neural Information Processing Systems (NeurIPS), 2022
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
265
24
0
07 Jun 2022
Introspective Experience Replay: Look Back When Surprised
Ramnath Kumar
Dheeraj M. Nagaraj
OffRL
313
3
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression
Neural Information Processing Systems (NeurIPS), 2022
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
280
73
0
07 Jun 2022
Achieving Goals using Reward Shaping and Curriculum Learning
Future Technologies Conference (FTC), 2022
M. Anca
Jonathan D. Thomas
Dabal Pedamonti
M. Studley
Mark Hansen
195
2
0
06 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
244
34
0
02 Jun 2022
When does return-conditioned supervised learning work for offline reinforcement learning?
Neural Information Processing Systems (NeurIPS), 2022
David Brandfonbrener
A. Bietti
Jacob Buckman
Romain Laroche
Joan Bruna
OffRL
261
86
0
02 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
International Conference on Learning Representations (ICLR), 2022
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygóźdź
Damian Stachura
Piotr Piękos
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
LRM
568
13
0
01 Jun 2022
Human-AI Shared Control via Policy Dissection
Neural Information Processing Systems (NeurIPS), 2022
Quanyi Li
Zhenghao Peng
Haibin Wu
Lan Feng
Bolei Zhou
385
14
0
31 May 2022
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems
International Conference on Learning Representations (ICLR), 2022
Pierre Schumacher
Daniel Haeufle
Le Chen
Syn Schmitt
Georg Martius
232
46
0
30 May 2022
Autoformalization with Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Yuhuai Wu
Albert Q. Jiang
Wenda Li
M. Rabe
Charles Staats
M. Jamnik
Christian Szegedy
AI4CE
431
235
0
25 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Vladimir Egorov
A. Shpilman
192
41
0
25 May 2022
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
IEEE Robotics and Automation Letters (RA-L), 2022
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
292
69
0
24 May 2022
Task Relabelling for Multi-task Transfer using Successor Features
Martin Balla
Diego Perez-Liebana
131
2
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
International Conference on Machine Learning (ICML), 2022
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
1.0K
986
0
20 May 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
153
0
0
20 May 2022
Transformer with Memory Replay
AAAI Conference on Artificial Intelligence (AAAI), 2022
R. Liu
Barzan Mozafari
OffRL
322
5
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
312
19
0
19 May 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
252
40
0
17 May 2022
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments
IEEE International Conference on Robotics and Automation (ICRA), 2022
Jakob Thumm
Matthias Althoff
273
42
0
12 May 2022
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Archit Sharma
Rehaan Ahmad
Chelsea Finn
OOD
OffRL
178
22
0
11 May 2022
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods
Qing Li
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
OffRL
111
4
0
08 May 2022
Diverse Imitation Learning via Self-Organizing Generative Models
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Arash Vahabpour
Tianyi Wang
Qiujing Lu
Omead Brandon Pooladzandi
V. Roychowdhury
SSL
201
4
0
06 May 2022
State Representation Learning for Goal-Conditioned Reinforcement Learning
Lorenzo Steccanella
Anders Jonsson
SSL
OffRL
180
8
0
04 May 2022
Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery
IEEE Robotics and Automation Letters (RA-L), 2022
Daesol Cho
Jigang Kim
H. J. Kim
OffRL
SSL
206
19
0
29 Apr 2022
Bilinear value networks
Zhang-Wei Hong
Ge Yang
Pulkit Agrawal
OffRL
278
10
0
28 Apr 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
310
38
0
27 Apr 2022
Relational Abstractions for Generalized Reinforcement Learning on Symbolic Problems
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Rushang Karia
Siddharth Srivastava
NAI
OffRL
136
15
0
27 Apr 2022
Executive Function: A Contrastive Value Policy for Resampling and Relabeling Perceptions via Hindsight Summarization?
Christopher T. Lengerich
Ben Lengerich
141
1
0
27 Apr 2022