ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,340 papers shown
A Learning System for Motion Planning of Free-Float Dual-Arm Space
  Manipulator towards Non-Cooperative Object
A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative ObjectAerospace Science and Technology (AST), 2022
Shengjie Wang
Yu-wen Cao
Xiang Zheng
Tao Zhang
225
27
0
06 Jul 2022
DualAfford: Learning Collaborative Visual Affordance for Dual-gripper
  Manipulation
DualAfford: Learning Collaborative Visual Affordance for Dual-gripper ManipulationInternational Conference on Learning Representations (ICLR), 2022
Yan Zhao
Kai Cheng
Zhehuan Chen
Yourong Zhang
Qingnan Fan
Kaichun Mo
Hao Dong
469
24
0
05 Jul 2022
Goal-Conditioned Generators of Deep Policies
Goal-Conditioned Generators of Deep PoliciesAAAI Conference on Artificial Intelligence (AAAI), 2022
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
165
10
0
04 Jul 2022
USHER: Unbiased Sampling for Hindsight Experience Replay
USHER: Unbiased Sampling for Hindsight Experience ReplayConference on Robot Learning (CoRL), 2022
Liam Schramm
Yunfu Deng
Edgar Granados
Abdeslam Boularias
90
6
0
03 Jul 2022
Watch and Match: Supercharging Imitation with Regularized Optimal
  Transport
Watch and Match: Supercharging Imitation with Regularized Optimal TransportConference on Robot Learning (CoRL), 2022
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
307
85
0
30 Jun 2022
Dext-Gen: Dexterous Grasping in Sparse Reward Environments with Full
  Orientation Control
Dext-Gen: Dexterous Grasping in Sparse Reward Environments with Full Orientation Control
Martin Schuck
Jan Brüdigam
A. Capone
Roland Toth
Sandra Hirche
229
1
0
28 Jun 2022
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement
  Learning Systems
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems
Joe Eappen
Suresh Jagannathan
166
4
0
28 Jun 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
213
24
0
24 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online VideosNeural Information Processing Systems (NeurIPS), 2022
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
499
368
0
23 Jun 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without
  Supervision
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
236
4
0
23 Jun 2022
Curious Exploration via Structured World Models Yields Zero-Shot Object
  Manipulation
Curious Exploration via Structured World Models Yields Zero-Shot Object ManipulationNeural Information Processing Systems (NeurIPS), 2022
Cansu Sancaktar
Sebastian Blaes
Georg Martius
LM&Ro
406
35
0
22 Jun 2022
Learning Neuro-Symbolic Skills for Bilevel Planning
Learning Neuro-Symbolic Skills for Bilevel PlanningConference on Robot Learning (CoRL), 2022
Tom Silver
Ashay Athalye
J. Tenenbaum
Tomas Lozano-Perez
L. Kaelbling
279
83
0
21 Jun 2022
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from
  Experience Replay Buffer
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay BufferInternational Conference on Machine Learning (ICML), 2022
Jeewon Jeon
Woojun Kim
Whiyoung Jung
Young-Jin Sung
185
49
0
20 Jun 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement LearningScience China Information Sciences (Sci. China Inf. Sci.), 2022
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRLLRM
347
153
0
19 Jun 2022
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
AnyMorph: Learning Transferable Polices By Inferring Agent MorphologyInternational Conference on Machine Learning (ICML), 2022
Brandon Trabucco
Mariano Phielipp
Glen Berseth
177
35
0
17 Jun 2022
Generalised Policy Improvement with Geometric Policy Composition
Generalised Policy Improvement with Geometric Policy CompositionInternational Conference on Machine Learning (ICML), 2022
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
186
10
0
17 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped PredictionNeural Information Processing Systems (NeurIPS), 2022
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
284
87
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Contrastive Learning as Goal-Conditioned Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSLOffRL
392
213
0
15 Jun 2022
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal
  Reinforcement Learning
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Nicolas Castanet
Sylvain Lamprier
Olivier Sigaud
263
4
0
14 Jun 2022
Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal
  Environments
Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal EnvironmentsNeural Information Processing Systems (NeurIPS), 2022
Hugo Caselles-Dupré
Olivier Sigaud
Mohamed Chetouani
219
3
0
09 Jun 2022
Deep Hierarchical Planning from Pixels
Deep Hierarchical Planning from PixelsNeural Information Processing Systems (NeurIPS), 2022
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
228
119
0
08 Jun 2022
Discrete State-Action Abstraction via the Successor Representation
Discrete State-Action Abstraction via the Successor Representation
A. Attali
Pedro Cisneros-Velarde
M. Morales
Nancy M. Amato
OffRL
189
1
0
07 Jun 2022
Imitating Past Successes can be Very Suboptimal
Imitating Past Successes can be Very SuboptimalNeural Information Processing Systems (NeurIPS), 2022
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
265
24
0
07 Jun 2022
Introspective Experience Replay: Look Back When Surprised
Introspective Experience Replay: Look Back When Surprised
Ramnath Kumar
Dheeraj M. Nagaraj
OffRL
313
3
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via
  $f$-Advantage Regression
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via fff-Advantage RegressionNeural Information Processing Systems (NeurIPS), 2022
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
280
73
0
07 Jun 2022
Achieving Goals using Reward Shaping and Curriculum Learning
Achieving Goals using Reward Shaping and Curriculum LearningFuture Technologies Conference (FT), 2022
M. Anca
Jonathan D. Thomas
Dabal Pedamonti
M. Studley
Mark Hansen
195
2
0
06 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
244
34
0
02 Jun 2022
When does return-conditioned supervised learning work for offline
  reinforcement learning?
When does return-conditioned supervised learning work for offline reinforcement learning?Neural Information Processing Systems (NeurIPS), 2022
David Brandfonbrener
A. Bietti
Jacob Buckman
Romain Laroche
Joan Bruna
OffRL
261
86
0
02 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal
  Search
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal SearchInternational Conference on Learning Representations (ICLR), 2022
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygó'zd'z
Damian Stachura
Piotr Pikekos
Yuhuai Wu
Lukasz Kuciñski
Piotr Milo's
LRM
568
13
0
01 Jun 2022
Human-AI Shared Control via Policy Dissection
Human-AI Shared Control via Policy DissectionNeural Information Processing Systems (NeurIPS), 2022
Quanyi Li
Zhenghao Peng
Haibin Wu
Lan Feng
Bolei Zhou
385
14
0
31 May 2022
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated
  and Musculoskeletal Systems
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal SystemsInternational Conference on Learning Representations (ICLR), 2022
Pierre Schumacher
Daniel Haeufle
Le Chen
Syn Schmitt
Georg Martius
232
46
0
30 May 2022
Autoformalization with Large Language Models
Autoformalization with Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Yuhuai Wu
Albert Q. Jiang
Wenda Li
M. Rabe
Charles Staats
M. Jamnik
Christian Szegedy
AI4CE
431
235
0
25 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Scalable Multi-Agent Model-Based Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Vladimir Egorov
A. Shpilman
192
41
0
25 May 2022
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement
  Learning
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement LearningIEEE Robotics and Automation Letters (RA-L), 2022
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
292
69
0
24 May 2022
Task Relabelling for Multi-task Transfer using Successor Features
Task Relabelling for Multi-task Transfer using Successor Features
Martin Balla
Diego Perez-Liebana
131
2
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Planning with Diffusion for Flexible Behavior SynthesisInternational Conference on Machine Learning (ICML), 2022
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
1.0K
986
0
20 May 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned
  Reinforcement Learning
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
153
0
0
20 May 2022
Transformer with Memory Replay
Transformer with Memory ReplayAAAI Conference on Artificial Intelligence (AAAI), 2022
R. Liu
Barzan Mozafari
OffRL
322
5
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and
  Knowledge Transfer for Complex Sparse Reward-based Tasks
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
312
19
0
19 May 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in
  Latent Space
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent SpaceIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
252
40
0
17 May 2022
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in
  Human Environments
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human EnvironmentsIEEE International Conference on Robotics and Automation (ICRA), 2022
Jakob Thumm
Matthias Althoff
273
42
0
12 May 2022
A State-Distribution Matching Approach to Non-Episodic Reinforcement
  Learning
A State-Distribution Matching Approach to Non-Episodic Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Archit Sharma
Rehaan Ahmad
Chelsea Finn
OODOffRL
178
22
0
11 May 2022
Simultaneous Double Q-learning with Conservative Advantage Learning for
  Actor-Critic Methods
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods
Qing Li
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
OffRL
111
4
0
08 May 2022
Diverse Imitation Learning via Self-Organizing Generative Models
Diverse Imitation Learning via Self-Organizing Generative ModelsIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Arash Vahabpour
Tianyi Wang
Qiujing Lu
Omead Brandon Pooladzandi
V. Roychowdhury
SSL
201
4
0
06 May 2022
State Representation Learning for Goal-Conditioned Reinforcement
  Learning
State Representation Learning for Goal-Conditioned Reinforcement Learning
Lorenzo Steccanella
Anders Jonsson
SSLOffRL
180
8
0
04 May 2022
Unsupervised Reinforcement Learning for Transferable Manipulation Skill
  Discovery
Unsupervised Reinforcement Learning for Transferable Manipulation Skill DiscoveryIEEE Robotics and Automation Letters (RA-L), 2022
Daesol Cho
Jigang Kim
H. J. Kim
OffRLSSL
206
19
0
29 Apr 2022
Bilinear value networks
Bilinear value networks
Zhang-Wei Hong
Ge Yang
Pulkit Agrawal
OffRL
278
10
0
28 Apr 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
310
38
0
27 Apr 2022
Relational Abstractions for Generalized Reinforcement Learning on
  Symbolic Problems
Relational Abstractions for Generalized Reinforcement Learning on Symbolic ProblemsInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Rushang Karia
Siddharth Srivastava
NAIOffRL
136
15
0
27 Apr 2022
Executive Function: A Contrastive Value Policy for Resampling and
  Relabeling Perceptions via Hindsight Summarization?
Executive Function: A Contrastive Value Policy for Resampling and Relabeling Perceptions via Hindsight Summarization?
Christopher T. Lengerich
Ben Lengerich
141
1
0
27 Apr 2022
Previous
123...121314...252627
Next
Page 13 of 27
Pageof 27