ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXivPDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,245 papers shown
Title
Generalizing Goal-Conditioned Reinforcement Learning with Variational
  Causal Reasoning
Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning
Wenhao Ding
Haohong Lin
Yue Liu
Ding Zhao
LRM
28
37
0
19 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning
  Perspective
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
32
31
0
13 Jul 2022
Learning Temporally Extended Skills in Continuous Domains as Symbolic
  Actions for Planning
Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning
Jan Achterhold
Markus Krimmel
Joerg Stueckler
40
9
0
11 Jul 2022
Automatic Exploration of Textual Environments with Language-Conditioned
  Autotelic Agents
Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents
Laetitia Teodorescu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
LLMAG
35
0
0
08 Jul 2022
Storehouse: a Reinforcement Learning Environment for Optimizing
  Warehouse Management
Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse Management
Julen Cestero
M. Quartulli
Alberto Maria Metelli
Marcello Restelli
OffRL
26
6
0
08 Jul 2022
A Learning System for Motion Planning of Free-Float Dual-Arm Space
  Manipulator towards Non-Cooperative Object
A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative Object
Shengjie Wang
Yu-wen Cao
Xiang Zheng
Tao Zhang
35
16
0
06 Jul 2022
DualAfford: Learning Collaborative Visual Affordance for Dual-gripper
  Manipulation
DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation
Yan Zhao
Ruihai Wu
Zhehuan Chen
Yourong Zhang
Qingnan Fan
Kaichun Mo
Hao Dong
31
14
0
05 Jul 2022
Goal-Conditioned Generators of Deep Policies
Goal-Conditioned Generators of Deep Policies
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
47
8
0
04 Jul 2022
USHER: Unbiased Sampling for Hindsight Experience Replay
USHER: Unbiased Sampling for Hindsight Experience Replay
Liam Schramm
Yunfu Deng
Edgar Granados
Abdeslam Boularias
19
4
0
03 Jul 2022
Watch and Match: Supercharging Imitation with Regularized Optimal
  Transport
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
63
62
0
30 Jun 2022
Dext-Gen: Dexterous Grasping in Sparse Reward Environments with Full
  Orientation Control
Dext-Gen: Dexterous Grasping in Sparse Reward Environments with Full Orientation Control
Martin Schuck
Jan Brüdigam
A. Capone
Stefan Sosnowski
Sandra Hirche
19
1
0
28 Jun 2022
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement
  Learning Systems
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems
Joe Eappen
Suresh Jagannathan
19
3
0
28 Jun 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
44
22
0
24 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
52
288
0
23 Jun 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without
  Supervision
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
42
4
0
23 Jun 2022
Curious Exploration via Structured World Models Yields Zero-Shot Object
  Manipulation
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation
Cansu Sancaktar
Sebastian Blaes
Georg Martius
LM&Ro
28
25
0
22 Jun 2022
Learning Neuro-Symbolic Skills for Bilevel Planning
Learning Neuro-Symbolic Skills for Bilevel Planning
Tom Silver
Ashay Athalye
J. Tenenbaum
Tomas Lozano-Perez
L. Kaelbling
39
59
0
21 Jun 2022
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from
  Experience Replay Buffer
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer
Jeewon Jeon
Woojun Kim
Whiyoung Jung
Young-Jin Sung
29
35
0
20 Jun 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
53
101
0
19 Jun 2022
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
Brandon Trabucco
Mariano Phielipp
Glen Berseth
34
28
0
17 Jun 2022
Generalised Policy Improvement with Geometric Policy Composition
Generalised Policy Improvement with Geometric Policy Composition
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
22
7
0
17 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
27
69
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
42
141
0
15 Jun 2022
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal
  Reinforcement Learning
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning
Nicolas Castanet
Sylvain Lamprier
Olivier Sigaud
25
2
0
14 Jun 2022
Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal
  Environments
Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments
Hugo Caselles-Dupré
Olivier Sigaud
Mohamed Chetouani
9
3
0
09 Jun 2022
Deep Hierarchical Planning from Pixels
Deep Hierarchical Planning from Pixels
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
49
93
0
08 Jun 2022
Discrete State-Action Abstraction via the Successor Representation
Discrete State-Action Abstraction via the Successor Representation
A. Attali
Pedro Cisneros-Velarde
M. Morales
Nancy M. Amato
OffRL
36
1
0
07 Jun 2022
Imitating Past Successes can be Very Suboptimal
Imitating Past Successes can be Very Suboptimal
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
44
16
0
07 Jun 2022
Introspective Experience Replay: Look Back When Surprised
Introspective Experience Replay: Look Back When Surprised
Ramnath Kumar
Dheeraj M. Nagaraj
OffRL
18
2
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via
  $f$-Advantage Regression
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via fff-Advantage Regression
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
25
53
0
07 Jun 2022
Achieving Goals using Reward Shaping and Curriculum Learning
Achieving Goals using Reward Shaping and Curriculum Learning
M. Anca
Jonathan D. Thomas
Dabal Pedamonti
M. Studley
Mark Hansen
12
1
0
06 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
41
25
0
02 Jun 2022
When does return-conditioned supervised learning work for offline
  reinforcement learning?
When does return-conditioned supervised learning work for offline reinforcement learning?
David Brandfonbrener
A. Bietti
Jacob Buckman
Romain Laroche
Joan Bruna
OffRL
30
60
0
02 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal
  Search
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygó'zd'z
Damian Stachura
Piotr Pikekos
Yuhuai Wu
Lukasz Kuciñski
Piotr Milo's
LRM
23
9
0
01 Jun 2022
Human-AI Shared Control via Policy Dissection
Human-AI Shared Control via Policy Dissection
Quanyi Li
Zhenghao Peng
Haibin Wu
Lan Feng
Bolei Zhou
28
13
0
31 May 2022
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated
  and Musculoskeletal Systems
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems
Pierre Schumacher
Daniel Haeufle
Le Chen
Syn Schmitt
Georg Martius
36
31
0
30 May 2022
Autoformalization with Large Language Models
Autoformalization with Large Language Models
Yuhuai Wu
Albert Q. Jiang
Wenda Li
M. Rabe
Charles Staats
M. Jamnik
Christian Szegedy
AI4CE
119
161
0
25 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Scalable Multi-Agent Model-Based Reinforcement Learning
Vladimir Egorov
A. Shpilman
28
22
0
25 May 2022
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement
  Learning
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
64
57
0
24 May 2022
Task Relabelling for Multi-task Transfer using Successor Features
Task Relabelling for Multi-task Transfer using Successor Features
Martin Balla
Diego Perez-Liebana
19
1
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
204
637
0
20 May 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned
  Reinforcement Learning
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
35
0
0
20 May 2022
Transformer with Memory Replay
Transformer with Memory Replay
R. Liu
Barzan Mozafari
OffRL
70
4
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and
  Knowledge Transfer for Complex Sparse Reward-based Tasks
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
20
14
0
19 May 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in
  Latent Space
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
58
29
0
17 May 2022
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in
  Human Environments
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments
Jakob Thumm
Matthias Althoff
63
34
0
12 May 2022
A State-Distribution Matching Approach to Non-Episodic Reinforcement
  Learning
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
Archit Sharma
Rehaan Ahmad
Chelsea Finn
OOD
OffRL
34
21
0
11 May 2022
Simultaneous Double Q-learning with Conservative Advantage Learning for
  Actor-Critic Methods
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods
Qing Li
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
OffRL
20
2
0
08 May 2022
Diverse Imitation Learning via Self-Organizing Generative Models
Diverse Imitation Learning via Self-Organizing Generative Models
Arash Vahabpour
Tianyi Wang
Qiujing Lu
Omead Brandon Pooladzandi
V. Roychowdhury
SSL
28
1
0
06 May 2022
State Representation Learning for Goal-Conditioned Reinforcement
  Learning
State Representation Learning for Goal-Conditioned Reinforcement Learning
Lorenzo Steccanella
Anders Jonsson
SSL
OffRL
37
4
0
04 May 2022
Previous
123...101112...232425
Next