ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,335 papers shown
Title
Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular
  Design
Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design
Julien Roy
Pierre-Luc Bacon
C. Pal
Emmanuel Bengio
AI4CE
122
20
0
07 Jun 2023
Learning with a Mole: Transferable latent spatial representations for
  navigation without reconstruction
Learning with a Mole: Transferable latent spatial representations for navigation without reconstructionInternational Conference on Learning Representations (ICLR), 2023
G. Bono
L. Antsfeld
Assem Sadek
G. Monaci
Christian Wolf
SSL
251
8
0
06 Jun 2023
Efficient Multi-Task and Transfer Reinforcement Learning with
  Parameter-Compositional Framework
Efficient Multi-Task and Transfer Reinforcement Learning with Parameter-Compositional FrameworkIEEE Robotics and Automation Letters (RA-L), 2023
Lingfeng Sun
Haichao Zhang
Wei Xu
Masayoshi Tomizuka
217
10
0
02 Jun 2023
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
STEVE-1: A Generative Model for Text-to-Behavior in MinecraftNeural Information Processing Systems (NeurIPS), 2023
Shalev Lifshitz
Keiran Paster
Harris Chan
Jimmy Ba
Sheila A. McIlraith
LM&Ro
292
96
0
01 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuningNeural Information Processing Systems (NeurIPS), 2023
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
393
31
0
01 Jun 2023
Adaptive and Explainable Deployment of Navigation Skills via
  Hierarchical Deep Reinforcement Learning
Adaptive and Explainable Deployment of Navigation Skills via Hierarchical Deep Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2023
Kyowoon Lee
Seongun Kim
Jaesik Choi
169
18
0
31 May 2023
What is Essential for Unseen Goal Generalization of Offline
  Goal-conditioned RL?
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?International Conference on Machine Learning (ICML), 2023
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
169
32
0
30 May 2023
Toward Fine Contact Interactions: Learning to Control Normal Contact
  Force with Limited Information
Toward Fine Contact Interactions: Learning to Control Normal Contact Force with Limited InformationIEEE International Conference on Robotics and Automation (ICRA), 2023
Jinda Cui
Jiawei Xu
David Saldaña
J. Trinkle
110
2
0
29 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal
  Approach
Interpretable Reward Redistribution in Reinforcement Learning: A Causal ApproachNeural Information Processing Systems (NeurIPS), 2023
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
195
25
0
28 May 2023
Visual Affordance Prediction for Guiding Robot Exploration
Visual Affordance Prediction for Guiding Robot ExplorationIEEE International Conference on Robotics and Automation (ICRA), 2023
Homanga Bharadhwaj
Abhi Gupta
Shubham Tulsiani
208
17
0
28 May 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy ReuseIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Kang Xu
Chenjia Bai
Delin Qu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
152
2
0
28 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random
  Features
Self-Supervised Reinforcement Learning that Transfers using Random FeaturesNeural Information Processing Systems (NeurIPS), 2023
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Jianchao Tan
Abhishek Gupta
OffRLSSL
221
11
0
26 May 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer
Future-conditioned Unsupervised Pretraining for Decision TransformerInternational Conference on Machine Learning (ICML), 2023
Zhihui Xie
Zichuan Lin
Deheng Ye
Qiang Fu
Wei Yang
Shuai Li
OffRLOnRL
194
30
0
26 May 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
Emergent Agentic Transformer from Chain of Hindsight ExperienceInternational Conference on Machine Learning (ICML), 2023
Hao Liu
Pieter Abbeel
OffRL
149
33
0
26 May 2023
Reward-Machine-Guided, Self-Paced Reinforcement Learning
Reward-Machine-Guided, Self-Paced Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Cevahir Köprülü
Ufuk Topcu
206
3
0
25 May 2023
Beyond Reward: Offline Preference-guided Policy Optimization
Beyond Reward: Offline Preference-guided Policy OptimizationInternational Conference on Machine Learning (ICML), 2023
Yachen Kang
Dingxu Shi
Jinxin Liu
Li He
Xuetao Zhang
OffRL
167
37
0
25 May 2023
ChemGymRL: An Interactive Framework for Reinforcement Learning for
  Digital Chemistry
ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry
Chris Beeler
Sriram Ganapathi Subramanian
Kyle Sprague
Nouha Chatti
C. Bellinger
...
Amanuel Dawit
Zihan Yang
Xinkai Li
Mark Crowley
Isaac Tamblyn
OffRL
145
7
0
23 May 2023
L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement
  Learning
L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning
Kibeom Kim
Hyun-Dong Lee
Min Whoo Lee
Moonheon Lee
Minsu Lee
Byoung-Tak Zhang
157
1
0
23 May 2023
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Testing of Deep Reinforcement Learning Agents with Surrogate ModelsACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Matteo Biagiola
Paolo Tonella
195
30
0
22 May 2023
Augmenting Autotelic Agents with Large Language Models
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAGLM&Ro
184
36
0
21 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
255
1
0
21 May 2023
Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation
Counterfactual Fairness Filter for Fair-Delay Multi-Robot NavigationAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Hikaru Asano
Ryo Yonetani
Mai Nishimura
Tadashi Kozuno
155
1
0
19 May 2023
Goal-Conditioned Supervised Learning with Sub-Goal Prediction
Goal-Conditioned Supervised Learning with Sub-Goal Prediction
Tom Jurgenson
Aviv Tamar
242
1
0
17 May 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and
  Bidirectional Curriculum
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional CurriculumInternational Conference on Machine Learning (ICML), 2023
Jigang Kim
Daesol Cho
H. J. Kim
229
4
0
17 May 2023
An Ensemble Approach for Automated Theorem Proving Based on Efficient
  Name Invariant Graph Neural Representations
An Ensemble Approach for Automated Theorem Proving Based on Efficient Name Invariant Graph Neural RepresentationsInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Achille Fokoue
Ibrahim Abdelaziz
Mayank Agarwal
S. Ikbal
Akihiro Kishimoto
Guilherme Lima
Ndivhuwo Makondo
Radu Marinescu
OODNAI
115
6
0
15 May 2023
Supplementing Gradient-Based Reinforcement Learning with Simple
  Evolutionary Ideas
Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas
H. Khadilkar
120
0
0
10 May 2023
DeformerNet: Learning Bimanual Manipulation of 3D Deformable Objects
DeformerNet: Learning Bimanual Manipulation of 3D Deformable Objects
Bao Thach
Brian Y. Cho
Shing-Hei Ho
Tucker Hermans
Alan Kuntz
211
6
0
08 May 2023
Rescue Conversations from Dead-ends: Efficient Exploration for
  Task-oriented Dialogue Policy Optimization
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy OptimizationTransactions of the Association for Computational Linguistics (TACL), 2023
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
159
1
0
05 May 2023
Learning to Extrapolate: A Transductive Approach
Learning to Extrapolate: A Transductive ApproachInternational Conference on Learning Representations (ICLR), 2023
Aviv Netanyahu
Abhishek Gupta
Max Simchowitz
Jianchao Tan
Pulkit Agrawal
223
19
0
27 Apr 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Distance Weighted Supervised Learning for Offline Interaction DataInternational Conference on Machine Learning (ICML), 2023
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
228
18
0
26 Apr 2023
Proximal Curriculum for Reinforcement Learning Agents
Proximal Curriculum for Reinforcement Learning Agents
Georgios Tzannetos
Bárbara Gomes Ribeiro
Parameswaran Kamalaruban
Adish Singla
202
13
0
25 Apr 2023
Two-Memory Reinforcement Learning
Two-Memory Reinforcement Learning
Zhao Yang
Thomas M. Moerland
Mike Preuss
Aske Plaat
OffRL
140
4
0
20 Apr 2023
Safety Guaranteed Manipulation Based on Reinforcement Learning Planner
  and Model Predictive Control Actor
Safety Guaranteed Manipulation Based on Reinforcement Learning Planner and Model Predictive Control Actor
Zhenshan Bing
A. Mavrichev
Si-Si Shen
Xiangtong Yao
Ke Chen
Kai Huang
Alois C. Knoll
99
1
0
18 Apr 2023
Affordances from Human Videos as a Versatile Representation for Robotics
Affordances from Human Videos as a Versatile Representation for RoboticsComputer Vision and Pattern Recognition (CVPR), 2023
Shikhar Bahl
Russell Mendonca
Lili Chen
Unnat Jain
Deepak Pathak
315
244
0
17 Apr 2023
Habits and goals in synergy: a variational Bayesian framework for
  behavior
Habits and goals in synergy: a variational Bayesian framework for behaviorNature Communications (Nat. Commun.), 2023
Dongqi Han
Kenji Doya
Dongsheng Li
Jun Tani
BDL
198
205
0
11 Apr 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
273
0
0
07 Apr 2023
ENTL: Embodied Navigation Trajectory Learner
ENTL: Embodied Navigation Trajectory LearnerIEEE International Conference on Computer Vision (ICCV), 2023
Klemen Kotar
Aaron Walsman
Roozbeh Mottaghi
323
11
0
05 Apr 2023
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Optimal Goal-Reaching Reinforcement Learning via Quasimetric LearningInternational Conference on Machine Learning (ICML), 2023
Tongzhou Wang
Antonio Torralba
Phillip Isola
Amy Zhang
OffRL
416
68
0
03 Apr 2023
Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning
Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning
Satoshi Kataoka
Youngseog Chung
Seyed Kamyar Seyed Ghasemipour
Pannag R Sanketi
S. Gu
Igor Mordatch
150
7
0
27 Mar 2023
Learning Generative Models with Goal-conditioned Reinforcement Learning
Learning Generative Models with Goal-conditioned Reinforcement Learning
Mariana Vargas Vieyra
Pierre Ménard
GAN
52
0
0
26 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
280
1
0
23 Mar 2023
Planning Goals for Exploration
Planning Goals for ExplorationInternational Conference on Learning Representations (ICLR), 2023
E. Hu
Richard Chang
Oleh Rybkin
Dinesh Jayaraman
160
28
0
23 Mar 2023
A Survey of Historical Learning: Learning Models with Learning History
A Survey of Historical Learning: Learning Models with Learning History
Xiang Li
Ge Wu
Lingfeng Yang
Wenzhe Wang
Renjie Song
Jian Yang
MUAI4TS
193
2
0
23 Mar 2023
Imitating Graph-Based Planning with Goal-Conditioned Policies
Imitating Graph-Based Planning with Goal-Conditioned PoliciesInternational Conference on Learning Representations (ICLR), 2023
Junsup Kim
Younggyo Seo
SungSoo Ahn
Kyunghwan Son
Jinwoo Shin
162
15
0
20 Mar 2023
Conversational Tree Search: A New Hybrid Dialog Task
Conversational Tree Search: A New Hybrid Dialog TaskConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Dirk Vath
Lindsey Vanderlyn
Ngoc Thang Vu
150
10
0
17 Mar 2023
Efficient Learning of High Level Plans from Play
Efficient Learning of High Level Plans from PlayIEEE International Conference on Robotics and Automation (ICRA), 2023
Núria Armengol Urpí
Marco Bagatella
Otmar Hilliges
Georg Martius
Stelian Coros
OffRL
117
4
0
16 Mar 2023
Goal-conditioned Offline Reinforcement Learning through State Space
  Partitioning
Goal-conditioned Offline Reinforcement Learning through State Space PartitioningMachine-mediated learning (ML), 2023
Mianchu Wang
Yue Jin
Giovanni Montana
OffRL
124
4
0
16 Mar 2023
GOATS: Goal Sampling Adaptation for Scooping with Curriculum
  Reinforcement Learning
GOATS: Goal Sampling Adaptation for Scooping with Curriculum Reinforcement LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Yaru Niu
Shiyu Jin
Zeqing Zhang
Jiacheng Zhu
Ding Zhao
Liangjun Zhang
222
12
0
09 Mar 2023
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Benedict Quartey
Ankit Shah
George Konidaris
115
4
0
09 Mar 2023
Grasping Student: semi-supervised learning for robotic manipulation
Grasping Student: semi-supervised learning for robotic manipulation
P. Krzywicki
Krzysztof Ciebiera
Rafal Michaluk
Inga Maziarz
Marek Cygan
SSL
96
0
0
08 Mar 2023
Previous
123...8910...252627
Next