Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL

Papers citing "Hindsight Experience Replay"

50 / 1,339 papers shown
SAGAS: Semantic-Aware Graph-Assisted Stitching for Offline Temporal Logic Planning
Ruijia Liu
Ancheng Hou
Shaoyuan Li
Xiang Yin
OffRL
72
0
0
30 Nov 2025
Hyper-GoalNet: Goal-Conditioned Manipulation Policy Learning with HyperNetworks
Pei Zhou
Wanting Yao
Qian Luo
Xunzhe Zhou
Yanchao Yang
73
1
0
26 Nov 2025
MagBotSim: Physics-Based Simulation and Reinforcement Learning Environments for Magnetic Robotics
Lara Bergmann
Cedric Grothues
Klaus Neumann
105
0
0
20 Nov 2025
Expressive Temporal Specifications for Reward Monitoring
Omar Adalat
Francesco Belardinelli
137
0
0
16 Nov 2025
NFQ2.0: The CartPole Benchmark Revisited
Sascha Lange
Roland Hafner
Martin Riedmiller
74
0
0
16 Nov 2025
Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
Daniel Furelos-Blanco
Charles Pert
Frederik Kelbel
Alex F Spies
Alessandra Russo
Michael Dennis
116
0
0
16 Nov 2025
Physically-Grounded Goal Imagination: Physics-Informed Variational Autoencoder for Self-Supervised Reinforcement Learning
Lan Thi Ha Nguyen
Kien Ton Manh
Anh Do Duc
Nam Pham Hai
DRL SSL AI4CE
521
0
0
10 Nov 2025
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
Sayambhu Sen
Shalabh Bhatnagar
96
0
0
10 Nov 2025
Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning
Daniel De Dios Allegue
J. He
F. Oliehoek
OffRL
273
0
0
10 Nov 2025
Learning from Online Videos at Inference Time for Computer-Use Agents
Yujian Liu
Ze Wang
Hao Chen
Ximeng Sun
X. Yu
J. Wu
Jiang-Long Liu
Emad Barsoum
Zicheng Liu
Shiyu Chang
153
0
0
06 Nov 2025
Adaptable Hindsight Experience Replay for Search-Based Learning
Alexandros Vazaios
Jannis Brugger
Cedric Derstroff
Kristian Kersting
Mira Mezini
72
0
0
05 Nov 2025
SLAP: Shortcut Learning for Abstract Planning
Yaoyao Liu
Bowen Li
Benjamin Eysenbach
Tom Silver
OffRL
125
1
0
02 Nov 2025
Reinforcement Learning for Robotic Safe Control with Force Sensing
Nan Lin
Linrui Zhang
Yuxuan Chen
Z. Chen
Yujun Zhu
Ruoxi Chen
Peichen Wu
Xiaoping Chen
60
9
0
30 Oct 2025
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
Sagalpreet Singh
Rishi Saket
A. Raghuveer
110
0
0
29 Oct 2025
Learning "Partner-Aware" Collaborators in Multi-Party Collaboration
Abhijnan Nath
Nikhil Krishnaswamy
118
0
0
26 Oct 2025
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL CML
180
0
0
24 Oct 2025
A Unified Framework for Zero-Shot Reinforcement Learning
Jacopo Di Ventura
Jan Felix Kleuker
Aske Plaat
Thomas M. Moerland
OffRL
88
0
0
23 Oct 2025
DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning
Runpeng Xie
Quanwei Wang
Hao Hu
Zherui Zhou
Ni Mu
Xiyun Li
Yiqin Yang
Shuang Xu
Qianchuan Zhao
Bo Xu
144
0
0
22 Oct 2025
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Anjie Liu
Jianhong Wang
Samuel Kaski
Jun Wang
M. Yang
252
0
0
20 Oct 2025
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Kathryn Wantlin
Chongyi Zheng
Benjamin Eysenbach
176
0
0
20 Oct 2025
DDBot: Differentiable Physics-based Digging Robot for Unknown Granular Materials
Xintong Yang
Minglun Wei
Ze Ji
Yu-kun Lai
AI4CE
196
0
0
20 Oct 2025
RLAF: Reinforcement Learning from Automaton Feedback
Mahyar Alinejad
Alvaro Velasquez
Yue Wang
George Atia
OffRL
107
0
0
17 Oct 2025
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
Roger Creus Castanyer
Faisal Mohamed
Pablo Samuel Castro
Cyrus Neary
Glen Berseth
OffRL LRM AI4CE
213
0
0
16 Oct 2025
Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL
Mahsa Bastankhah
Grace Liu
Dilip Arumugam
Thomas L. Griffiths
Benjamin Eysenbach
92
1
0
15 Oct 2025
A Primer on SO(3) Action Representations in Deep Reinforcement Learning
Martin Schuck
Sherif Samy
Angela P. Schoellig
100
0
0
13 Oct 2025
Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
Michael Y. Hu
Benjamin Van Durme
Jacob Andreas
Harsh Jhamtani
LLMAG
104
0
0
11 Oct 2025
Towards Safe Maneuvering of Double-Ackermann-Steering Robots with a Soft Actor-Critic Framework
Kohio Deflesselle
Mélodie Daniel
Aly Magassouba
Miguel Aranda
Olivier Ly
101
0
0
11 Oct 2025
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
Shaokai Wu
Yanbiao Ji
Qiuchang Li
Zhiyi Zhang
Shalayiding Sirejiding
Wenyuan Xie
Guodong Zhang
Bayram Bayramli
Yue Ding
Hongtao Lu
156
0
0
11 Oct 2025
BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards
Sangyun Lee
Brandon Amos
Giulia Fanti
124
0
0
10 Oct 2025
Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation
Xiaofeng Cao
Mingwei Xu
Xin Yu
Jiangchao Yao
Wei Ye
...
Minling Zhang
Ivor Tsang
Yew-Soon Ong
James T. Kwok
Heng Tao Shen
184
3
0
10 Oct 2025
Agent Learning via Early Experience
Kai Zhang
Xiangchao Chen
Bo Liu
Tianci Xue
Zeyi Liao
...
J. Zhu
Huan Sun
Jason Weston
Eric Fosler-Lussier
Y. Wu
OffRL
195
6
0
09 Oct 2025
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Evgenii Opryshko
Junwei Quan
C. Voelcker
Yilun Du
Igor Gilitschenski
OffRL
124
2
0
08 Oct 2025
Automaton Constrained Q-Learning
Anastasios Manganaris
Vittorio Giammarino
A. H. Qureshi
191
0
0
06 Oct 2025
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning
Jonas Hübotter
Leander Diaz-Bone
Ido Hakimi
Andreas Krause
Moritz Hardt
155
1
0
06 Oct 2025
Learning to Act Through Contact: A Unified View of Multi-Task Robot Learning
Shafeef Omar
Majid Khadiv
111
0
0
04 Oct 2025
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
Lunjun Zhang
Shuo Han
Hanrui Lyu
Bradly C. Stadie
OffRL
259
1
0
03 Oct 2025
Aristotle: IMO-level Automated Theorem Proving
Tudor Achim
Alex Best
Kevin Der
Mathïs Fédérico
Sergei Gukov
...
Matyas Tamas
Vlad Tenev
Jonathan Thomm
Harold Williams
Lawrence Wu
LRM
166
4
0
01 Oct 2025
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Yiran Shen
Yu Xia
Jonathan D. Chang
Prithviraj Ammanabrolu
160
0
0
01 Oct 2025
Fixing That Free Lunch: When, Where, and Why Synthetic Data Fails in Model-Based Policy Optimization
Brett Barkley
David Fridovich-Keil
OffRL
152
0
0
01 Oct 2025
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
Xinyu Zhang
Aishik Deb
Klaus Mueller
76
0
0
30 Sep 2025
In-Context Compositional Q-Learning for Offline Reinforcement Learning
Qiushui Xu
Yuhao Huang
Yushu Jiang
Lei Song
Jinyu Wang
Wenliang Zheng
Jiang Bian
OffRL
136
0
0
28 Sep 2025
Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations
Vivek Myers
Bill Chunyuan Zheng
Benjamin Eysenbach
Sergey Levine
OffRL
164
1
0
24 Sep 2025
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Yujie Zhu
Charles A. Hepburn
Matthew Thorpe
Giovanni Montana
188
0
0
19 Sep 2025
Sample Efficient Experience Replay in Non-stationary Environments
Tianyang Duan
Zongyuan Zhang
Songxiao Guo
Yuanye Zhao
Zheng Lin
...
Yi Liu
Dianxin Luan
Dong Huang
Heming Cui
Yong Cui
132
1
0
18 Sep 2025
Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
Chirayu Nimonkar
Shlok Shah
Catherine Ji
Benjamin Eysenbach
162
1
0
12 Sep 2025
Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration
Sirui Xu
Yu-Wei Chao
Liuyu Bian
Arsalan Mousavian
Yu-Xiong Wang
Liang-Yan Gui
Wei Yang
108
0
0
11 Sep 2025
Imagined Autocurricula
Ahmet H. Güzel
Matthew Jackson
Jarek Liesen
Tim Rocktäschel
Jakob Foerster
Ilija Bogunovic
Jack Parker-Holder
219
1
0
11 Sep 2025
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Vittorio Giammarino
Ruiqi Ni
A. H. Qureshi
OffRL AI4CE
190
1
0
08 Sep 2025
Reinforcement Learning with Anticipation: A Hierarchical Approach for Long-Horizon Tasks
Yang Yu
72
1
0
06 Sep 2025
RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning
Matthew Lai
Keegan Go
Zhibin Li
Torsten Kröger
S. Schaal
Kelsey Allen
Jonathan Scholz
120
6
0
05 Sep 2025