Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1707.01495
Cited By
v1
v2
v3 (latest)
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Hindsight Experience Replay"
50 / 1,335 papers shown
Title
MagBotSim: Physics-Based Simulation and Reinforcement Learning Environments for Magnetic Robotics
Lara Bergmann
Cedric Grothues
Klaus Neumann
65
0
0
20 Nov 2025
NFQ2.0: The CartPole Benchmark Revisited
Sascha Lange
Roland Hafner
Martin Riedmiller
52
0
0
16 Nov 2025
Expressive Temporal Specifications for Reward Monitoring
Omar Adalat
Francesco Belardinelli
79
0
0
16 Nov 2025
Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
Daniel Furelos-Blanco
Charles Pert
Frederik Kelbel
Alex F Spies
Alessandra Russo
Michael Dennis
76
0
0
16 Nov 2025
Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning
Daniel De Dios Allegue
J. He
F. Oliehoek
OffRL
225
0
0
10 Nov 2025
Physically-Grounded Goal Imagination: Physics-Informed Variational Autoencoder for Self-Supervised Reinforcement Learning
Lan Thi Ha Nguyen
Kien Ton Manh
Anh Do Duc
Nam Pham Hai
DRL
SSL
AI4CE
385
0
0
10 Nov 2025
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
Sayambhu Sen
Shalabh Bhatnagar
72
0
0
10 Nov 2025
Learning from Online Videos at Inference Time for Computer-Use Agents
Yujian Liu
Ze Wang
Hao Chen
Ximeng Sun
X. Yu
J. Wu
Jiang-Long Liu
Emad Barsoum
Zicheng Liu
Shiyu Chang
125
0
0
06 Nov 2025
Adaptable Hindsight Experience Replay for Search-Based Learning
Alexandros Vazaios
Jannis Brugger
Cedric Derstroff
Kristian Kersting
Mira Mezini
44
0
0
05 Nov 2025
SLAP: Shortcut Learning for Abstract Planning
Yaoyao Liu
Bowen Li
Benjamin Eysenbach
Tom Silver
OffRL
92
0
0
02 Nov 2025
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
Sagalpreet Singh
Rishi Saket
A. Raghuveer
84
0
0
29 Oct 2025
Learning "Partner-Aware" Collaborators in Multi-Party Collaboration
Abhijnan Nath
Nikhil Krishnaswamy
94
0
0
26 Oct 2025
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL
CML
156
0
0
24 Oct 2025
A Unified Framework for Zero-Shot Reinforcement Learning
Jacopo Di Ventura
Jan Felix Kleuker
Aske Plaat
Thomas M. Moerland
OffRL
80
0
0
23 Oct 2025
DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning
Runpeng Xie
Quanwei Wang
Hao Hu
Zherui Zhou
Ni Mu
Xiyun Li
Yiqin Yang
Shuang Xu
Qianchuan Zhao
Bo Xu
112
0
0
22 Oct 2025
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Kathryn Wantlin
Chongyi Zheng
Benjamin Eysenbach
132
0
0
20 Oct 2025
DDBot: Differentiable Physics-based Digging Robot for Unknown Granular Materials
Xintong Yang
Minglun Wei
Ze Ji
Yu-kun Lai
AI4CE
164
0
0
20 Oct 2025
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Anjie Liu
Jianhong Wang
Samuel Kaski
Jun Wang
M. Yang
180
0
0
20 Oct 2025
RLAF: Reinforcement Learning from Automaton Feedback
Mahyar Alinejad
Alvaro Velasquez
Yue Wang
George Atia
OffRL
73
0
0
17 Oct 2025
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
Roger Creus Castanyer
Faisal Mohamed
Pablo Samuel Castro
Cyrus Neary
Glen Berseth
OffRL
LRM
AI4CE
173
0
0
16 Oct 2025
Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL
Mahsa Bastankhah
Grace Liu
Dilip Arumugam
Thomas L. Griffiths
Benjamin Eysenbach
80
1
0
15 Oct 2025
A Primer on SO(3) Action Representations in Deep Reinforcement Learning
Martin Schuck
Sherif Samy
Angela P. Schoellig
68
0
0
13 Oct 2025
Towards Safe Maneuvering of Double-Ackermann-Steering Robots with a Soft Actor-Critic Framework
Kohio Deflesselle
Mélodie Daniel
Aly Magassouba
Miguel Aranda
Olivier Ly
85
0
0
11 Oct 2025
Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
Michael Y. Hu
Benjamin Van Durme
Jacob Andreas
Harsh Jhamtani
LLMAG
64
0
0
11 Oct 2025
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
Shaokai Wu
Yanbiao Ji
Qiuchang Li
Zhiyi Zhang
Shalayiding Sirejiding
Wenyuan Xie
Guodong Zhang
Bayram Bayramli
Yue Ding
Hongtao Lu
92
0
0
11 Oct 2025
BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards
Sangyun Lee
Brandon Amos
Giulia Fanti
96
0
0
10 Oct 2025
Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation
Xiaofeng Cao
Mingwei Xu
Xin Yu
Jiangchao Yao
Wei Ye
...
Minling Zhang
Ivor Tsang
Yew-Soon Ong
James T. Kwok
Heng Tao Shen
128
3
0
10 Oct 2025
Agent Learning via Early Experience
Kai Zhang
Xiangchao Chen
Bo Liu
Tianci Xue
Zeyi Liao
...
J. Zhu
Huan Sun
Jason Weston
Eric Fosler-Lussier
Y. Wu
OffRL
154
5
0
09 Oct 2025
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Evgenii Opryshko
Junwei Quan
C. Voelcker
Yilun Du
Igor Gilitschenski
OffRL
92
2
0
08 Oct 2025
Automaton Constrained Q-Learning
Anastasios Manganaris
Vittorio Giammarino
A. H. Qureshi
135
0
0
06 Oct 2025
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning
Jonas Hübotter
Leander Diaz-Bone
Ido Hakimi
Andreas Krause
Moritz Hardt
127
0
0
06 Oct 2025
Learning to Act Through Contact: A Unified View of Multi-Task Robot Learning
Shafeef Omar
Majid Khadiv
71
0
0
04 Oct 2025
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
Lunjun Zhang
Shuo Han
Hanrui Lyu
Bradly C. Stadie
OffRL
215
1
0
03 Oct 2025
Fixing That Free Lunch: When, Where, and Why Synthetic Data Fails in Model-Based Policy Optimization
Brett Barkley
David Fridovich-Keil
OffRL
132
0
0
01 Oct 2025
Aristotle: IMO-level Automated Theorem Proving
Tudor Achim
Alex Best
Kevin Der
Mathïs Fédérico
Sergei Gukov
...
Matyas Tamas
Vlad Tenev
Jonathan Thomm
Harold Williams
Lawrence Wu
LRM
142
3
0
01 Oct 2025
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Yiran Shen
Yu Xia
Jonathan D. Chang
Prithviraj Ammanabrolu
112
0
0
01 Oct 2025
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
Xinyu Zhang
Aishik Deb
Klaus Mueller
56
0
0
30 Sep 2025
In-Context Compositional Q-Learning for Offline Reinforcement Learning
Qiushui Xu
Yuhao Huang
Yushu Jiang
Lei Song
Jinyu Wang
Wenliang Zheng
Jiang Bian
OffRL
88
0
0
28 Sep 2025
Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations
Vivek Myers
Bill Chunyuan Zheng
Benjamin Eysenbach
Sergey Levine
OffRL
128
1
0
24 Sep 2025
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Yujie Zhu
Charles A. Hepburn
Matthew Thorpe
Giovanni Montana
132
0
0
19 Sep 2025
Sample Efficient Experience Replay in Non-stationary Environments
Tianyang Duan
Zongyuan Zhang
Songxiao Guo
Yuanye Zhao
Zheng Lin
...
Yi Liu
Dianxin Luan
Dong Huang
Heming Cui
Yong Cui
88
1
0
18 Sep 2025
Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
Chirayu Nimonkar
Shlok Shah
Catherine Ji
Benjamin Eysenbach
118
1
0
12 Sep 2025
Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration
Sirui Xu
Yu-Wei Chao
Liuyu Bian
Arsalan Mousavian
Yu-Xiong Wang
Liang-Yan Gui
Wei Yang
72
0
0
11 Sep 2025
Imagined Autocurricula
Ahmet H. Güzel
Matthew Jackson
Jarek Liesen
Tim Rocktaschel
Jakob Foerster
Ilija Bogunovic
Jack Parker-Holder
146
1
0
11 Sep 2025
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Vittorio Giammarino
Ruiqi Ni
A. H. Qureshi
OffRL
AI4CE
126
1
0
08 Sep 2025
Reinforcement Learning with Anticipation: A Hierarchical Approach for Long-Horizon Tasks
Yang Yu
56
1
0
06 Sep 2025
RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning
Matthew Lai
Keegan Go
Zhibin Li
Torsten Kroger
S. Schaal
Kelsey Allen
Jonathan Scholz
80
6
0
05 Sep 2025
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
Zeqiang Zhang
Fabian Wurzberger
Gerrit Schmid
Sebastian Gottwald
Daniel A. Braun
SSL
176
0
0
03 Sep 2025
HuBE: Cross-Embodiment Human-like Behavior Execution for Humanoid Robots
Shipeng Lyu
Fangyuan Wang
Weiwei Lin
Luhao Zhu
D. Navarro-Alarcon
Guodong Guo
74
0
0
26 Aug 2025
LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening
Halid Abdulrahim Kadi
K. Terzic
76
0
0
23 Aug 2025
1
2
3
4
...
25
26
27
Next