ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,335 papers shown
Title
MagBotSim: Physics-Based Simulation and Reinforcement Learning Environments for Magnetic Robotics
Lara Bergmann
Cedric Grothues
Klaus Neumann
65
0
0
20 Nov 2025
NFQ2.0: The CartPole Benchmark Revisited
NFQ2.0: The CartPole Benchmark Revisited
Sascha Lange
Roland Hafner
Martin Riedmiller
52
0
0
16 Nov 2025
Expressive Temporal Specifications for Reward Monitoring
Expressive Temporal Specifications for Reward Monitoring
Omar Adalat
Francesco Belardinelli
79
0
0
16 Nov 2025
Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs
Daniel Furelos-Blanco
Charles Pert
Frederik Kelbel
Alex F Spies
Alessandra Russo
Michael Dennis
76
0
0
16 Nov 2025
Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning
Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning
Daniel De Dios Allegue
J. He
F. Oliehoek
OffRL
225
0
0
10 Nov 2025
Physically-Grounded Goal Imagination: Physics-Informed Variational Autoencoder for Self-Supervised Reinforcement Learning
Physically-Grounded Goal Imagination: Physics-Informed Variational Autoencoder for Self-Supervised Reinforcement Learning
Lan Thi Ha Nguyen
Kien Ton Manh
Anh Do Duc
Nam Pham Hai
DRLSSLAI4CE
385
0
0
10 Nov 2025
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
Sayambhu Sen
Shalabh Bhatnagar
72
0
0
10 Nov 2025
Learning from Online Videos at Inference Time for Computer-Use Agents
Learning from Online Videos at Inference Time for Computer-Use Agents
Yujian Liu
Ze Wang
Hao Chen
Ximeng Sun
X. Yu
J. Wu
Jiang-Long Liu
Emad Barsoum
Zicheng Liu
Shiyu Chang
125
0
0
06 Nov 2025
Adaptable Hindsight Experience Replay for Search-Based Learning
Adaptable Hindsight Experience Replay for Search-Based Learning
Alexandros Vazaios
Jannis Brugger
Cedric Derstroff
Kristian Kersting
Mira Mezini
44
0
0
05 Nov 2025
SLAP: Shortcut Learning for Abstract Planning
SLAP: Shortcut Learning for Abstract Planning
Yaoyao Liu
Bowen Li
Benjamin Eysenbach
Tom Silver
OffRL
92
0
0
02 Nov 2025
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
Sagalpreet Singh
Rishi Saket
A. Raghuveer
84
0
0
29 Oct 2025
Learning "Partner-Aware" Collaborators in Multi-Party Collaboration
Learning "Partner-Aware" Collaborators in Multi-Party Collaboration
Abhijnan Nath
Nikhil Krishnaswamy
94
0
0
26 Oct 2025
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRLCML
156
0
0
24 Oct 2025
A Unified Framework for Zero-Shot Reinforcement Learning
A Unified Framework for Zero-Shot Reinforcement Learning
Jacopo Di Ventura
Jan Felix Kleuker
Aske Plaat
Thomas M. Moerland
OffRL
80
0
0
23 Oct 2025
DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning
DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning
Runpeng Xie
Quanwei Wang
Hao Hu
Zherui Zhou
Ni Mu
Xiyun Li
Yiqin Yang
Shuang Xu
Qianchuan Zhao
Bo Xu
112
0
0
22 Oct 2025
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Kathryn Wantlin
Chongyi Zheng
Benjamin Eysenbach
132
0
0
20 Oct 2025
DDBot: Differentiable Physics-based Digging Robot for Unknown Granular Materials
DDBot: Differentiable Physics-based Digging Robot for Unknown Granular Materials
Xintong Yang
Minglun Wei
Ze Ji
Yu-kun Lai
AI4CE
164
0
0
20 Oct 2025
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Anjie Liu
Jianhong Wang
Samuel Kaski
Jun Wang
M. Yang
180
0
0
20 Oct 2025
RLAF: Reinforcement Learning from Automaton Feedback
RLAF: Reinforcement Learning from Automaton Feedback
Mahyar Alinejad
Alvaro Velasquez
Yue Wang
George Atia
OffRL
73
0
0
17 Oct 2025
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
Roger Creus Castanyer
Faisal Mohamed
Pablo Samuel Castro
Cyrus Neary
Glen Berseth
OffRLLRMAI4CE
173
0
0
16 Oct 2025
Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL
Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL
Mahsa Bastankhah
Grace Liu
Dilip Arumugam
Thomas L. Griffiths
Benjamin Eysenbach
80
1
0
15 Oct 2025
A Primer on SO(3) Action Representations in Deep Reinforcement Learning
A Primer on SO(3) Action Representations in Deep Reinforcement Learning
Martin Schuck
Sherif Samy
Angela P. Schoellig
68
0
0
13 Oct 2025
Towards Safe Maneuvering of Double-Ackermann-Steering Robots with a Soft Actor-Critic Framework
Towards Safe Maneuvering of Double-Ackermann-Steering Robots with a Soft Actor-Critic Framework
Kohio Deflesselle
Mélodie Daniel
Aly Magassouba
Miguel Aranda
Olivier Ly
85
0
0
11 Oct 2025
Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
Michael Y. Hu
Benjamin Van Durme
Jacob Andreas
Harsh Jhamtani
LLMAG
64
0
0
11 Oct 2025
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
Shaokai Wu
Yanbiao Ji
Qiuchang Li
Zhiyi Zhang
Shalayiding Sirejiding
Wenyuan Xie
Guodong Zhang
Bayram Bayramli
Yue Ding
Hongtao Lu
92
0
0
11 Oct 2025
BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards
BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards
Sangyun Lee
Brandon Amos
Giulia Fanti
96
0
0
10 Oct 2025
Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation
Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation
Xiaofeng Cao
Mingwei Xu
Xin Yu
Jiangchao Yao
Wei Ye
...
Minling Zhang
Ivor Tsang
Yew-Soon Ong
James T. Kwok
Heng Tao Shen
128
3
0
10 Oct 2025
Agent Learning via Early Experience
Agent Learning via Early Experience
Kai Zhang
Xiangchao Chen
Bo Liu
Tianci Xue
Zeyi Liao
...
J. Zhu
Huan Sun
Jason Weston
Eric Fosler-Lussier
Y. Wu
OffRL
154
5
0
09 Oct 2025
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Evgenii Opryshko
Junwei Quan
C. Voelcker
Yilun Du
Igor Gilitschenski
OffRL
92
2
0
08 Oct 2025
Automaton Constrained Q-Learning
Automaton Constrained Q-Learning
Anastasios Manganaris
Vittorio Giammarino
A. H. Qureshi
135
0
0
06 Oct 2025
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning
Jonas Hübotter
Leander Diaz-Bone
Ido Hakimi
Andreas Krause
Moritz Hardt
127
0
0
06 Oct 2025
Learning to Act Through Contact: A Unified View of Multi-Task Robot Learning
Learning to Act Through Contact: A Unified View of Multi-Task Robot Learning
Shafeef Omar
Majid Khadiv
71
0
0
04 Oct 2025
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
Lunjun Zhang
Shuo Han
Hanrui Lyu
Bradly C. Stadie
OffRL
215
1
0
03 Oct 2025
Fixing That Free Lunch: When, Where, and Why Synthetic Data Fails in Model-Based Policy Optimization
Fixing That Free Lunch: When, Where, and Why Synthetic Data Fails in Model-Based Policy Optimization
Brett Barkley
David Fridovich-Keil
OffRL
132
0
0
01 Oct 2025
Aristotle: IMO-level Automated Theorem Proving
Aristotle: IMO-level Automated Theorem Proving
Tudor Achim
Alex Best
Kevin Der
Mathïs Fédérico
Sergei Gukov
...
Matyas Tamas
Vlad Tenev
Jonathan Thomm
Harold Williams
Lawrence Wu
LRM
142
3
0
01 Oct 2025
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Yiran Shen
Yu Xia
Jonathan D. Chang
Prithviraj Ammanabrolu
112
0
0
01 Oct 2025
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
Xinyu Zhang
Aishik Deb
Klaus Mueller
56
0
0
30 Sep 2025
In-Context Compositional Q-Learning for Offline Reinforcement Learning
In-Context Compositional Q-Learning for Offline Reinforcement Learning
Qiushui Xu
Yuhao Huang
Yushu Jiang
Lei Song
Jinyu Wang
Wenliang Zheng
Jiang Bian
OffRL
88
0
0
28 Sep 2025
Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations
Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations
Vivek Myers
Bill Chunyuan Zheng
Benjamin Eysenbach
Sergey Levine
OffRL
128
1
0
24 Sep 2025
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Yujie Zhu
Charles A. Hepburn
Matthew Thorpe
Giovanni Montana
132
0
0
19 Sep 2025
Sample Efficient Experience Replay in Non-stationary Environments
Sample Efficient Experience Replay in Non-stationary Environments
Tianyang Duan
Zongyuan Zhang
Songxiao Guo
Yuanye Zhao
Zheng Lin
...
Yi Liu
Dianxin Luan
Dong Huang
Heming Cui
Yong Cui
88
1
0
18 Sep 2025
Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
Chirayu Nimonkar
Shlok Shah
Catherine Ji
Benjamin Eysenbach
118
1
0
12 Sep 2025
Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration
Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration
Sirui Xu
Yu-Wei Chao
Liuyu Bian
Arsalan Mousavian
Yu-Xiong Wang
Liang-Yan Gui
Wei Yang
72
0
0
11 Sep 2025
Imagined Autocurricula
Imagined Autocurricula
Ahmet H. Güzel
Matthew Jackson
Jarek Liesen
Tim Rocktaschel
Jakob Foerster
Ilija Bogunovic
Jack Parker-Holder
146
1
0
11 Sep 2025
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Vittorio Giammarino
Ruiqi Ni
A. H. Qureshi
OffRLAI4CE
126
1
0
08 Sep 2025
Reinforcement Learning with Anticipation: A Hierarchical Approach for Long-Horizon Tasks
Reinforcement Learning with Anticipation: A Hierarchical Approach for Long-Horizon Tasks
Yang Yu
56
1
0
06 Sep 2025
RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning
RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning
Matthew Lai
Keegan Go
Zhibin Li
Torsten Kroger
S. Schaal
Kelsey Allen
Jonathan Scholz
80
6
0
05 Sep 2025
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
Zeqiang Zhang
Fabian Wurzberger
Gerrit Schmid
Sebastian Gottwald
Daniel A. Braun
SSL
176
0
0
03 Sep 2025
HuBE: Cross-Embodiment Human-like Behavior Execution for Humanoid Robots
HuBE: Cross-Embodiment Human-like Behavior Execution for Humanoid Robots
Shipeng Lyu
Fangyuan Wang
Weiwei Lin
Luhao Zhu
D. Navarro-Alarcon
Guodong Guo
74
0
0
26 Aug 2025
LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening
LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening
Halid Abdulrahim Kadi
K. Terzic
76
0
0
23 Aug 2025
1234...252627
Next