arXiv:1707.01495
Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL

Papers citing "Hindsight Experience Replay"

50 / 1,339 papers shown
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
Zeqiang Zhang
Fabian Wurzberger
Gerrit Schmid
Sebastian Gottwald
Daniel A. Braun
SSL
224
0
0
03 Sep 2025
HuBE: Cross-Embodiment Human-like Behavior Execution for Humanoid Robots
Shipeng Lyu
Fangyuan Wang
Weiwei Lin
Luhao Zhu
D. Navarro-Alarcon
Guodong Guo
124
0
0
26 Aug 2025
LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening
Halid Abdulrahim Kadi
K. Terzic
110
0
0
23 Aug 2025
Goals and the Structure of Experience
Nadav Amir
Stas Tiomkin
Angela Langdon
144
0
0
20 Aug 2025
Visuomotor Grasping with World Models for Surgical Robots
Hongbin Lin
Bin Li
K. W. S. Au
156
1
0
15 Aug 2025
Scaling Up without Fading Out: Goal-Aware Sparse GNN for RL-based Generalized Planning
International Journal of Control, Automation and Systems (IJCAS), 2025
Sangwoo Jeon
Juchul Shin
Gyeong-Tae Kim
YeonJe Cho
Seongwoo Kim
OffRL
140
0
0
14 Aug 2025
Goal Discovery with Causal Capacity for Efficient Reinforcement Learning
Yan Yu
Yaodong Yang
Zhengbo Lu
Chengdong Ma
Wengang Zhou
Houqiang Li
CML
136
0
0
13 Aug 2025
Towards Safe Imitation Learning via Potential Field-Guided Flow Matching
Haoran Ding
Anqing Duan
Zezhou Sun
Leonel Rozo
Noémie Jaquier
Dezhen Song
Yoshihiko Nakamura
140
0
0
12 Aug 2025
ASkDAgger: Active Skill-level Data Aggregation for Interactive Imitation Learning
Jelle Luijkx
Zlatan Ajanović
L. Ferranti
Jens Kober
173
1
0
07 Aug 2025
RecoMind: A Reinforcement Learning Framework for Optimizing In-Session User Satisfaction in Recommendation Systems
Mehdi Ben Ayed
Fei Feng
Jay Adams
Vishwakarma Singh
Kritarth Anand
Jiajing Xu
OffRL
141
1
0
31 Jul 2025
Test-time Offline Reinforcement Learning on Goal-related Experience
Marco Bagatella
Mert Albaba
Jonas Hübotter
Georg Martius
Andreas Krause
OffRL
216
4
0
24 Jul 2025
Sensor-Space Based Robust Kinematic Control of Redundant Soft Manipulator by Learning
Yinan Meng
Kun Qian
Jiong Yang
Renbo Su
Zhenhong Li
Charlie C. L. Wang
139
0
0
19 Jul 2025
Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI
Julien Pourcel
Cédric Colas
Pierre-Yves Oudeyer
LRM
255
9
0
10 Jul 2025
2048: Reinforcement Learning in a Delayed Reward Environment
Prady Saligram
Tanvir Bhathal
Robby Manihani
OffRL
191
1
0
07 Jul 2025
Planning under Uncertainty to Goal Distributions
Adam Conkey
Tucker Hermans
388
3
0
01 Jul 2025
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning
Prajwal Koirala
Cody Fleming
OffRL
319
4
0
26 Jun 2025
BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning
Xuechen Zhang
Zijian Huang
Yingcong Li
Chenshun Ni
Jiasi Chen
Samet Oymak
OffRL, MoE, LRM
220
12
0
20 Jun 2025
Energy-Based Transfer for Reinforcement Learning
Zeyun Deng
Jasorsi Ghosh
Fiona Xie
Yuzhe Lu
Katia Sycara
Joseph Campbell
172
0
0
19 Jun 2025
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
Ranting Hu
OffRL
304
0
0
18 Jun 2025
ClutterDexGrasp: A Sim-to-Real System for General Dexterous Grasping in Cluttered Scenes
Zeyuan Chen
Qiyang Yan
Yuanpei Chen
Tianhao Wu
Jiyao Zhang
Zihan Ding
Jinzhou Li
Yaodong Yang
Hao Dong
380
5
0
17 Jun 2025
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
Mingkang Zhu
Xi Chen
Zhongdao Wang
Bei Yu
Hengshuang Zhao
Jiaya Jia
201
3
0
17 Jun 2025
DynaGuide: Steering Diffusion Polices with Active Dynamic Guidance
Maximilian Du
Shuran Song
263
4
0
16 Jun 2025
Goal-based Self-Adaptive Generative Adversarial Imitation Learning (Goal-SAGAIL) for Multi-goal Robotic Manipulation Tasks
Yingyi Kuang
Luis J. Manso
George Vogiatzis
117
0
0
15 Jun 2025
CIRO7.2: A Material Network with Circularity of -7.2 and Reinforcement-Learning-Controlled Robotic Disassembler
Federico Zocco
Monica Malvezzi
152
0
0
13 Jun 2025
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
Mido Assran
Adrien Bardes
David Fan
Q. Garrido
Russell Howes
...
Sarath Chandar
Franziska Meier
Yann LeCun
Michael G. Rabbat
Nicolas Ballas
277
138
0
11 Jun 2025
Uncertainty Prioritized Experience Replay
Rodrigo Carrasco-Davis
Sebastian Lee
Claudia Clopath
Will Dabney
219
1
0
10 Jun 2025
Learning The Minimum Action Distance
Lorenzo Steccanella
Joshua B. Evans
Özgür Simsek
Anders Jonsson
309
0
0
10 Jun 2025
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Junhong Shen
Hao Bai
Lunjun Zhang
Yifei Zhou
Amrith Rajagopal Setlur
...
Diego Caples
Nan Jiang
Tong Zhang
Ameet Talwalkar
Aviral Kumar
LLMAG, LRM
296
17
0
09 Jun 2025
Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
Seungho Baek
Taegeon Park
Jongchan Park
Seungjun Oh
Yusung Kim
OffRL
276
2
0
09 Jun 2025
Reachability Weighted Offline Goal-conditioned Resampling
Wenyan Yang
Joni Pajarinen
OffRL
203
0
0
03 Jun 2025
SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning
Yihao Liu
Shuocheng Li
Lang Cao
Yuhang Xie
Mengyu Zhou
Haoyu Dong
Xiaojun Ma
Shi Han
Dongmei Zhang
OffRL, ReLM, LRM
245
5
0
01 Jun 2025
Diffusion Guidance Is a Controllable Policy Improvement Operator
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
283
11
0
29 May 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
254
6
0
29 May 2025
Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals
V. Wang
Tinghuai Wang
Joni Pajarinen
BDL
160
2
0
27 May 2025
Can Large Reasoning Models Self-Train?
Sheikh Shafayat
Fahim Tajwar
Ruslan Salakhutdinov
J. Schneider
Andrea Zanette
ReLM, OffRL, LRM
416
21
0
27 May 2025
Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning
Quentin Rouxel
Clemente Donoso
Fei Chen
S. Ivaldi
Jean-Baptiste Mouret
OffRL
394
1
0
26 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
307
4
0
26 May 2025
Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies
Kevin Li
Marinka Zitnik
OffRL
209
0
0
25 May 2025
CiRL: Open-Source Environments for Reinforcement Learning in Circular Economy and Net Zero
Federico Zocco
Andrea Corti
Monica Malvezzi
AI4CE
339
1
0
24 May 2025
Flattening Hierarchies with Policy Bootstrapping
John L. Zhou
Jonathan C. Kao
OffRL
386
1
0
20 May 2025
Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning
Hongjoon Ahn
Heewoong Choi
Jisu Han
Taesup Moon
OffRL
323
2
0
19 May 2025
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Dongsu Lee
Minhae Kwon
OffRL
300
3
0
19 May 2025
Attention-Based Reward Shaping for Sparse and Delayed Rewards
Ian Holmes
Min Chi
OffRL
269
2
0
16 May 2025
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
IEEE Access, 2025
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
L. Hanzo
256
2
0
15 May 2025
General Dynamic Goal Recognition using Goal-Conditioned and Meta Reinforcement Learning
Osher Elhadad
Reuth Mirsky
AI4CE
170
2
0
14 May 2025
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Shuai Han
Mehdi Dastani
Shihan Wang
261
0
0
13 May 2025
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
Hanjung Kim
Jaehyun Kang
Hyolim Kang
Meedeum Cho
Seon Joo Kim
Youngwoon Lee
457
10
0
13 May 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
327
2
0
06 May 2025
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
Chenran Zhao
Dianxi Shi
Mengzhu Wang
Jianqiang Xia
Huanhuan Yang
Songchang Jin
Shaowu Yang
Chunping Qiu
283
0
0
04 May 2025
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
Bofei Liu
Dong Ye
Zunhao Yao
Zhaowei Sun
246
0
0
04 May 2025