1707.01495
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing "Hindsight Experience Replay"
50 / 1,339 papers shown
CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation
Mazal Bethany
Nishant Vishwamitra
Cho-Yu Chiang
Peyman Najafirad
AAML
288
2
0
03 May 2025
Neuro-Symbolic Generation of Explanations for Robot Policies with Weighted Signal Temporal Logic
Mikihisa Yuasa
R. Sreenivas
Huy T. Tran
420
0
0
30 Apr 2025
Hierarchical Reinforcement Learning in Multi-Goal Spatial Navigation with Autonomous Mobile Robots
Brendon Johnson
Alfredo Weitzenfeld
365
1
0
26 Apr 2025
Planning with Diffusion Models for Target-Oriented Dialogue Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Hanwen Du
Bo Peng
Xia Ning
401
0
0
23 Apr 2025
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Jie Cheng
Ruixi Qiao
Lijun Li
Chao Guo
Gang Xiong
Yisheng Lv
Fei-Yue Wang
LRM
951
21
0
21 Apr 2025
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Fikrican Özgür
René Zurbrügg
Suryansh Kumar
290
0
0
15 Apr 2025
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
Computer Vision and Pattern Recognition (CVPR), 2025
Zhao Dong
Ka Chen
Zhaoyang Lv
Hong-Xing Yu
Yunzhi Zhang
...
Xiaqing Pan
Mingfei Yan
Jiajun Wu
Carl Ren
Richard Newcombe
366
17
0
11 Apr 2025
Diffusion Models for Robotic Manipulation: A Survey
Frontiers in Robotics and AI (Front. Robot. AI), 2025
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
514
25
0
11 Apr 2025
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Yicheng Gao
Ning Yang
Stephen Xia
OffRL
377
0
0
08 Apr 2025
Solving Sokoban using Hierarchical Reinforcement Learning with Landmarks
Sergey Pastukhov
231
0
0
06 Apr 2025
Outlook Towards Deployable Continual Learning for Particle Accelerators
Kishansingh Rajput
Sen Lin
Auralee Edelen
Willem Blokland
Malachi Schram
253
1
0
04 Apr 2025
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Younghwan Lee
Tung M. Luu
Donghoon Lee
Chang D. Yoo
3DV
VLM
OffRL
318
1
0
03 Apr 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Llewyn Salt
Marcus Gallagher
241
1
0
02 Apr 2025
MAER-Nav: Bidirectional Motion Learning Through Mirror-Augmented Experience Replay for Robot Navigation
Shanze Wang
Mingao Tan
Zhiyong Yang
Biao Huang
Xiaoyu Shen
Hailong Huang
Wei Zhang
154
0
0
31 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
396
2
0
24 Mar 2025
Causally Aligned Curriculum Learning
International Conference on Learning Representations (ICLR), 2025
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
301
6
0
21 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
243
0
0
20 Mar 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Luc McCutcheon
Bahman Gharesifard
Saber Fallah
231
1
0
19 Mar 2025
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Kevin Wang
Ishaan Javali
Michał Bortkiewicz
Tomasz Trzciński
Benjamin Eysenbach
OffRL
SSL
571
11
0
19 Mar 2025
Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation
IEEE International Conference on Robotics and Automation (ICRA), 2025
Jianqi Gao
Xizheng Pang
Qi Liu
Yanjie Li
296
1
0
15 Mar 2025
LUMOS: Language-Conditioned Imitation Learning with World Models
IEEE International Conference on Robotics and Automation (ICRA), 2025
Iman Nematollahi
Branton DeMoss
Akshay L Chandra
Nick Hawes
Wolfram Burgard
Ingmar Posner
OffRL
218
7
0
13 Mar 2025
DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models
Ruizhe Chen
Wenhao Chai
Zhifei Yang
Xiaotian Zhang
Qiufeng Wang
Tony Q.S. Quek
Soujanya Poria
Zuozhu Liu
539
3
0
06 Mar 2025
Causality-Based Reinforcement Learning Method for Multi-Stage Robotic Tasks
Jiechao Deng
Ning Tan
267
0
0
05 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
662
14
0
05 Mar 2025
ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment
Shaofei Cai
Zhancun Mu
Hoang Trung-Dung
Yitao Liang
296
8
0
04 Mar 2025
Variable-Friction In-Hand Manipulation for Arbitrary Objects via Diffusion-Based Imitation Learning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Qiyang Yan
Zihan Ding
Xin Zhou
Adam J. Spiers
239
2
0
04 Mar 2025
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
Ziyan Wang
Zhicheng Zhang
Fei Fang
Yali Du
460
7
0
03 Mar 2025
Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference
Wenjie Qiu
Yi-Chen Li
Xuqin Zhang
Tianyi Zhang
Yiming Zhang
Zongzhang Zhang
Yang Yu
ALM
440
2
0
01 Mar 2025
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
572
9
0
24 Feb 2025
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
IEEE Systems Conference (SysCon), 2025
Jefferson Silveira
Joshua A. Marshall
Sidney N. Givigi Jr
292
1
0
24 Feb 2025
Theoretical Barriers in Bellman-Based Reinforcement Learning
Brieuc Pinon
Raphaël Jungers
Jean-Charles Delvenne
125
0
0
17 Feb 2025
Dynamic Reinforcement Learning for Actors
Neural Networks (NN), 2025
Katsunari Shibata
AI4CE
121
0
0
17 Feb 2025
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Vivek Myers
Bill Chunyuan Zheng
Anca Dragan
Kuan Fang
Sergey Levine
498
6
0
08 Feb 2025
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning
Kaixi Bao
Chenhao Li
Yarden As
Andreas Krause
Marco Hutter
OffRL
CLL
605
3
0
03 Feb 2025
Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning
Federico Malato
Ville Hautamaki
295
1
0
03 Feb 2025
Upside Down Reinforcement Learning with Policy Generators
Jacopo Di Ventura
Dylan R. Ashley
Vincent Herrmann
Francesco Faccio
Jürgen Schmidhuber
237
1
0
27 Jan 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Jianfeng Dong
Wenjun Zeng
OffRL
903
0
0
22 Jan 2025
Pareto Set Learning for Multi-Objective Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2025
Erlong Liu
Yu-Chang Wu
Xiaobin Huang
Chengrui Gao
Ren-Jian Wang
Ke Xue
Chao Qian
OffRL
636
10
0
12 Jan 2025
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Yueqin Yin
Shentao Yang
Yujia Xie
Ziyi Yang
Yuting Sun
Hany Awadalla
Weizhu Chen
Mingyuan Zhou
327
5
0
07 Jan 2025
Attribute-Based Robotic Grasping with Data-Efficient Adaptation
IEEE Transactions on Robotics (IEEE TRO), 2025
Yang Yang
Houjian Yu
Xibai Lou
Yuanhao Liu
Changhyun Choi
409
22
0
04 Jan 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
OffRL
413
0
0
03 Jan 2025
Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning
Anthony Kobanda
Rémy Portelas
Odalric-Ambrym Maillard
Ludovic Denoyer
OffRL
CLL
677
2
0
19 Dec 2024
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
355
2
0
16 Dec 2024
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Conference on Learning for Dynamics & Control (L4DC), 2024
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep Chinchali
Ufuk Topcu
OffRL
434
1
0
02 Dec 2024
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
Communications in Nonlinear Science & Numerical Simulation (CNSNS), 2024
Egor E. Nuzhin
Nikolai V. Brilliantov
215
4
0
21 Nov 2024
Precision-Focused Reinforcement Learning Model for Robotic Object Pushing
Lara Bergmann
David P. Leins
R. Haschke
Klaus Neumann
249
7
0
13 Nov 2024
Pre-trained Visual Dynamics Representations for Efficient Policy Learning
European Conference on Computer Vision (ECCV), 2024
Hao Luo
Bohan Zhou
Zongqing Lu
267
4
0
05 Nov 2024
Formal Theorem Proving by Rewarding LLMs to Decompose Proofs Hierarchically
Kefan Dong
Arvind V. Mahankali
Tengyu Ma
ReLM
LRM
300
11
0
04 Nov 2024
Learning World Models for Unconstrained Goal Navigation
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Wensen Mao
He Zhu
242
7
0
03 Nov 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Guofeng Cui
He Zhu
OffRL
380
1
0
03 Nov 2024