1707.01495
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing
"Hindsight Experience Replay"
50 / 1,339 papers shown
Direct Preference Optimization for Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Derrik E. Asher
Anit Kumar Sahu
Mubarak Shah
Vinay P. Namboodiri
Amrit Singh Bedi
366
1
0
01 Nov 2024
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Beyazit Yalcinkaya
Niklas Lauffer
Marcell Vazquez-Chanlatte
Sanjit A. Seshia
AI4CE
485
13
0
31 Oct 2024
Maximum Entropy Hindsight Experience Replay
Douglas C. Crowder
Matthew L. Trappett
Darrien M. McKenzie
Frances S. Chance
101
0
0
31 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
452
4
0
27 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
International Conference on Learning Representations (ICLR), 2024
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
521
71
0
26 Oct 2024
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Neural Information Processing Systems (NeurIPS), 2024
Zizhao Wang
Jiaheng Hu
Caleb Chuck
Stephen Chen
Roberto Martín-Martín
Amy Zhang
S. Niekum
Peter Stone
OffRL
262
9
0
24 Oct 2024
Safe Load Balancing in Software-Defined-Networking
Computer Communications (Comput. Commun.), 2024
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
224
0
0
22 Oct 2024
Interpretable end-to-end Neurosymbolic Reinforcement Learning agents
Nils Grandien
Quentin Delfosse
Kristian Kersting
OffRL
429
5
0
18 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
IEEE International Conference on Robotics and Biomimetics (ROBIO), 2024
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
195
0
0
17 Oct 2024
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
Loris Gaven
Clément Romac
Thomas Carta
Sylvain Lamprier
Olivier Sigaud
Pierre-Yves Oudeyer
LLMAG
OffRL
156
5
0
16 Oct 2024
Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards
Grant C. Forbes
Leonardo Villalobos-Arias
Jianxun Wang
Arnav Jhala
David L. Roberts
255
2
0
16 Oct 2024
The State of Robot Motion Generation
Kostas E. Bekris
Joe H. Doerr
Patrick Meng
Sumanth Tangirala
3DV
325
3
0
16 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf
Marco Bagatella
Nico Gürtler
Jonas Frey
Georg Martius
OffRL
1.1K
3
0
11 Oct 2024
Effective Exploration Based on the Structural Information Principles
Neural Information Processing Systems (NeurIPS), 2024
Xianghua Zeng
Hao Peng
Angsheng Li
151
5
0
09 Oct 2024
Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation
IEEE-RAS International Conference on Humanoid Robots (Humanoids), 2024
Paul Jansonnie
Bingbing Wu
Julien Perez
Jan Peters
SSL
279
3
0
07 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
337
4
0
07 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
318
10
0
03 Oct 2024
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning
Alicia Li
Nishanth Kumar
Tomás Lozano-Pérez
Leslie Kaelbling
OffRL
241
1
0
28 Sep 2024
VertiSelector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain
Tong Xu
Chenhui Pan
Xuesu Xiao
607
3
0
26 Sep 2024
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
Neural Information Processing Systems (NeurIPS), 2024
Tianyue Ou
Frank F. Xu
Aman Madaan
J. Liu
Robert Lo
Abishek Sridhar
Sudipta Sengupta
Dan Roth
Graham Neubig
Shuyan Zhou
OffRL
246
30
0
24 Sep 2024
Autonomous Wheel Loader Navigation Using Goal-Conditioned Actor-Critic MPC
IEEE International Conference on Robotics and Automation (ICRA), 2024
Aleksi Mäki-Penttilä
Naeim Ebrahimi Toulkani
Reza Ghabcheloo
410
0
0
24 Sep 2024
R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models
IEEE International Conference on Robotics and Automation (ICRA), 2024
Viet Dung Nguyen
Zhizhuo Yang
Christopher L. Buckley
Alexander Ororbia
344
6
0
21 Sep 2024
Representing Positional Information in Generative World Models for Object Manipulation
Stefano Ferraro
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Sai Rajeswar
LM&Ro
OCL
242
1
0
18 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Conference on Robot Learning (CoRL), 2024
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
261
0
0
06 Sep 2024
Simplex-enabled Safe Continual Learning Machine
H. Cao
Y. Mao
Yihao Cai
L. Sha
Marco Caccamo
291
3
0
05 Sep 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
404
1
0
05 Sep 2024
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
411
0
0
04 Sep 2024
A Tighter Convergence Proof of Reverse Experience Replay
Nan Jiang
Jinzhao Li
Yexiang Xue
151
0
0
30 Aug 2024
Safe Policy Exploration Improvement via Subgoals
Brian Angulo
G. Gorbov
Aleksandr I. Panov
Konstantin Yakovlev
OffRL
159
0
0
25 Aug 2024
Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
Conference on Robot Learning (CoRL), 2024
Ria Doshi
Homer Walke
Oier Mees
Sudeep Dasari
Sergey Levine
379
98
0
21 Aug 2024
Online Behavior Modification for Expressive User Control of RL-Trained Robots
IEEE/ACM International Conference on Human-Robot Interaction (HRI), 2024
Isaac S. Sheidlower
Mavis Murdock
Emma Bethel
Reuben M. Aronson
E. Short
OffRL
292
3
0
15 Aug 2024
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
Neural Information Processing Systems (NeurIPS), 2024
Ying Fan
Jingling Li
Adith Swaminathan
Aditya Modi
Ching-An Cheng
OffRL
372
0
0
14 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
International Conference on Learning Representations (ICLR), 2024
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
391
9
0
11 Aug 2024
Contrast, Imitate, Adapt: Learning Robotic Skills From Raw Human Videos
IEEE Transactions on Automation Science and Engineering (T-ASE), 2024
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Hao Fu
Jinzhe Xue
Bin He
358
4
0
10 Aug 2024
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
243
0
0
07 Aug 2024
A Value Function Space Approach for Hierarchical Planning with Signal Temporal Logic Tasks
IEEE Control Systems Letters (L-CSS), 2024
Peiran Liu
Yiting He
Yihao Qin
Hang Zhou
Yiding Ji
OffRL
277
0
0
04 Aug 2024
Jacta: A Versatile Planner for Learning Dexterous and Whole-body Manipulation
Conference on Robot Learning (CoRL), 2024
Jan Brüdigam
Ali-Adeeb Abbas
Maks Sorokin
Kuan Fang
Brandon Hung
Maya Guru
Roland Toth
Jiuguang Wang
Sandra Hirche
Simon Le Cleac'h
209
7
0
02 Aug 2024
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Norman Di Palo
Leonard Hasenclever
Jan Humplik
Arunkumar Byravan
130
5
0
30 Jul 2024
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
252
27
0
30 Jul 2024
Gymnasium: A Standard Interface for Reinforcement Learning Environments
Mark Towers
Ariel Kwiatkowski
Jordan Terry
John U. Balis
Gianluca De Cola
...
Andrea Pierré
Sander Schulhoff
Jun Jet Tai
Hannah Tan
Omar G. Younis
AuLLM
OffRL
402
479
0
24 Jul 2024
WayEx: Waypoint Exploration using a Single Demonstration
Mara Levy
Nirat Saini
Abhinav Shrivastava
228
2
0
22 Jul 2024
Learning Goal-Conditioned Representations for Language Reward Models
Vaskar Nath
Dylan Slack
Jeff Da
Yuntao Ma
Hugh Zhang
Spencer Whitehead
Sean Hendryx
178
0
0
18 Jul 2024
Variable-Agnostic Causal Exploration for Reinforcement Learning
Minh Hoang Nguyen
Hung Le
Svetha Venkatesh
CML
243
3
0
17 Jul 2024
Investigating the Interplay of Prioritized Replay and Generalization
Parham Mohammad Panahi
Andrew Patterson
Martha White
Adam White
175
3
0
12 Jul 2024
TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Junik Bae
Kwanyoung Park
Youngwoon Lee
243
10
0
11 Jul 2024
Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Kevin Yu
Jihye Roh
Ziang Li
Wenhao Gao
Runzhong Wang
Connor W. Coley
326
19
0
08 Jul 2024
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm
Abdeslam Boularias
214
1
0
07 Jul 2024
Embracing Massive Medical Data
Yu-Cheng Chou
Zongwei Zhou
Alan Yuille
CLL
OOD
185
9
0
05 Jul 2024
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Chen-Xiao Gao
Shengjun Fang
Chenjun Xiao
Yang Yu
Zongzhang Zhang
OffRL
162
3
0
05 Jul 2024
EAGERx: Graph-Based Framework for Sim2real Robot Learning
B. V. D. Heijden
Jelle Luijkx
Laura Ferranti
Jens Kober
Robert Babuška
179
0
0
05 Jul 2024