1707.01495
Cited By
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing "Hindsight Experience Replay"
50 / 1,242 papers shown
Title
Personalized Path Recourse for Reinforcement Learning Agents
Dat Hong
Tong Wang
20
0
0
14 Dec 2023
Learning adaptive planning representations with natural language guidance
L. Wong
Jiayuan Mao
Pratyusha Sharma
Zachary S. Siegel
Jiahai Feng
Noa Korneev
Joshua B. Tenenbaum
Jacob Andreas
LM&Ro
31
21
0
13 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
37
1
0
12 Dec 2023
Synergizing Quality-Diversity with Descriptor-Conditioned Reinforcement Learning
Maxence Faldor
Félix Chalumeau
Manon Flageat
Antoine Cully
32
2
0
10 Dec 2023
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
27
1
0
10 Dec 2023
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
34
1
0
08 Dec 2023
PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play
Lili Chen
Shikhar Bahl
Deepak Pathak
22
41
0
07 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLM
OffRL
OnRL
39
6
0
06 Dec 2023
Diffused Task-Agnostic Milestone Planner
Mineui Hong
Minjae Kang
Songhwai Oh
21
6
0
06 Dec 2023
Understanding Representations Pretrained with Auxiliary Losses for Embodied Agent Planning
Samrudhdhi B. Rangrej
James J. Clark
SSL
37
0
0
06 Dec 2023
Contact Energy Based Hindsight Experience Prioritization
Erdi Sayar
Zhenshan Bing
Carlo D'Eramo
Ozgur S. Oguz
Alois Knoll
16
3
0
05 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
29
2
0
05 Dec 2023
Working Backwards: Learning to Place by Picking
Oliver Limoyo
Abhisek Konar
Trevor Ablett
Jonathan Kelly
F. Hogan
Gregory Dudek
25
0
0
04 Dec 2023
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse Catalysts Design
Romain Lacombe
Lucas Hendren
Khalid El-Awady
15
1
0
04 Dec 2023
Modular Control Architecture for Safe Marine Navigation: Reinforcement Learning and Predictive Safety Filters
Aksel Vaaler
Svein Jostein Husa
Daniel Menges
T. N. Larsen
Adil Rasheed
19
2
0
04 Dec 2023
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
13
0
0
29 Nov 2023
Goal-conditioned Offline Planning from Curious Exploration
Marco Bagatella
Georg Martius
OffRL
21
1
0
28 Nov 2023
Offline Skill Generalization via Task and Motion Planning
Shin Watanabe
Geir Horn
J. Tørresen
K. Ellefsen
OffRL
20
0
0
24 Nov 2023
Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework
Florian Felten
El-Ghazali Talbi
Grégoire Danoy
11
13
0
21 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
11
2
0
17 Nov 2023
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
40
1
0
09 Nov 2023
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for Deep Reinforcement Learning
Junmin Zhong
Ruofan Wu
Jennie Si
OffRL
11
1
0
07 Nov 2023
PcLast: Discovering Plannable Continuous Latent States
Anurag Koul
Shivakanth Sujit
Shaoru Chen
Ben Evans
Lili Wu
...
Yonathan Efroni
Lekan Molu
Miro Dudik
John Langford
Alex Lamb
OffRL
BDL
26
1
0
06 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
37
4
0
06 Nov 2023
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit S. Sikchi
Rohan Chitnis
Ahmed Touati
A. Geramifard
Amy Zhang
S. Niekum
OffRL
31
7
0
03 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
M. Gerstgrasser
Tom Danino
Sarah Keren
23
5
0
01 Nov 2023
Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback
Max Balsells
M. Torné
Zihan Wang
Samedh Desai
Pulkit Agrawal
Abhishek Gupta
42
10
0
31 Oct 2023
Learning to Discover Skills through Guidance
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Sejik Park
Kyushik Min
Jaegul Choo
47
6
0
31 Oct 2023
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
28
11
0
31 Oct 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
Mianchu Wang
Rui Yang
Xi Chen
Hao Sun
Meng Fang
Giovanni Montana
OffRL
36
9
0
30 Oct 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
41
7
0
30 Oct 2023
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
Zhaoyi Zhou
Chuning Zhu
Runlong Zhou
Qiwen Cui
Abhishek Gupta
S. S. Du
OffRL
40
8
0
30 Oct 2023
Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement
Daesol Cho
Seungjae Lee
H. J. Kim
OODD
29
0
0
30 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
34
6
0
28 Oct 2023
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
Nicholas Corrado
Yu-Tao Qu
John U. Balis
Adam Labiosa
Josiah P. Hanna
OffRL
35
2
0
27 Oct 2023
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
Nicholas Corrado
Josiah P. Hanna
29
4
0
26 Oct 2023
CQM: Curriculum Reinforcement Learning with a Quantized World Model
Seungjae Lee
Daesol Cho
Jonghae Park
H. J. Kim
28
6
0
26 Oct 2023
Learning Agility and Adaptive Legged Locomotion via Curricular Hindsight Reinforcement Learning
Sicen Li
Yiming Pang
Panju Bai
Zhaojin Liu
Jiawei Li
Shihao Hu
Liquan Wang
Gang Wang
27
2
0
24 Oct 2023
Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States
Zidan Wang
Takeru Oba
Takuma Yoneda
Rui Shen
Matthew R. Walter
Bradly C. Stadie
DiffM
32
9
0
21 Oct 2023
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu
Baolin Peng
Michel Galley
Jianfeng Gao
Zhou Yu
LRM
ReLM
35
19
0
20 Oct 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
Chao Li
Chen Gong
Qiang He
Xinwen Hou
30
0
0
17 Oct 2023
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Niket Tandon
Li Zhang
Chris Callison-Burch
Peter Clark
LRM
LLMAG
CLL
21
37
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
38
10
0
15 Oct 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
Shaofei Cai
Bowei Zhang
Zihao Wang
Xiaojian Ma
Guy Van den Broeck
Yitao Liang
83
26
0
12 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
115
122
0
10 Oct 2023
Human-Robot Gym: Benchmarking Reinforcement Learning in Human-Robot Collaboration
Jakob Thumm
Felix Trost
Matthias Althoff
OffRL
36
6
0
09 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&Ro
PINN
30
180
0
09 Oct 2023
Compositional Servoing by Recombining Demonstrations
Max Argus
Abhijeet Nayak
Martin Buchner
Silvio Galesso
Abhinav Valada
Thomas Brox
25
0
0
06 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison
Moritz Lange
Noah Krystiniak
Raphael C. Engelhardt
Wolfgang Konen
Laurenz Wiskott
OffRL
13
1
0
06 Oct 2023