1707.01495
Cited By
v1
v2
v3 (latest)
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Papers citing
"Hindsight Experience Replay"
50 / 1,340 papers shown
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2022
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
268
59
0
08 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Neural Information Processing Systems (NeurIPS), 2022
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
423
307
0
03 Feb 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
515
75
0
03 Feb 2022
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats
David Brandfonbrener
Hao Liu
Michael Laskin
Pieter Abbeel
A. Lazaric
Lerrel Pinto
OffRL
OnRL
222
105
0
31 Jan 2022
Contrastive Learning from Demonstrations
International Conference on Robotic Computing (IRC), 2022
André Rosa de Sousa Porfírio Correia
L. A. Alexandre
SSL
228
2
0
30 Jan 2022
The Challenges of Exploration for Offline Reinforcement Learning
Nathan Lambert
Markus Wulfmeier
William F. Whitney
Arunkumar Byravan
Michael Bloesch
Vibhavari Dasagi
Tim Hertweck
Martin Riedmiller
OffRL
214
32
0
27 Jan 2022
State-Conditioned Adversarial Subgoal Generation
AAAI Conference on Artificial Intelligence (AAAI), 2022
V. Wang
Joni Pajarinen
Tinghuai Wang
Joni-Kristian Kämäräinen
306
15
0
24 Jan 2022
Pearl: Parallel Evolutionary and Reinforcement Learning Library
Rohan Tangri
Danilo P. Mandic
A. Constantinides
134
3
0
24 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Minghuan Liu
Menghui Zhu
Weinan Zhang
356
185
0
20 Jan 2022
Reinforcement Learning based Air Combat Maneuver Generation
Muhammed Murat Özbek
E. Koyuncu
53
5
0
14 Jan 2022
Automated Reinforcement Learning: An Overview
Reza Refaei Afshar
Yingqian Zhang
Joaquin Vanschoren
U. Kaymak
OffRL
392
18
0
13 Jan 2022
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics
Swagat Kumar
Hayden Sampson
Ardhendu Behera
136
0
0
11 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Journal of Artificial Intelligence Research (JAIR), 2022
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Katharina Eggensperger
Marius Lindauer
AI4CE
381
126
0
11 Jan 2022
STIR²: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Jesús Bujalance Martín
Fabien Moutarde
OffRL
197
2
0
11 Jan 2022
Integrating Artificial Intelligence and Augmented Reality in Robotic Surgery: An Initial dVRK Study Using a Surgical Education Scenario
International Symposium on Medical Robotics (ISMR), 2022
Yonghao Long
Jianfeng Cao
Anton Deguet
Russell H. Taylor
Qi Dou
367
24
0
02 Jan 2022
Multiagent Model-based Credit Assignment for Continuous Control
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Dongge Han
Chris Xiaoxuan Lu
Tomasz P. Michalak
Michael Wooldridge
115
9
0
27 Dec 2021
Off Environment Evaluation Using Convex Risk Minimization
IEEE International Conference on Robotics and Automation (ICRA), 2021
Pulkit Katdare
Shuijing Liu
Katherine Driggs-Campbell
110
2
0
21 Dec 2021
Proving Theorems using Incremental Learning and Hindsight Experience Replay
International Conference on Machine Learning (ICML), 2021
Eser Aygun
Laurent Orseau
Ankit Anand
Xavier Glorot
Vlad Firoiu
Lei M. Zhang
Doina Precup
Shibl Mourad
CLL
LRM
248
21
0
20 Dec 2021
Replay For Safety
Liran Szlak
Ohad Shamir
OffRL
118
0
0
08 Dec 2021
CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
IEEE Robotics and Automation Letters (RA-L), 2021
Oier Mees
Lukás Hermann
Erick Rosete-Beas
Wolfram Burgard
LM&Ro
539
431
0
06 Dec 2021
Hierarchical Reinforcement Learning with Timed Subgoals
Neural Information Processing Systems (NeurIPS), 2021
Nico Gürtler
Le Chen
Georg Martius
266
30
0
06 Dec 2021
Flexible-Joint Manipulator Trajectory Tracking with Learned Two-Stage Model employing One-Step Future Prediction
International Conference on Robotic Computing (IRC), 2021
D. Pavlichenko
Sven Behnke
140
1
0
06 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
183
22
0
02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
228
19
0
01 Dec 2021
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
International Conference on Learning Representations (ICLR), 2021
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
344
43
0
26 Nov 2021
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
310
4
0
23 Nov 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
International Conference on Learning Representations (ICLR), 2021
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
261
118
0
19 Nov 2021
Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning
Christopher Hoang
Sungryull Sohn
Jongwook Choi
Wilka Carvalho
Honglak Lee
203
39
0
18 Nov 2021
Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics
Neural Information Processing Systems (NeurIPS), 2021
Ingmar Schubert
Danny Driess
Ozgur S. Oguz
Marc Toussaint
OffRL
147
1
0
15 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
105
1
0
12 Nov 2021
One model Packs Thousands of Items with Recurrent Conditional Query Learning
Knowledge-Based Systems (KBS), 2021
Dongda Li
Zhaoquan Gu
Yuexuan Wang
Changwei Ren
F. Lau
210
21
0
12 Nov 2021
Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation
Conference on Robot Learning (CoRL), 2021
Isabella Liu
Shagun Uppal
Gaurav Sukhatme
Joseph J. Lim
Péter Englert
Youngwoon Lee
139
13
0
11 Nov 2021
Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments
Eivind Bøhn
E. M. Coates
D. Reinhardt
T. Johansen
163
48
0
07 Nov 2021
Automatic Goal Generation using Dynamical Distance Learning
Bharat Prakash
Nicholas R. Waytowich
T. Mohsenin
Tim Oates
117
2
0
07 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
International Conference on Learning Representations (ICLR), 2021
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
237
48
0
04 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Wenlong Huang
Igor Mordatch
Pieter Abbeel
Deepak Pathak
281
70
0
04 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning
American Control Conference (ACC), 2021
Sindre Benjamin Remman
Inga Strümke
A. Lekkas
CML
171
11
0
04 Nov 2021
Autonomous Attack Mitigation for Industrial Control Systems
John Mern
Kyle Hatch
Ryan Silva
Cameron Hickert
Tamim I. Sookoor
Mykel J. Kochenderfer
AAML
134
11
0
03 Nov 2021
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Evolutionary Computation (Evol. Comput.), 2021
Giuseppe Paolo
Alexandre Coninx
Alban Laflaquière
Stéphane Doncieux
172
6
0
02 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2021
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
113
15
0
02 Nov 2021
Robot Learning from Randomized Simulations: A Review
Frontiers in Robotics and AI (Front. Robot. AI), 2021
Fabio Muratore
Fabio Ramos
Greg Turk
Wenhao Yu
Michael Gienger
Jan Peters
AI4CE
325
112
0
01 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
452
22
0
30 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
IEEE Access (IEEE Access), 2021
Tung M. Luu
Chang D. Yoo
167
12
0
28 Oct 2021
Similarity-Aware Skill Reproduction based on Multi-Representational Learning from Demonstration
International Conference on Advanced Robotics (ICAR), 2021
Brendan Hertel
S. Ahmadzadeh
145
8
0
28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
179
6
0
27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
368
22
0
27 Oct 2021
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Beining Han
Chongyi Zheng
Harris Chan
Keiran Paster
Michael Ruogu Zhang
Jimmy Ba
OOD
AI4CE
303
16
0
27 Oct 2021
Learning Diverse Policies in MOBA Games via Macro-Goals
Yiming Gao
Bei Shi
Xueying Du
Liang Wang
Guangwei Chen
...
Weixuan Wang
Deheng Ye
Qiang Fu
Wei Yang
Lanxiao Huang
169
14
0
27 Oct 2021
Multitask Adaptation by Retrospective Exploration with Learned World Models
Artem Zholus
Aleksandr I. Panov
CLL
117
0
0
25 Oct 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Kibeom Kim
Min Whoo Lee
Yoonsung Kim
Je-hwan Ryu
Minsu Lee
Byoung-Tak Zhang
187
9
0
25 Oct 2021
Page 15 of 27