arXiv: 1707.01495
Hindsight Experience Replay


5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL

Papers citing "Hindsight Experience Replay"

50 / 1,245 papers shown
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints Representation
Tianying Wang
En Yen Puang
Marcus Lee
Yongpeng Wu
Wei Jing
SSL
38
5
0
12 Feb 2022
Online Decision Transformer
Qinqing Zheng
Amy Zhang
Aditya Grover
OffRL
27
204
0
11 Feb 2022
Help Me Explore: Minimal Social Interventions for Graph-Based Autotelic Agents
Ahmed Akakzia
Olivier Serris
Olivier Sigaud
Cédric Colas
21
6
0
10 Feb 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
51
67
0
09 Feb 2022
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
38
50
0
08 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
50
250
0
03 Feb 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
37
61
0
03 Feb 2022
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats
David Brandfonbrener
Hao Liu
Michael Laskin
Pieter Abbeel
A. Lazaric
Lerrel Pinto
OffRL
OnRL
32
84
0
31 Jan 2022
Contrastive Learning from Demonstrations
André Rosa de Sousa Porfírio Correia
L. A. Alexandre
SSL
31
2
0
30 Jan 2022
The Challenges of Exploration for Offline Reinforcement Learning
Nathan Lambert
Markus Wulfmeier
William F. Whitney
Arunkumar Byravan
Michael Bloesch
Vibhavari Dasagi
Tim Hertweck
Martin Riedmiller
OffRL
33
27
0
27 Jan 2022
State-Conditioned Adversarial Subgoal Generation
V. Wang
Joni Pajarinen
Tinghuai Wang
Joni-Kristian Kämäräinen
55
11
0
24 Jan 2022
Pearl: Parallel Evolutionary and Reinforcement Learning Library
Rohan Tangri
Danilo P. Mandic
A. Constantinides
11
2
0
24 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
40
133
0
20 Jan 2022
Reinforcement Learning based Air Combat Maneuver Generation
Muhammed Murat Özbek
E. Koyuncu
11
4
0
14 Jan 2022
Automated Reinforcement Learning: An Overview
Reza Refaei Afshar
Yingqian Zhang
Joaquin Vanschoren
U. Kaymak
OffRL
36
16
0
13 Jan 2022
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics
Swagat Kumar
Hayden Sampson
Ardhendu Behera
19
0
0
11 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
38
100
0
11 Jan 2022
STIR$^2$: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks
Jesús Bujalance Martín
Fabien Moutarde
OffRL
33
2
0
11 Jan 2022
Integrating Artificial Intelligence and Augmented Reality in Robotic Surgery: An Initial dVRK Study Using a Surgical Education Scenario
Yonghao Long
Jianfeng Cao
Anton Deguet
Russell H. Taylor
Qi Dou
49
21
0
02 Jan 2022
Multiagent Model-based Credit Assignment for Continuous Control
Dongge Han
Chris Xiaoxuan Lu
Tomasz P. Michalak
Michael Wooldridge
27
5
0
27 Dec 2021
Off Environment Evaluation Using Convex Risk Minimization
Pulkit Katdare
Shuijing Liu
Katherine Driggs-Campbell
18
2
0
21 Dec 2021
Proving Theorems using Incremental Learning and Hindsight Experience Replay
Eser Aygun
Laurent Orseau
Ankit Anand
Xavier Glorot
Vlad Firoiu
Lei M. Zhang
Doina Precup
Shibl Mourad
CLL
LRM
29
17
0
20 Dec 2021
Replay For Safety
Liran Szlak
Ohad Shamir
OffRL
16
0
0
08 Dec 2021
CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Oier Mees
Lukás Hermann
Erick Rosete-Beas
Wolfram Burgard
LM&Ro
36
243
0
06 Dec 2021
Hierarchical Reinforcement Learning with Timed Subgoals
Nico Gürtler
Le Chen
Georg Martius
59
22
0
06 Dec 2021
Flexible-Joint Manipulator Trajectory Tracking with Learned Two-Stage Model employing One-Step Future Prediction
D. Pavlichenko
Sven Behnke
17
1
0
06 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
29
18
0
02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
18
18
0
01 Dec 2021
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
29
36
0
26 Nov 2021
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
33
2
0
23 Nov 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
21
99
0
19 Nov 2021
Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning
Christopher Hoang
Sungryull Sohn
Jongwook Choi
Wilka Carvalho
Honglak Lee
23
29
0
18 Nov 2021
Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics
Ingmar Schubert
Danny Driess
Ozgur S. Oguz
Marc Toussaint
OffRL
24
1
0
15 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
21
1
0
12 Nov 2021
One model Packs Thousands of Items with Recurrent Conditional Query Learning
Dongda Li
Zhaoquan Gu
Yuexuan Wang
Changwei Ren
F. Lau
27
17
0
12 Nov 2021
Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation
Isabella Liu
Shagun Uppal
Gaurav Sukhatme
Joseph J. Lim
Péter Englert
Youngwoon Lee
30
12
0
11 Nov 2021
Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments
Eivind Bøhn
E. M. Coates
D. Reinhardt
T. Johansen
30
27
0
07 Nov 2021
Automatic Goal Generation using Dynamical Distance Learning
Bharat Prakash
Nicholas R. Waytowich
T. Mohsenin
Tim Oates
27
2
0
07 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
37
41
0
04 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Wenlong Huang
Igor Mordatch
Pieter Abbeel
Deepak Pathak
45
63
0
04 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning
Sindre Benjamin Remman
Inga Strümke
A. Lekkas
CML
19
7
0
04 Nov 2021
Autonomous Attack Mitigation for Industrial Control Systems
John Mern
Kyle Hatch
Ryan Silva
Cameron Hickert
Tamim I. Sookoor
Mykel J. Kochenderfer
AAML
11
7
0
03 Nov 2021
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Giuseppe Paolo
Alexandre Coninx
Alban Laflaquière
Stéphane Doncieux
22
3
0
02 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman Serdar Kozat
OffRL
33
11
0
02 Nov 2021
Robot Learning from Randomized Simulations: A Review
Fabio Muratore
Fabio Ramos
Greg Turk
Wenhao Yu
Michael Gienger
Jan Peters
AI4CE
18
80
0
01 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
36
17
0
30 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
23
8
0
28 Oct 2021
Similarity-Aware Skill Reproduction based on Multi-Representational Learning from Demonstration
Brendan Hertel
S. Ahmadzadeh
25
8
0
28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
27
5
0
27 Oct 2021