Hindsight Experience Replay


5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL

Papers citing "Hindsight Experience Replay"

50 / 1,242 papers shown
Universal Policies to Learn Them All
Hassam Sheikh
Ladislau Bölöni
OffRL
11
1
0
24 Aug 2019
Reinforcement Learning in Healthcare: A Survey
Chao Yu
Jiming Liu
S. Nemati
LM&MA
OffRL
19
549
0
22 Aug 2019
A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation
Runzhe Yang
Xingyuan Sun
Karthik Narasimhan
18
247
0
21 Aug 2019
A survey on intrinsic motivation in reinforcement learning
A. Aubret
L. Matignon
S. Hassas
AI4CE
18
143
0
19 Aug 2019
Mapping State Space using Landmarks for Universal Goal Reaching
Zhiao Huang
Fangchen Liu
Hao Su
14
66
0
15 Aug 2019
A review on Deep Reinforcement Learning for Fluid Mechanics
Paul Garnier
J. Viquerat
Jean Rabault
A. Larcher
A. Kuhnle
E. Hachem
AI4CE
18
253
0
12 Aug 2019
Developing a Simple Model for Sand-Tool Interaction and Autonomously Shaping Sand
Wooshik Kim
Catherine Pavlov
Aaron M. Johnson
9
6
0
07 Aug 2019
Learning to combine primitive skills: A step towards versatile robotic manipulation
Robin Strudel
Alexander Pashevich
Igor Kalevatykh
Ivan Laptev
Josef Sivic
Cordelia Schmid
19
4
0
02 Aug 2019
Hindsight Trust Region Policy Optimization
Hanbo Zhang
Site Bai
Xuguang Lan
David Hsu
Nanning Zheng
30
8
0
29 Jul 2019
Learning to Solve a Rubik's Cube with a Dexterous Hand
Tingguang Li
Weitao Xi
Meng Fang
Jia Xu
Max Q.-H. Meng
6
11
0
26 Jul 2019
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards
Yijie Guo
Jongwook Choi
Marcin Moczulski
Shengyu Feng
Samy Bengio
Mohammad Norouzi
Honglak Lee
17
10
0
24 Jul 2019
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery
Kristian Hartikainen
Xinyang Geng
Tuomas Haarnoja
Sergey Levine
SSL
38
74
0
18 Jul 2019
Composing Diverse Policies for Temporally Extended Tasks
Daniel Angelov
Yordan V. Hristov
Michael G. Burke
S. Ramamoorthy
14
18
0
18 Jul 2019
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning
Johan Ferret
Raphaël Marinier
M. Geist
Olivier Pietquin
OffRL
21
6
0
18 Jul 2019
Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation
Xiaowei Xing
D. Chang
11
6
0
16 Jul 2019
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Yuping Luo
Huazhe Xu
Tengyu Ma
SSL
18
13
0
12 Jul 2019
DisCoRL: Continual Reinforcement Learning via Policy Distillation
Kalifou René Traoré
Hugo Caselles-Dupré
Timothée Lesort
Te Sun
Guanghang Cai
Natalia Díaz Rodríguez
David Filliat
OffRL
32
60
0
11 Jul 2019
Assessing Transferability from Simulation to Reality for Reinforcement Learning
Fabio Muratore
Michael Gienger
Jan Peters
19
61
0
10 Jul 2019
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
Oliver Kroemer
S. Niekum
George Konidaris
33
356
0
06 Jul 2019
Learning a Behavioral Repertoire from Demonstrations
Niels Justesen
Miguel González Duque
Daniel Cabarcas Jaramillo
Jean-Baptiste Mouret
S. Risi
OffRL
14
2
0
05 Jul 2019
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning
Srinivas Venkattaramanujam
Eric Crawford
T. Doan
Doina Precup
OffRL
SSL
13
24
0
05 Jul 2019
On the Weaknesses of Reinforcement Learning for Neural Machine Translation
Leshem Choshen
Lior Fox
Zohar Aizenbud
Omri Abend
13
104
0
03 Jul 2019
Dynamics-Aware Unsupervised Discovery of Skills
Archit Sharma
S. Gu
Sergey Levine
Vikash Kumar
Karol Hausman
14
398
0
02 Jul 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
17
336
0
30 Jun 2019
Compositional Transfer in Hierarchical Reinforcement Learning
Markus Wulfmeier
A. Abdolmaleki
Roland Hafner
Jost Tobias Springenberg
Michael Neunert
Tim Hertweck
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
19
27
0
26 Jun 2019
Proximal Distilled Evolutionary Reinforcement Learning
Cristian Bodnar
Ben Day
Pietro Liò
30
71
0
24 Jun 2019
Neural networks with motivation
Sergey A. Shuvaev
Ngoc B. Tran
Marcus Stephenson-Jones
Bo-wen Li
A. Koulakov
9
8
0
23 Jun 2019
Placeto: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning
Ravichandra Addanki
S. Venkatakrishnan
Shreyan Gupta
Hongzi Mao
Mohammad Alizadeh
OOD
OffRL
20
66
0
20 Jun 2019
Experience Replay Optimization
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
OffRL
9
102
0
19 Jun 2019
Reward Prediction Error as an Exploration Objective in Deep RL
Riley Simmons-Edler
Ben Eisner
Daniel Yang
Anthony Bisulco
E. Mitchell
Sebastian Seung
Daniel D. Lee
15
5
0
19 Jun 2019
Adapting Behaviour via Intrinsic Reward: A Survey and Empirical Study
Cam Linke
Nadia M. Ady
Martha White
T. Degris
Adam White
10
16
0
19 Jun 2019
Directed Exploration for Reinforcement Learning
Z. Guo
Emma Brunskill
16
11
0
18 Jun 2019
Language as an Abstraction for Hierarchical Deep Reinforcement Learning
Yiding Jiang
S. Gu
Kevin Patrick Murphy
Chelsea Finn
OffRL
18
222
0
18 Jun 2019
LPaintB: Learning to Paint from Self-Supervision
Biao Jia
Jonathan Brandt
R. Měch
Byungmoon Kim
Tianyi Zhou
SSL
11
12
0
17 Jun 2019
Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards
Gerrit Schoettler
Ashvin Nair
Jianlan Luo
Shikhar Bahl
J. A. Ojea
Eugen Solowjow
Sergey Levine
OffRL
18
190
0
13 Jun 2019
Goal-conditioned Imitation Learning
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
22
219
0
13 Jun 2019
Sub-Goal Trees -- a Framework for Goal-Directed Trajectory Prediction and Optimization
Tom Jurgenson
E. Groshev
Aviv Tamar
14
7
0
12 Jun 2019
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
26
284
0
12 Jun 2019
Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer
Kalifou René Traoré
Hugo Caselles-Dupré
Timothée Lesort
Te Sun
Natalia Díaz Rodríguez
David Filliat
CLL
OffRL
20
44
0
11 Jun 2019
Learning Powerful Policies by Using Consistent Dynamics Model
Shagun Sodhani
Anirudh Goyal
T. Deleu
Yoshua Bengio
Sergey Levine
Jian Tang
OffRL
11
5
0
11 Jun 2019
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
29
85
0
10 Jun 2019
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Che Wang
Keith Ross
16
45
0
10 Jun 2019
Curiosity-Driven Multi-Criteria Hindsight Experience Replay
John Lanier
Stephen Marcus McAleer
Pierre Baldi
OffRL
8
25
0
09 Jun 2019
An Extensible Interactive Interface for Agent Design
Matthew Rahtz
James Fang
Anca Dragan
Dylan Hadfield-Menell
11
1
0
06 Jun 2019
BayesSim: adaptive domain randomization via probabilistic inference for robotics simulators
F. Ramos
Rafael Possas
Dieter Fox
6
155
0
04 Jun 2019
Options as responses: Grounding behavioural hierarchies in multi-agent RL
A. Vezhnevets
Yuhuai Wu
Rémi Leblond
Joel Z. Leibo
AI4CE
18
17
0
04 Jun 2019
Harnessing Reinforcement Learning for Neural Motion Planning
Tom Jurgenson
Aviv Tamar
OOD
9
63
0
01 Jun 2019
Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning
Yang Liu
Yunan Luo
Yuanyi Zhong
Xi Chen
Qiang Liu
Jian-wei Peng
14
35
0
31 May 2019
Safety Augmented Value Estimation from Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks
Brijen Thananjeyan
Ashwin Balakrishna
Ugo Rosolia
Felix Li
R. McAllister
Joseph E. Gonzalez
Sergey Levine
Francesco Borrelli
Ken Goldberg
OffRL
6
4
0
31 May 2019
Towards Finding Longer Proofs
Zsolt Zombori
Adrián Csiszárik
Henryk Michalewski
C. Kaliszyk
Josef Urban
OffRL
LRM
29
15
0
30 May 2019