Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.01495
Cited By
Hindsight Experience Replay
5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hindsight Experience Replay"
50 / 1,245 papers shown
Title
On Divergence Measures for Bayesian Pseudocoresets
Balhae Kim
J. Choi
Seanie Lee
Yoonho Lee
Jung-Woo Ha
Juho Lee
DD
21
11
0
12 Oct 2022
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation
Lingfeng Tao
Jiucai Zhang
Michael Bowman
Xiaoli Zhang
46
5
0
11 Oct 2022
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials
Aviral Kumar
Anika Singh
F. Ebert
Mitsuhiko Nakamoto
Yanlai Yang
Chelsea Finn
Sergey Levine
OffRL
OnRL
131
66
0
11 Oct 2022
DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning
Seungjae Lee
Jigang Kim
Inkyu Jang
H. J. Kim
OffRL
35
10
0
11 Oct 2022
The Role of Exploration for Task Transfer in Reinforcement Learning
Jonathan C. Balloch
Julia Kim
Jessica B. Langebrake Inman
Mark O. Riedl
OffRL
36
3
0
11 Oct 2022
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
Tianli Ding
L. Graesser
Saminda Abeyruwan
David B. DÁmbrosio
Anish Shankar
P. Sermanet
Pannag R Sanketi
Corey Lynch
59
21
0
07 Oct 2022
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation
Julian Alverio
Boris Katz
Andrei Barbu
42
0
0
05 Oct 2022
Neuro-Planner: A 3D Visual Navigation Method for MAV with Depth Camera based on Neuromorphic Reinforcement Learning
Junjie Jiang
Delei Kong
Kuanxu Hou
Xinjie Huang
Zhuang Hao
Zheng Fang
37
9
0
05 Oct 2022
DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Daochen Zha
Louis Feng
Qiaoyu Tan
Zirui Liu
Kwei-Herng Lai
Bhargav Bhushanam
Yuandong Tian
A. Kejariwal
Xia Hu
LMTD
OffRL
33
28
0
05 Oct 2022
Grounding Language with Visual Affordances over Unstructured Data
Oier Mees
Jessica Borja-Diaz
Wolfram Burgard
LM&Ro
121
109
0
04 Oct 2022
Predictive Event Segmentation and Representation with Neural Networks: A Self-Supervised Model Assessed by Psychological Experiments
Hamit Basgol
I. Ayhan
Emre Ugur
58
1
0
04 Oct 2022
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control
Murad Dawood
Nils Dengler
Jorge de Heuvel
Maren Bennewitz
31
9
0
04 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
35
240
0
03 Oct 2022
Hierarchical reinforcement learning for in-hand robotic manipulation using Davenport chained rotations
Francisco Roldan Sanchez
Qiang-qiang Wang
David Córdova Bulens
Kevin McGuinness
Stephen J. Redmond
Noel E. O'Connor
23
1
0
03 Oct 2022
Efficiently Learning Small Policies for Locomotion and Manipulation
Shashank Hegde
Gaurav Sukhatme
40
3
0
30 Sep 2022
Multi-Task Option Learning and Discovery for Stochastic Path Planning
Naman Shah
Siddharth Srivastava
26
2
0
30 Sep 2022
VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
Yecheng Jason Ma
Shagun Sodhani
Dinesh Jayaraman
Osbert Bastani
Vikash Kumar
Amy Zhang
SSL
OffRL
38
289
0
30 Sep 2022
Does Zero-Shot Reinforcement Learning Exist?
Ahmed Touati
Jérémy Rapin
Yann Ollivier
OffRL
42
39
0
29 Sep 2022
Accelerating Laboratory Automation Through Robot Skill Learning For Sample Scraping
Gabriella Pizzuto
Hetong Wang
Hatem Fakhruldeen
Bei Peng
K. Luck
Andrew I. Cooper
30
2
0
29 Sep 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
22
1
0
26 Sep 2022
Overcoming Referential Ambiguity in Language-Guided Goal-Conditioned Reinforcement Learning
Hugo Caselles-Dupré
Olivier Sigaud
Mohamed Chetouani
27
2
0
26 Sep 2022
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Kang Xu
Yan Ma
Wei Li
48
0
0
23 Sep 2022
Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning
Abraham George
Alison Bartsch
A. Farimani
OffRL
24
5
0
22 Sep 2022
Goal-Aware Generative Adversarial Imitation Learning from Imperfect Demonstration for Robotic Cloth Manipulation
Yoshihisa Tsurumine
Takamitsu Matsubara
40
13
0
21 Sep 2022
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems
Alexander Ororbia
A. Mali
40
7
0
19 Sep 2022
Latent Plans for Task-Agnostic Offline Reinforcement Learning
Erick Rosete-Beas
Oier Mees
Gabriel Kalweit
Joschka Boedecker
Wolfram Burgard
OffRL
51
81
0
19 Sep 2022
Towards advanced robotic manipulation
Francisco Roldan Sanchez
Stephen J. Redmond
Kevin McGuinness
Noel E. O'Connor
30
1
0
19 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
45
35
0
19 Sep 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Yue Liu
Ding Zhao
79
45
0
16 Sep 2022
Neuromuscular Reinforcement Learning to Actuate Human Limbs through FES
Nat Wannawas
A. Shafti
Aldo A. Faisal
OffRL
21
9
0
16 Sep 2022
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
52
28
0
15 Sep 2022
Causal Coupled Mechanisms: A Control Method with Cooperation and Competition for Complex System
Xuehui Yu
Jingchi Jiang
Xinmiao Yu
Yi Guan
Xue Li
16
0
0
15 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL
OnRL
47
10
0
15 Sep 2022
Meta-Reinforcement Learning via Language Instructions
Zhenshan Bing
A. Koch
Xiangtong Yao
Kai-Qi Huang
Alois C. Knoll
LM&Ro
65
19
0
11 Sep 2022
Generalization in Neural Networks: A Broad Survey
Chris Rohlfs
OOD
AI4CE
21
6
0
04 Sep 2022
Cell-Free Latent Go-Explore
Quentin Gallouedec
Emmanuel Dellandrea
26
1
0
31 Aug 2022
Beyond Supervised Continual Learning: a Review
Benedikt Bagus
A. Gepperth
Timothée Lesort
BDL
CLL
37
10
0
30 Aug 2022
Goal-Conditioned Q-Learning as Knowledge Distillation
Alexander Levine
S. Feizi
OffRL
27
2
0
28 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
42
27
0
19 Aug 2022
Intelligent problem-solving as integrated hierarchical reinforcement learning
Manfred Eppe
Christian Gumbsch
Matthias Kerzel
Phuong D. H. Nguyen
Martin Volker Butz
S. Wermter
31
75
0
18 Aug 2022
Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
B. Liu
Yihao Feng
Qian Liu
Peter Stone
19
3
0
17 Aug 2022
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
T. Basaklar
S. Gumussoy
Ümit Y. Ogras
24
39
0
16 Aug 2022
Learning Shape Control of Elastoplastic Deformable Linear Objects
Rita Laezza
Y. Karayiannidis
AI4CE
29
22
0
03 Aug 2022
Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Tasks with Sparse Rewards
Yongle Luo
Yuxin Wang
Kun Dong
Qiaosheng Zhang
Erkang Cheng
Zhiyong Sun
Bo Song
28
18
0
01 Aug 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
44
2
0
31 Jul 2022
Learning Dynamic Manipulation Skills from Haptic-Play
Taeyoon Lee
D. Sung
Kyoung-Whan Choi
Choong-Keun Lee
Changwoo Park
Keunjun Choi
53
3
0
28 Jul 2022
Graph-Structured Policy Learning for Multi-Goal Manipulation Tasks
David Klee
Ondrej Biza
Robert Platt
OffRL
32
1
0
22 Jul 2022
Learning to Solve Soft-Constrained Vehicle Routing Problems with Lagrangian Relaxation
Qiaoyue Tang
Yangzhe Kong
Lemeng Pan
Choon-woo Lee
35
3
0
20 Jul 2022
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
32
166
0
19 Jul 2022
Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning
Xintong Yang
Ze Ji
Jing Wu
Yunyu Lai
OffRL
35
5
0
19 Jul 2022
Previous
1
2
3
...
9
10
11
...
23
24
25
Next