ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,345 papers shown
Unsupervised Reinforcement Learning for Transferable Manipulation Skill
  Discovery
Unsupervised Reinforcement Learning for Transferable Manipulation Skill DiscoveryIEEE Robotics and Automation Letters (RA-L), 2022
Daesol Cho
Jigang Kim
H. J. Kim
OffRLSSL
241
20
0
29 Apr 2022
Bilinear value networks
Bilinear value networks
Zhang-Wei Hong
Ge Yang
Pulkit Agrawal
OffRL
354
10
0
28 Apr 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
351
38
0
27 Apr 2022
Relational Abstractions for Generalized Reinforcement Learning on
  Symbolic Problems
Relational Abstractions for Generalized Reinforcement Learning on Symbolic ProblemsInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Rushang Karia
Siddharth Srivastava
NAIOffRL
168
16
0
27 Apr 2022
Executive Function: A Contrastive Value Policy for Resampling and
  Relabeling Perceptions via Hindsight Summarization?
Executive Function: A Contrastive Value Policy for Resampling and Relabeling Perceptions via Hindsight Summarization?
Christopher T. Lengerich
Ben Lengerich
173
1
0
27 Apr 2022
Can Foundation Models Perform Zero-Shot Task Specification For Robot
  Manipulation?
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?Conference on Learning for Dynamics & Control (L4DC), 2022
Yuchen Cui
S. Niekum
Abhi Gupta
Vikash Kumar
Aravind Rajeswaran
LM&Ro
220
93
0
23 Apr 2022
Learning how to Interact with a Complex Interface using Hierarchical
  Reinforcement Learning
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Gheorghe Comanici
Amelia Glaese
Anita Gergely
Daniel Toyama
Zafarali Ahmed
Tyler Jackson
P. Hamel
Doina Precup
149
4
0
21 Apr 2022
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Charles Burton Snell
Mengjiao Yang
Justin Fu
Yi Su
Sergey Levine
286
30
0
18 Apr 2022
Divide & Conquer Imitation Learning
Divide & Conquer Imitation LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Alexandre Chenu
Nicolas Perrin-Gilbert
Olivier Sigaud
274
5
0
15 Apr 2022
Efficient and practical quantum compiler towards multi-qubit systems
  with deep reinforcement learning
Efficient and practical quantum compiler towards multi-qubit systems with deep reinforcement learningQuantum Science and Technology (QST), 2022
Qiuhao Chen
Yuxuan Du
Qi Zhao
Yuliang Jiao
Xiliang Lu
Xingyao Wu
221
17
0
14 Apr 2022
GloCAL: Glocalized Curriculum-Aided Learning of Multiple Tasks with
  Application to Robotic Grasping
GloCAL: Glocalized Curriculum-Aided Learning of Multiple Tasks with Application to Robotic GraspingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Anil Kurkcu
C. Acar
D. Campolo
K. P. Tee
183
1
0
14 Apr 2022
What Matters in Language Conditioned Robotic Imitation Learning over
  Unstructured Data
What Matters in Language Conditioned Robotic Imitation Learning over Unstructured DataIEEE Robotics and Automation Letters (RA-L), 2022
Oier Mees
Lukás Hermann
Wolfram Burgard
LM&Ro
340
193
0
13 Apr 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement
  Learning in Safety-Critical Driving Scenarios
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving ScenariosInternational Conference on Machine Learning Technologies (ICMLT), 2022
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
170
1
0
11 Apr 2022
Learning Object-Centered Autotelic Behaviors with Graph Neural Networks
Learning Object-Centered Autotelic Behaviors with Graph Neural Networks
Ahmed Akakzia
Olivier Sigaud
219
0
0
11 Apr 2022
gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement
  Learning Approach
gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement Learning Approach
Johannes Dornheim
OffRLAI4CE
145
5
0
11 Apr 2022
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning
  for Robotics
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for RoboticsInternational Conference on Development and Learning (ICDL), 2022
Frank Röder
Manfred Eppe
S. Wermter
202
7
0
08 Apr 2022
Automatic Parameter Optimization Using Genetic Algorithm in Deep
  Reinforcement Learning for Robotic Manipulation Tasks
Automatic Parameter Optimization Using Genetic Algorithm in Deep Reinforcement Learning for Robotic Manipulation Tasks
Adarsh Sehgal
Nicholas Ward
Hung M. La
S. Louis
175
1
0
07 Apr 2022
Model Based Meta Learning of Critics for Policy Gradients
Model Based Meta Learning of Critics for Policy Gradients
Sarah Bechtle
Ludovic Righetti
Franziska Meier
OffRL
122
0
0
05 Apr 2022
Hierarchical Reinforcement Learning under Mixed Observability
Hierarchical Reinforcement Learning under Mixed ObservabilityWorkshop on the Algorithmic Foundations of Robotics (WAFR), 2022
Hai V. Nguyen
Zhihan Yang
Andrea Baisero
Xiao Ma
Robert Platt
Chris Amato
254
4
0
02 Apr 2022
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable
  Object Manipulations with Tools
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with ToolsInternational Conference on Learning Representations (ICLR), 2022
Xingyu Lin
Zhiao Huang
Yunzhu Li
J. Tenenbaum
David Held
Chuang Gan
286
85
0
31 Mar 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in
  Intrinsic Motivation
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation
Zhao Yang
Thomas M. Moerland
Mike Preuss
Aske Plaat
172
2
0
29 Mar 2022
A Visual Navigation Perspective for Category-Level Object Pose
  Estimation
A Visual Navigation Perspective for Category-Level Object Pose EstimationEuropean Conference on Computer Vision (ECCV), 2022
Jiaxin Guo
Fangxun Zhong
R. Xiong
Yunhui Liu
Yue Wang
Yiyi Liao
OCL
312
10
0
25 Mar 2022
The Challenges of Continuous Self-Supervised Learning
The Challenges of Continuous Self-Supervised LearningEuropean Conference on Computer Vision (ECCV), 2022
Senthil Purushwalkam
Pedro Morgado
Abhinav Gupta
CLL
276
56
0
23 Mar 2022
Possibility Before Utility: Learning And Using Hierarchical Affordances
Possibility Before Utility: Learning And Using Hierarchical AffordancesInternational Conference on Learning Representations (ICLR), 2022
Robby Costales
Shariq Iqbal
Fei Sha
290
5
0
23 Mar 2022
One After Another: Learning Incremental Skills for a Changing World
One After Another: Learning Incremental Skills for a Changing WorldInternational Conference on Learning Representations (ICLR), 2022
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
CLL
355
15
0
21 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a
  soft actor-critic approach
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approachNeural Networks (NN), 2022
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
251
57
0
19 Mar 2022
Goal-conditioned dual-action imitation learning for dexterous dual-arm robot manipulation
Goal-conditioned dual-action imitation learning for dexterous dual-arm robot manipulationIEEE Transactions on robotics (TRO), 2022
Heecheol Kim
Yoshiyuki Ohmura
Yasuo Kuniyoshi
354
61
0
18 Mar 2022
RB2: Robotic Manipulation Benchmarking with a Twist
RB2: Robotic Manipulation Benchmarking with a Twist
Sudeep Dasari
Jianren Wang
Joyce Hong
Shikhar Bahl
Yixin Lin
...
David Held
Lerrel Pinto
Deepak Pathak
Vikash Kumar
Abhi Gupta
339
34
0
15 Mar 2022
PLATO: Predicting Latent Affordances Through Object-Centric Play
PLATO: Predicting Latent Affordances Through Object-Centric PlayConference on Robot Learning (CoRL), 2022
Suneel Belkhale
Dorsa Sadigh
OffRL
244
15
0
10 Mar 2022
Policy Architectures for Compositional Generalization in Control
Policy Architectures for Compositional Generalization in Control
Allan Zhou
Vikash Kumar
Chelsea Finn
Aravind Rajeswaran
224
28
0
10 Mar 2022
Neuro-symbolic Natural Logic with Introspective Revision for Natural
  Language Inference
Neuro-symbolic Natural Logic with Introspective Revision for Natural Language InferenceTransactions of the Association for Computational Linguistics (TACL), 2022
Yufei Feng
Xiaoyu Yang
Xiao-Dan Zhu
Michael A. Greenspan
LRMNAI
416
14
0
09 Mar 2022
Multi-Objective reward generalization: Improving performance of Deep
  Reinforcement Learning for applications in single-asset trading
Multi-Objective reward generalization: Improving performance of Deep Reinforcement Learning for applications in single-asset trading
F. Cornalba
C. Disselkamp
Davide Scassola
Christopher Helf
258
12
0
09 Mar 2022
Policy-Based Bayesian Experimental Design for Non-Differentiable
  Implicit Models
Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models
Vincent Lim
Ellen R. Novoseller
Jeffrey Ichnowski
Huang Huang
Ken Goldberg
OffRL
241
15
0
08 Mar 2022
Learning Sensorimotor Primitives of Sequential Manipulation Tasks from
  Visual Demonstrations
Learning Sensorimotor Primitives of Sequential Manipulation Tasks from Visual DemonstrationsIEEE International Conference on Robotics and Automation (ICRA), 2022
Junchi Liang
Bowen Wen
Kostas Bekris
Abdeslam Boularias
SSL
138
17
0
08 Mar 2022
AutoDIME: Automatic Design of Interesting Multi-Agent Environments
AutoDIME: Automatic Design of Interesting Multi-Agent Environments
I. Kanitscheider
Harrison Edwards
131
0
0
04 Mar 2022
Self-Supervised Learning for Joint Pushing and Grasping Policies in
  Highly Cluttered Environments
Self-Supervised Learning for Joint Pushing and Grasping Policies in Highly Cluttered EnvironmentsIEEE International Conference on Robotics and Automation (ICRA), 2022
Yongliang Wang
Kamal Mokhtar
C. Heemskerk
Hamidreza Kasaei
SSL
378
20
0
04 Mar 2022
Evolving Curricula with Regret-Based Environment Design
Evolving Curricula with Regret-Based Environment DesignInternational Conference on Machine Learning (ICML), 2022
Jack Parker-Holder
Minqi Jiang
Michael Dennis
Mikayel Samvelyan
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
532
174
0
02 Mar 2022
Model-free Neural Lyapunov Control for Safe Robot Navigation
Model-free Neural Lyapunov Control for Safe Robot NavigationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Zikang Xiong
Joe Eappen
A. H. Qureshi
Suresh Jagannathan
286
17
0
02 Mar 2022
GA+DDPG+HER: Genetic Algorithm-Based Function Optimizer in Deep
  Reinforcement Learning for Robotic Manipulation Tasks
GA+DDPG+HER: Genetic Algorithm-Based Function Optimizer in Deep Reinforcement Learning for Robotic Manipulation TasksInternational Conference on Robotic Computing (IRC), 2022
Adarsh Sehgal
Nicholas Ward
Hung M. La
C. Papachristos
S. Louis
176
6
0
28 Feb 2022
Weakly Supervised Disentangled Representation for Goal-conditioned
  Reinforcement Learning
Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement LearningIEEE Robotics and Automation Letters (RA-L), 2022
Zhifeng Qian
Mingyu You
Hongjun Zhou
Bin He
DRLOffRL
212
7
0
28 Feb 2022
Exploring with Sticky Mittens: Reinforcement Learning with Expert
  Interventions via Option Templates
Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option TemplatesConference on Robot Learning (CoRL), 2022
Souradeep Dutta
Kaustubh Sridhar
Osbert Bastani
Guang Cheng
James Weimer
Insup Lee
J. Parish-Morris
356
2
0
25 Feb 2022
All You Need Is Supervised Learning: From Imitation Learning to Meta-RL
  With Upside Down RL
All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL
Kai Arulkumaran
Dylan R. Ashley
Jürgen Schmidhuber
R. Srivastava
OffRL
238
8
0
24 Feb 2022
Learning Program Synthesis for Integer Sequences from Scratch
Learning Program Synthesis for Integer Sequences from ScratchAAAI Conference on Artificial Intelligence (AAAI), 2022
Thibault Gauthier
Josef Urban
305
12
0
24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for
  Mapless Navigation in Intralogistics
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in IntralogisticsApplied Sciences (Appl. Sci.), 2022
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
284
25
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
313
25
0
23 Feb 2022
Continual Auxiliary Task Learning
Continual Auxiliary Task LearningNeural Information Processing Systems (NeurIPS), 2022
Matt McLeod
Chun-Ping Lo
M. Schlegel
Andrew Jacobsen
Raksha Kumaraswamy
Martha White
Adam White
CLL
184
11
0
22 Feb 2022
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum
  Generation
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum GenerationInternational Conference on Learning Representations (ICLR), 2022
Yuqing Du
Pieter Abbeel
Aditya Grover
265
20
0
22 Feb 2022
CCPT: Automatic Gameplay Testing and Validation with
  Curiosity-Conditioned Proximal Trajectories
CCPT: Automatic Gameplay Testing and Validation with Curiosity-Conditioned Proximal Trajectories
Alessandro Sestini
Linus Gisslén
Joakim Bergdahl
Konrad Tollmar
Andrew D. Bagdanov
185
7
0
21 Feb 2022
Goal-directed Planning and Goal Understanding by Active Inference:
  Evaluation Through Simulated and Physical Robot Experiments
Goal-directed Planning and Goal Understanding by Active Inference: Evaluation Through Simulated and Physical Robot Experiments
Takazumi Matsumoto
Wataru Ohata
Fabien C. Y. Benureau
Jun Tani
155
13
0
21 Feb 2022
AKB-48: A Real-World Articulated Object Knowledge Base
AKB-48: A Real-World Articulated Object Knowledge BaseComputer Vision and Pattern Recognition (CVPR), 2022
Liu Liu
Wenqiang Xu
Haoyuan Fu
Sucheng Qian
Yong-Jin Han
Cewu Lu
267
122
0
17 Feb 2022
Previous
123...131415...252627
Next
Page 14 of 27
Pageof 27