ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,339 papers shown
Synergizing Quality-Diversity with Descriptor-Conditioned Reinforcement
  Learning
Synergizing Quality-Diversity with Descriptor-Conditioned Reinforcement Learning
Maxence Faldor
Félix Chalumeau
Manon Flageat
Antoine Cully
299
4
0
10 Dec 2023
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a
  High Replay Ratio and Regularization
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
277
1
0
10 Dec 2023
Backward Learning for Goal-Conditioned Policies
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
309
3
0
08 Dec 2023
PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play
PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play
Lili Chen
Shikhar Bahl
Deepak Pathak
218
59
0
07 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLMOffRLOnRL
326
12
0
06 Dec 2023
Diffused Task-Agnostic Milestone Planner
Diffused Task-Agnostic Milestone Planner
Mineui Hong
Minjae Kang
Songhwai Oh
299
8
0
06 Dec 2023
Understanding Representations Pretrained with Auxiliary Losses for
  Embodied Agent Planning
Understanding Representations Pretrained with Auxiliary Losses for Embodied Agent Planning
Samrudhdhi B. Rangrej
James J. Clark
SSL
270
0
0
06 Dec 2023
Contact Energy Based Hindsight Experience Prioritization
Contact Energy Based Hindsight Experience PrioritizationIEEE International Conference on Robotics and Automation (ICRA), 2023
Erdi Sayar
Zhenshan Bing
Carlo DÉramo
Ozgur S. Oguz
Alois Knoll
237
4
0
05 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Visual Hindsight Self-Imitation Learning for Interactive NavigationIEEE Access (IEEE Access), 2023
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
215
2
0
05 Dec 2023
Working Backwards: Learning to Place by Picking
Working Backwards: Learning to Place by PickingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Oliver Limoyo
Abhisek Konar
Trevor Ablett
Jonathan Kelly
F. Hogan
Gregory Dudek
338
0
0
04 Dec 2023
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse
  Catalysts Design
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse Catalysts Design
Romain Lacombe
Lucas Hendren
Khalid El-Awady
141
2
0
04 Dec 2023
Modular Control Architecture for Safe Marine Navigation: Reinforcement
  Learning and Predictive Safety Filters
Modular Control Architecture for Safe Marine Navigation: Reinforcement Learning and Predictive Safety Filters
Aksel Vaaler
Svein Jostein Husa
Daniel Menges
T. N. Larsen
Adil Rasheed
293
2
0
04 Dec 2023
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement
  Learning
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
158
0
0
29 Nov 2023
Goal-conditioned Offline Planning from Curious Exploration
Goal-conditioned Offline Planning from Curious ExplorationNeural Information Processing Systems (NeurIPS), 2023
Marco Bagatella
Georg Martius
OffRL
310
1
0
28 Nov 2023
Offline Skill Generalization via Task and Motion Planning
Offline Skill Generalization via Task and Motion Planning
Shin Watanabe
Geir Horn
J. Tørresen
K. Ellefsen
OffRL
262
0
0
24 Nov 2023
Multi-Objective Reinforcement Learning Based on Decomposition: A
  Taxonomy and Framework
Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and FrameworkJournal of Artificial Intelligence Research (JAIR), 2023
Florian Felten
El-Ghazali Talbi
Grégoire Danoy
189
31
0
21 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM
  Contingency Management
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
153
3
0
17 Nov 2023
Signal Temporal Logic-Guided Apprenticeship Learning
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
200
3
0
09 Nov 2023
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for
  Deep Reinforcement Learning
Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for Deep Reinforcement Learning
Junmin Zhong
Ruofan Wu
Jennie Si
OffRL
125
1
0
07 Nov 2023
PcLast: Discovering Plannable Continuous Latent States
PcLast: Discovering Plannable Continuous Latent StatesInternational Conference on Machine Learning (ICML), 2023
Anurag Koul
Shivakanth Sujit
Shaoru Chen
Ben Evans
Lili Wu
...
Yonathan Efroni
Lekan Molu
Miro Dudik
John Langford
Alex Lamb
OffRLBDL
342
1
0
06 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
492
7
0
06 Nov 2023
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
SMORE: Score Models for Offline Goal-Conditioned Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Harshit S. Sikchi
Rohan Chitnis
Ahmed Touati
A. Geramifard
Amy Zhang
S. Niekum
OffRL
484
13
0
03 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement
  Learning
Selectively Sharing Experiences Improves Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
M. Gerstgrasser
Tom Danino
Sarah Keren
286
10
0
01 Nov 2023
Autonomous Robotic Reinforcement Learning with Asynchronous Human
  Feedback
Autonomous Robotic Reinforcement Learning with Asynchronous Human FeedbackConference on Robot Learning (CoRL), 2023
Max Balsells
M. Torné
Zihan Wang
Samedh Desai
Pulkit Agrawal
Abhishek Gupta
269
13
0
31 Oct 2023
Learning to Discover Skills through Guidance
Learning to Discover Skills through GuidanceNeural Information Processing Systems (NeurIPS), 2023
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Sejik Park
Kyushik Min
Jaegul Choo
370
11
0
31 Oct 2023
Contrastive Difference Predictive Coding
Contrastive Difference Predictive CodingInternational Conference on Learning Representations (ICLR), 2023
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TSOffRL
328
26
0
31 Oct 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with
  Learned Models
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
Mianchu Wang
Rui Yang
Xi Chen
Hao Sun
Meng Fang
Giovanni Montana
OffRL
426
15
0
30 Oct 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery
  of Skills
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of SkillsInternational Conference on Machine Learning (ICML), 2023
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSLDRL
288
16
0
30 Oct 2023
Free from Bellman Completeness: Trajectory Stitching via Model-based
  Return-conditioned Supervised Learning
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningInternational Conference on Learning Representations (ICLR), 2023
Zhaoyi Zhou
Chuning Zhu
Runlong Zhou
Qiwen Cui
Abhishek Gupta
S. S. Du
OffRL
225
10
0
30 Oct 2023
Diversify & Conquer: Outcome-directed Curriculum RL via
  Out-of-Distribution Disagreement
Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution DisagreementNeural Information Processing Systems (NeurIPS), 2023
Daesol Cho
Seungjae Lee
H. J. Kim
OODD
284
4
0
30 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Unsupervised Behavior Extraction via Random Intent PriorsNeural Information Processing Systems (NeurIPS), 2023
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
270
14
0
28 Oct 2023
Guided Data Augmentation for Offline Reinforcement Learning and
  Imitation Learning
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
Nicholas Corrado
Yu-Tao Qu
John U. Balis
Adam Labiosa
Josiah P. Hanna
OffRL
304
9
0
27 Oct 2023
Understanding when Dynamics-Invariant Data Augmentations Benefit
  Model-Free Reinforcement Learning Updates
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning UpdatesInternational Conference on Learning Representations (ICLR), 2023
Nicholas Corrado
Josiah P. Hanna
297
6
0
26 Oct 2023
CQM: Curriculum Reinforcement Learning with a Quantized World Model
CQM: Curriculum Reinforcement Learning with a Quantized World ModelNeural Information Processing Systems (NeurIPS), 2023
Seungjae Lee
Daesol Cho
Jonghae Park
H. J. Kim
232
14
0
26 Oct 2023
Learning Agility and Adaptive Legged Locomotion via Curricular Hindsight
  Reinforcement Learning
Learning Agility and Adaptive Legged Locomotion via Curricular Hindsight Reinforcement LearningScientific Reports (Sci Rep), 2023
Sicen Li
Yiming Pang
Panju Bai
Zhaojin Liu
Jiawei Li
Shihao Hu
Liquan Wang
Gang Wang
279
12
0
24 Oct 2023
Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good
  States
Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good StatesConference on Robot Learning (CoRL), 2023
Zidan Wang
Takeru Oba
Takuma Yoneda
Rui Shen
Matthew R. Walter
Bradly C. Stadie
DiffM
258
14
0
21 Oct 2023
Teaching Language Models to Self-Improve through Interactive
  Demonstrations
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu
Baolin Peng
Michel Galley
Jianfeng Gao
Zhou Yu
LRMReLM
272
27
0
20 Oct 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in
  Continuous Control
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous ControlNeural Information Processing Systems (NeurIPS), 2023
Chao Li
Chen Gong
Qiang He
Xinwen Hou
249
4
0
17 Oct 2023
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation
  and Generalization
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Niket Tandon
Li Zhang
Chris Callison-Burch
Peter Clark
LRMLLMAGCLL
215
61
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRLLM&Ro
361
44
0
15 Oct 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
GROOT: Learning to Follow Instructions by Watching Gameplay VideosInternational Conference on Learning Representations (ICLR), 2023
Shaofei Cai
Bowei Zhang
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
330
38
0
12 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Understanding the Effects of RLHF on LLM Generalisation and DiversityInternational Conference on Learning Representations (ICLR), 2023
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CEALM
590
267
0
10 Oct 2023
Human-Robot Gym: Benchmarking Reinforcement Learning in Human-Robot
  Collaboration
Human-Robot Gym: Benchmarking Reinforcement Learning in Human-Robot CollaborationIEEE International Conference on Robotics and Automation (ICRA), 2023
Jakob Thumm
Felix Trost
Matthias Althoff
OffRL
326
10
0
09 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is
  needed? An RL Perspective on RLHF, Prompting, and Beyond
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
278
27
0
09 Oct 2023
Learning Interactive Real-World Simulators
Learning Interactive Real-World SimulatorsInternational Conference on Learning Representations (ICLR), 2023
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&RoPINN
350
334
0
09 Oct 2023
Compositional Servoing by Recombining Demonstrations
Compositional Servoing by Recombining DemonstrationsIEEE International Conference on Robotics and Automation (ICRA), 2023
Max Argus
Abhijeet Nayak
Martin Buchner
Silvio Galesso
Abhinav Valada
Thomas Brox
218
1
0
06 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in
  Non-Visual Environments: A Comparison
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A ComparisonInternational Conference on Machine Learning, Optimization, and Data Science (MOD), 2023
Moritz Lange
Noah Krystiniak
Raphael C. Engelhardt
Wolfgang Konen
Laurenz Wiskott
OffRL
191
2
0
06 Oct 2023
Pre-Training and Fine-Tuning Generative Flow Networks
Pre-Training and Fine-Tuning Generative Flow NetworksInternational Conference on Learning Representations (ICLR), 2023
Ling Pan
Moksh Jain
Kanika Madan
Yoshua Bengio
253
18
0
05 Oct 2023
Roadmaps with Gaps over Controllers: Achieving Efficiency in Planning
  under Dynamics
Roadmaps with Gaps over Controllers: Achieving Efficiency in Planning under DynamicsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Aravind Sivaramakrishnan
Sumanth Tangirala
Edgar Granados
Noah R. Carver
Kostas E. Bekris
281
3
0
05 Oct 2023
Learning to Reach Goals via Diffusion
Learning to Reach Goals via DiffusionInternational Conference on Machine Learning (ICML), 2023
V. Jain
Siamak Ravanbakhsh
DiffMOffRL
188
10
0
04 Oct 2023
Previous
123...678...252627
Next
Page 7 of 27
Pageof 27