ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay
v1v2v3 (latest)

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,339 papers shown
Disentangled Representations for Causal Cognition
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
260
4
0
30 Jun 2024
Learning Formal Mathematics From Intrinsic Motivation
Learning Formal Mathematics From Intrinsic Motivation
Gabriel Poesia
David Broman
Nick Haber
Noah D. Goodman
LRM
305
29
0
30 Jun 2024
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Gautham Vasan
Yan Wang
Fahim Shahriar
James Bergstra
Martin Jägersand
A. R. Mahmood
265
11
0
29 Jun 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with
  Mutually Responsive Policies
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
158
0
0
26 Jun 2024
OCALM: Object-Centric Assessment with Language Models
OCALM: Object-Centric Assessment with Language Models
Timo Kaufmann
Johannes Czech
Antonia Wüst
Quentin Delfosse
Kristian Kersting
Eyke Hüllermeier
LM&RoLRM
280
1
0
24 Jun 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
381
31
0
24 Jun 2024
Learning Abstract World Model for Value-preserving Planning with Options
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
George Konidaris
278
3
0
22 Jun 2024
Learning telic-controllable state representations
Learning telic-controllable state representations
Nadav Amir
Stas Tiomkin
301
1
0
20 Jun 2024
Metacognitive AI: Framework and the Case for a Neurosymbolic Approach
Metacognitive AI: Framework and the Case for a Neurosymbolic Approach
Hua Wei
Paulo Shakarian
Christian Lebiere
Bruce Draper
Nikhil Krishnaswamy
Sergei Nirenburg
LRM
218
7
0
17 Jun 2024
Large Reasoning Models for 3D Floorplanning in EDA: Learning from
  Imperfections
Large Reasoning Models for 3D Floorplanning in EDA: Learning from Imperfections
Fin Amin
N. Rouf
Tse-Han Pan
Md. Kamal Ibn Shafi
Paul D. Franzon
189
0
0
15 Jun 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
238
45
0
13 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep
  Reinforcement Learning Algorithms
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman S. Kozat
Ozgur S. Oguz
96
1
0
13 Jun 2024
Multi-agent Reinforcement Learning with Deep Networks for Diverse
  Q-Vectors
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors
Zhenglong Luo
Zhiyong Chen
James Welsh
89
1
0
12 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
598
3
0
09 Jun 2024
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
Michał Zawalski
Gracjan Góral
Michał Tyrolski
Emilia Wisnios
Franciszek Budrowski
Marek Cygan
Łukasz Kuciński
Piotr Miłoś
345
2
0
05 Jun 2024
Multi-Agent Transfer Learning via Temporal Contrastive Learning
Multi-Agent Transfer Learning via Temporal Contrastive Learning
Weihao Zeng
Joseph Campbell
Simon Stepputtis
Katia Sycara
OffRL
248
2
0
03 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training,
  Integration, and Agent-Human Alignment
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
Chen Zhang
Qiang He
Zhou Yuan
Elvis S. Liu
Hong Wang
Jian Zhao
Yang-Feng Wang
298
6
0
03 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
539
8
0
03 Jun 2024
Shared-unique Features and Task-aware Prioritized Sampling on Multi-task
  Reinforcement Learning
Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning
Po-Shao Lin
Jia-Fong Yeh
Yi-Ting Chen
Winston H. Hsu
263
0
0
02 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy
  Gradient
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
230
35
0
02 Jun 2024
Exploring the limits of Hierarchical World Models in Reinforcement
  Learning
Exploring the limits of Hierarchical World Models in Reinforcement Learning
Robin Schiewer
Anand Subramoney
Laurenz Wiskott
240
7
0
01 Jun 2024
Towards Learning Foundation Models for Heuristic Functions to Solve
  Pathfinding Problems
Towards Learning Foundation Models for Heuristic Functions to Solve Pathfinding Problems
Vedant Khandelwal
Amit Sheth
Forest Agostinelli
258
4
0
01 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
199
4
0
30 May 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight
  Tuning on Multi-source Data
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
Zifan Song
Yudong Wang
Wenwei Zhang
Kuikun Liu
Chengqi Lyu
...
Qipeng Guo
Hang Yan
Dahua Lin
Kai-xiang Chen
Cairong Zhao
SyDa
160
6
0
29 May 2024
Causal Action Influence Aware Counterfactual Data Augmentation
Causal Action Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí
Marco Bagatella
Marin Vlastelica
Georg Martius
CML
190
10
0
29 May 2024
Rewarded Region Replay (R3) for Policy Learning with Discrete Action
  Space
Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Bangzheng Li
Ningshan Ma
Zifan Wang
76
0
1
26 May 2024
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Max Liu
Chan-Hung Yu
Wei-Hsu Lee
Cheng-Wei Hung
Yen-Chun Chen
Shao-Hua Sun
403
13
0
26 May 2024
RoboArm-NMP: a Learning Environment for Neural Motion Planning
RoboArm-NMP: a Learning Environment for Neural Motion Planning
Tom Jurgenson
Matan Sudry
Gal Avineri
Aviv Tamar
165
0
0
25 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Exclusively Penalized Q-learning for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
298
3
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
885
164
0
23 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
545
867
0
20 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement
  Learning
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
288
3
0
20 May 2024
Going into Orbit: Massively Parallelizing Episodic Reinforcement
  Learning
Going into Orbit: Massively Parallelizing Episodic Reinforcement Learning
Jan Oberst
Johann Bonneau
97
0
0
19 May 2024
Generalized Multi-Objective Reinforcement Learning with Envelope Updates in URLLC-enabled Vehicular Networks
Generalized Multi-Objective Reinforcement Learning with Envelope Updates in URLLC-enabled Vehicular NetworksIEEE Transactions on Vehicular Technology (IEEE Trans. Veh. Technol.), 2024
Zijiang Yan
Hina Tabassum
235
7
0
18 May 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of
  Gradient Directions for Policy Improvement
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy ImprovementAdaptive Agents and Multi-Agent Systems (AAMAS), 2024
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Zhou Fang
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
241
5
0
14 May 2024
CIER: A Novel Experience Replay Approach with Causal Inference in Deep
  Reinforcement Learning
CIER: A Novel Experience Replay Approach with Causal Inference in Deep Reinforcement Learning
Jingwen Wang
Dehui Du
Yida Li
Yiyang Li
Yikang Chen
AI4TSCML
122
0
0
14 May 2024
AnyRotate: Gravity-Invariant In-Hand Object Rotation with Sim-to-Real
  Touch
AnyRotate: Gravity-Invariant In-Hand Object Rotation with Sim-to-Real Touch
Max Yang
Chenghua Lu
Alex Church
Yijiong Lin
Christopher J. Ford
Haoran Li
Efi Psomopoulou
David A.W. Barton
Nathan Lepora
343
33
0
12 May 2024
A Minimalist Prompt for Zero-Shot Policy Learning
A Minimalist Prompt for Zero-Shot Policy Learning
Meng Song
Xuezhi Wang
Tanay Biradar
Yao Qin
Manmohan Chandraker
OffRL
188
2
0
09 May 2024
Learning Planning Abstractions from Language
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
269
4
0
06 May 2024
Artificial Intelligence in the Autonomous Navigation of Endovascular
  Interventions: A Systematic Review
Artificial Intelligence in the Autonomous Navigation of Endovascular Interventions: A Systematic ReviewFrontiers in Human Neuroscience (Front. Hum. Neurosci.), 2023
Harry Robertshaw
Lennart Karstensen
Benjamin Jackson
Hadi Sadati
K. Rhode
Sebastien Ourselin
Alejandro Granados
Thomas C Booth
144
25
0
06 May 2024
Robot Air Hockey: A Manipulation Testbed for Robot Learning with
  Reinforcement Learning
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck
Carl Qi
M. Munje
Shuozhe Li
Max Rudolph
...
Kavan Mehta
Anthony Wang
Peter Stone
Amy Zhang
S. Niekum
262
5
0
06 May 2024
Proximal Curriculum with Task Correlations for Deep Reinforcement
  Learning
Proximal Curriculum with Task Correlations for Deep Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Georgios Tzannetos
Parameswaran Kamalaruban
Adish Singla
217
6
0
03 May 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through
  Exploiting State-Action Space Structure
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
188
2
0
01 May 2024
DPO Meets PPO: Reinforced Token Optimization for RLHF
DPO Meets PPO: Reinforced Token Optimization for RLHF
Han Zhong
Zikang Shan
Guhao Feng
Wei Xiong
Xinle Cheng
Li Zhao
Di He
Jiang Bian
Liwei Wang
622
97
0
29 Apr 2024
Distilling Privileged Information for Dubins Traveling Salesman Problems
  with Neighborhoods
Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods
M. Shin
Su-Jeong Park
Seung-Keol Ryu
Heeyeon Kim
Han-Lim Choi
261
1
0
25 Apr 2024
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement
  Learning via Hindsight Relabeling
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh
Wesley A Suttle
Brian M Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
288
5
0
20 Apr 2024
Towards a Research Community in Interpretable Reinforcement Learning:
  the InterpPol Workshop
Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop
Hector Kohler
Quentin Delfosse
Paul Festor
Philippe Preux
292
0
0
16 Apr 2024
A Survey on Deep Learning for Theorem Proving
A Survey on Deep Learning for Theorem Proving
Zhaoyu Li
Jialiang Sun
Logan Murphy
Qidong Su
Zenan Li
Xian Zhang
Kaiyu Yang
Xujie Si
LRM
284
49
0
15 Apr 2024
Provable Interactive Learning with Hindsight Instruction Feedback
Provable Interactive Learning with Hindsight Instruction Feedback
Dipendra Kumar Misra
Aldo Pacchiano
Rob Schapire
282
1
0
14 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from
  Human Feedback for LLMs
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
406
86
0
12 Apr 2024
Previous
123456...252627
Next