ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXivPDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,242 papers shown
Title
Attention-Based Reward Shaping for Sparse and Delayed Rewards
Attention-Based Reward Shaping for Sparse and Delayed Rewards
Ian Holmes
Min Chi
OffRL
22
0
0
16 May 2025
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
L. Hanzo
14
0
0
15 May 2025
General Dynamic Goal Recognition
General Dynamic Goal Recognition
Osher Elhadad
Reuth Mirsky
AI4CE
14
0
0
14 May 2025
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Shuai Han
Mehdi Dastani
Shihan Wang
29
0
0
13 May 2025
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
Hanjung Kim
Jaehyun Kang
Hyolim Kang
Meedeum Cho
Seon Joo Kim
Youngwoon Lee
34
0
0
13 May 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
47
0
0
06 May 2025
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
Chenran Zhao
Dianxi Shi
Mengzhu Wang
Jianqiang Xia
Huanhuan Yang
Songchang Jin
Shaowu Yang
Chunping Qiu
27
0
0
04 May 2025
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
Bofei Liu
Dong Ye
Zunhao Yao
Zhaowei Sun
28
0
0
04 May 2025
CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation
CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation
Mazal Bethany
Nishant Vishwamitra
Cho-Yu Chiang
Peyman Najafirad
AAML
28
0
0
03 May 2025
Neuro-Symbolic Generation of Explanations for Robot Policies with Weighted Signal Temporal Logic
Neuro-Symbolic Generation of Explanations for Robot Policies with Weighted Signal Temporal Logic
Mikihisa Yuasa
R. Sreenivas
Huy T. Tran
40
0
0
30 Apr 2025
Planning with Diffusion Models for Target-Oriented Dialogue Systems
Planning with Diffusion Models for Target-Oriented Dialogue Systems
Hanwen Du
B. Peng
Xia Ning
25
0
0
23 Apr 2025
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Jie Cheng
Ruixi Qiao
Lijun Li
Chao Guo
J. Z. Wang
Gang Xiong
Yisheng Lv
Fei-Yue Wang
LRM
154
1
0
21 Apr 2025
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Fikrican Özgür
René Zurbrugg
Suryansh Kumar
35
0
0
15 Apr 2025
Diffusion Models for Robotic Manipulation: A Survey
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
51
1
0
11 Apr 2025
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
Zhao Dong
Ka Chen
Zhaoyang Lv
Hong-Xing Yu
Yunzhi Zhang
...
Xiaqing Pan
Mingfei Yan
Jiajun Wu
Carl Ren
Richard Newcombe
44
1
0
11 Apr 2025
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Ning Yang
Stephen Xia
OffRL
33
0
0
08 Apr 2025
Solving Sokoban using Hierarchical Reinforcement Learning with Landmarks
Solving Sokoban using Hierarchical Reinforcement Learning with Landmarks
Sergey Pastukhov
26
0
0
06 Apr 2025
Outlook Towards Deployable Continual Learning for Particle Accelerators
Outlook Towards Deployable Continual Learning for Particle Accelerators
Kishansingh Rajput
Sen Lin
Auralee Edelen
Willem Blokland
Malachi Schram
26
0
0
04 Apr 2025
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
Younghwan Lee
Tung M. Luu
Donghoon Lee
Chang D. Yoo
3DV
VLM
OffRL
41
0
0
03 Apr 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Llewyn Salt
Marcus Gallagher
33
1
0
02 Apr 2025
Learning to chain-of-thought with Jensen's evidence lower bound
Learning to chain-of-thought with Jensen's evidence lower bound
Yunhao Tang
Sid Wang
Rémi Munos
BDL
OffRL
LRM
55
0
0
25 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
103
2
0
24 Mar 2025
Causally Aligned Curriculum Learning
Causally Aligned Curriculum Learning
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
61
3
0
21 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
41
0
0
20 Mar 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
Luc McCutcheon
Bahman Gharesifard
Saber Fallah
46
0
0
19 Mar 2025
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Kevin Wang
Ishaan Javali
Michał Bortkiewicz
Tomasz Trzciñski
Benjamin Eysenbach
SSL
OffRL
67
0
0
19 Mar 2025
Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation
Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation
Jianqi Gao
Xizheng Pang
Qi Liu
Yanjie Li
48
0
0
15 Mar 2025
LUMOS: Language-Conditioned Imitation Learning with World Models
Iman Nematollahi
Branton DeMoss
Akshay L Chandra
Nick Hawes
Wolfram Burgard
Ingmar Posner
OffRL
43
0
0
13 Mar 2025
DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models
Ruizhe Chen
Wenhao Chai
Zhifei Yang
Xiaotian Zhang
Qiufeng Wang
Tony Q. S. Quek
Soujanya Poria
Zuozhu Liu
50
0
0
06 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Anton van den Hengel
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
59
1
0
05 Mar 2025
Causality-Based Reinforcement Learning Method for Multi-Stage Robotic Tasks
Jiechao Deng
Ning Tan
55
0
0
05 Mar 2025
ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment
Shaofei Cai
Zhancun Mu
Anji Liu
Yitao Liang
56
1
0
04 Mar 2025
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
Ziyan Wang
Zhicheng Zhang
Fei Fang
Yali Du
41
0
0
03 Mar 2025
Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference
Wenjie Qiu
Yi-Chen Li
Xuqin Zhang
Tianyi Zhang
Y. Zhang
Zongzhang Zhang
Yang Yu
ALM
46
0
0
01 Mar 2025
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
Jefferson Silveira
Joshua A. Marshall
Sidney N. Givigi Jr
60
0
0
24 Feb 2025
Training a Generally Curious Agent
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
118
1
0
24 Feb 2025
Theoretical Barriers in Bellman-Based Reinforcement Learning
Theoretical Barriers in Bellman-Based Reinforcement Learning
Brieuc Pinon
Raphaël Jungers
Jean-Charles Delvenne
32
0
0
17 Feb 2025
VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning
VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning
Qingyuan Wu
Jianheng Liu
Jianye Hao
Jun Wang
Kun Shao
OffRL
100
0
0
11 Feb 2025
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Vivek Myers
Bill Chunyuan Zheng
Anca Dragan
Kuan Fang
Sergey Levine
65
0
0
08 Feb 2025
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning
Kaixi Bao
Chenhao Li
Yarden As
Andreas Krause
Marco Hutter
OffRL
CLL
116
1
0
03 Feb 2025
Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning
Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning
Federico Malato
Ville Hautamaki
37
0
0
03 Feb 2025
Upside Down Reinforcement Learning with Policy Generators
Upside Down Reinforcement Learning with Policy Generators
Jacopo Di Ventura
Dylan R. Ashley
Vincent Herrmann
Francesco Faccio
Jürgen Schmidhuber
29
0
0
27 Jan 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
175
0
0
22 Jan 2025
Pareto Set Learning for Multi-Objective Reinforcement Learning
Pareto Set Learning for Multi-Objective Reinforcement Learning
Erlong Liu
Yu-Chang Wu
Xiaobin Huang
Chengrui Gao
Ren-Jian Wang
Ke Xue
Chao Qian
OffRL
42
2
0
12 Jan 2025
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Yueqin Yin
Shentao Yang
Yujia Xie
Ziyi Yang
Yuting Sun
Hany Awadalla
Weizhu Chen
Mingyuan Zhou
50
0
0
07 Jan 2025
Attribute-Based Robotic Grasping with Data-Efficient Adaptation
Attribute-Based Robotic Grasping with Data-Efficient Adaptation
Yang Yang
Houjian Yu
Xibai Lou
Yuanhao Liu
Changhyun Choi
52
8
0
04 Jan 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
OffRL
53
0
0
03 Jan 2025
Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning
Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning
Anthony Kobanda
Rémy Portelas
Odalric-Ambrym Maillard
Ludovic Denoyer
OffRL
CLL
77
0
0
19 Dec 2024
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down
  Maps
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
79
1
0
16 Dec 2024
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep P. Chinchali
Ufuk Topcu
OffRL
92
0
0
02 Dec 2024
1234...232425
Next