ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.01495
  4. Cited By
Hindsight Experience Replay

Hindsight Experience Replay

5 July 2017
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
    OffRL
ArXivPDFHTML

Papers citing "Hindsight Experience Replay"

50 / 1,242 papers shown
Title
Pre-Training and Fine-Tuning Generative Flow Networks
Pre-Training and Fine-Tuning Generative Flow Networks
Ling Pan
Moksh Jain
Kanika Madan
Yoshua Bengio
47
13
0
05 Oct 2023
Roadmaps with Gaps over Controllers: Achieving Efficiency in Planning
  under Dynamics
Roadmaps with Gaps over Controllers: Achieving Efficiency in Planning under Dynamics
A. Sivaramakrishnan
Sumanth Tangirala
Edgar Granados
Noah R. Carver
Kostas E. Bekris
25
3
0
05 Oct 2023
Learning to Reach Goals via Diffusion
Learning to Reach Goals via Diffusion
V. Jain
Siamak Ravanbakhsh
DiffM
OffRL
43
3
0
04 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable
  Diffusion Model
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
45
29
0
03 Oct 2023
Learning and reusing primitive behaviours to improve Hindsight
  Experience Replay sample efficiency
Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency
Francisco Roldan Sanchez
Qiang Wang
David Córdova Bulens
Kevin McGuinness
Stephen J. Redmond
Noel E. O'Connor
OffRL
OnRL
26
1
0
03 Oct 2023
Efficient Planning with Latent Diffusion
Efficient Planning with Latent Diffusion
Wenhao Li
DiffM
40
4
0
30 Sep 2023
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
Mikihisa Yuasa
Huy T. Tran
R. Sreenivas
FAtt
LRM
54
1
0
29 Sep 2023
HyperPPO: A scalable method for finding small policies for robotic
  control
HyperPPO: A scalable method for finding small policies for robotic control
Luming Tang
Zhehui Huang
Gaurav Sukhatme
22
3
0
28 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
34
1
0
28 Sep 2023
Efficiency Separation between RL Methods: Model-Free, Model-Based and
  Goal-Conditioned
Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned
Han Bao
Raphaël Jungers
Jean-Charles Delvenne
OffRL
21
1
0
28 Sep 2023
Learning to Terminate in Object Navigation
Learning to Terminate in Object Navigation
Yuhang Song
Anh Nguyen
Chun-Yi Lee
32
3
0
28 Sep 2023
Distill Knowledge in Multi-task Reinforcement Learning with
  Optimal-Transport Regularization
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
39
1
0
27 Sep 2023
Maximum diffusion reinforcement learning
Maximum diffusion reinforcement learning
Thomas A. Berrueta
Allison Pinosky
Todd D. Murphey
AI4CE
DiffM
14
5
0
26 Sep 2023
On the Benefit of Optimal Transport for Curriculum Reinforcement
  Learning
On the Benefit of Optimal Transport for Curriculum Reinforcement Learning
Pascal Klink
Carlo DÉramo
Jan Peters
Joni Pajarinen
41
3
0
25 Sep 2023
Policy Stitching: Learning Transferable Robot Policies
Policy Stitching: Learning Transferable Robot Policies
Pingcheng Jian
Easop Lee
Zachary I. Bell
Michael M. Zavlanos
Boyuan Chen
OffRL
27
8
0
24 Sep 2023
Boosting Offline Reinforcement Learning for Autonomous Driving with
  Hierarchical Latent Skills
Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills
Zenan Li
Fan Nie
Q. Sun
Fang Da
Hang Zhao
OffRL
36
6
0
24 Sep 2023
Guided Cooperation in Hierarchical Reinforcement Learning via
  Model-based Rollout
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
30
2
0
24 Sep 2023
Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Chethan Bhateja
Derek Guo
Dibya Ghosh
Anika Singh
Manan Tomar
Q. Vuong
Yevgen Chebotar
Sergey Levine
Aviral Kumar
OffRL
36
20
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
24
17
0
22 Sep 2023
Q-Transformer: Scalable Offline Reinforcement Learning via
  Autoregressive Q-Functions
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
Yevgen Chebotar
Q. Vuong
A. Irpan
Karol Hausman
F. Xia
...
Brianna Zitkovich
Tomas Jackson
Kanishka Rao
Chelsea Finn
Sergey Levine
OffRL
131
81
0
18 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
44
9
0
18 Sep 2023
Equivariant Data Augmentation for Generalization in Offline
  Reinforcement Learning
Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning
Cristina Pinneri
Sarah Bechtle
Markus Wulfmeier
Arunkumar Byravan
Jingwei Zhang
William F. Whitney
Martin Riedmiller
OffRL
25
2
0
14 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement
  Learning
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
41
0
0
08 Sep 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
26
0
0
31 Aug 2023
Scaling Relationship on Learning Mathematical Reasoning with Large
  Language Models
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRM
ALM
33
160
0
03 Aug 2023
ETHER: Aligning Emergent Communication for Hindsight Experience Replay
ETHER: Aligning Emergent Communication for Hindsight Experience Replay
Kevin Denamganai
Daniel Hernández
Ozan Vardal
S. Missaoui
James Alfred Walker
31
0
0
28 Jul 2023
Contrastive Example-Based Control
Contrastive Example-Based Control
Kyle Hatch
Benjamin Eysenbach
Rafael Rafailov
Tianhe Yu
Ruslan Salakhutdinov
Sergey Levine
Chelsea Finn
OffRL
31
4
0
24 Jul 2023
Balancing Exploration and Exploitation in Hierarchical Reinforcement
  Learning via Latent Landmark Graphs
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Qingyang Zhang
Yiming Yang
Jingqing Ruan
Xuantang Xiong
Dengpeng Xing
Bo Xu
33
0
0
22 Jul 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
30
44
0
22 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
27
1
0
21 Jul 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from
  Human-in-the-Loop Feedback
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
M. Torné
Max Balsells
Zihan Wang
Samedh Desai
Tao Chen
Pulkit Agrawal
Abhishek Gupta
26
8
0
20 Jul 2023
Goal-Conditioned Reinforcement Learning with Disentanglement-based
  Reachability Planning
Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability Planning
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Bin He
26
3
0
20 Jul 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang
Litian Liang
Z. Ling
Xuanlin Li
Chuang Gan
H. Su
30
10
0
20 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement
  Learning
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
37
4
0
16 Jul 2023
The SocialAI School: Insights from Developmental Psychology Towards
  Artificial Socio-Cultural Agents
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
18
19
0
15 Jul 2023
Bi-Touch: Bimanual Tactile Manipulation with Sim-to-Real Deep
  Reinforcement Learning
Bi-Touch: Bimanual Tactile Manipulation with Sim-to-Real Deep Reinforcement Learning
Yijiong Lin
Alex Church
Max Yang
Haoran Li
John Lloyd
Dandan Zhang
Nathan Lepora
25
27
0
12 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
27
12
0
06 Jul 2023
Learning to Solve Tasks with Exploring Prior Behaviours
Learning to Solve Tasks with Exploring Prior Behaviours
Ruiqi Zhu
Siyuan Li
Tianhong Dai
Chongjie Zhang
Oya Celiktutan
23
3
0
06 Jul 2023
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill
  Learning
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
Andrew Levy
Sreehari Rammohan
A. Allievi
S. Niekum
George Konidaris
36
5
0
06 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of
  Circular Cylinder with Sparse Surface Pressure Sensing
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
23
24
0
05 Jul 2023
Goal Representations for Instruction Following: A Semi-Supervised
  Language Interface to Control
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Vivek Myers
Andre Wang He
Kuan Fang
Homer Walke
Philippe Hansen-Estruch
Ching-An Cheng
Mihai Jalobeanu
Andrey Kolobov
Anca Dragan
Sergey Levine
LM&Ro
27
29
0
30 Jun 2023
HYDRA: Hybrid Robot Actions for Imitation Learning
HYDRA: Hybrid Robot Actions for Imitation Learning
Suneel Belkhale
Yuchen Cui
Dorsa Sadigh
21
36
0
29 Jun 2023
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
21
3
0
29 Jun 2023
MRHER: Model-based Relay Hindsight Experience Replay for Sequential
  Object Manipulation Tasks with Sparse Rewards
MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards
Yuming Huang
Bin Ren
Ziming Xu
Lianghong Wu
OffRL
13
0
0
28 Jun 2023
CEIL: Generalized Contextual Imitation Learning
CEIL: Generalized Contextual Imitation Learning
Jinxin Liu
Li He
Yachen Kang
Zifeng Zhuang
Donglin Wang
Huazhe Xu
36
18
0
26 Jun 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
44
8
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning
  with Intermediate Targets
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
27
16
0
24 Jun 2023
Learning from Pixels with Expert Observations
Learning from Pixels with Expert Observations
M. Hoang
Long Dinh
Hai V. Nguyen
OffRL
32
2
0
24 Jun 2023
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot
  Policy Imitation
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Massimiliano Patacchiola
Mingfei Sun
Katja Hofmann
Richard Turner
OffRL
29
1
0
23 Jun 2023
Granger-Causal Hierarchical Skill Discovery
Granger-Causal Hierarchical Skill Discovery
Caleb Chuck
Kevin Black
Aditya Arjun
Yuke Zhu
S. Niekum
OffRL
38
1
0
15 Jun 2023
Previous
123...567...232425
Next