Hierarchical Reinforcement Learning By Discovering Intrinsic Options

International Conference on Learning Representations (ICLR), 2021
16 January 2021
Jesse Zhang
Haonan Yu
Wenyuan Xu
    BDL

Papers citing "Hierarchical Reinforcement Learning By Discovering Intrinsic Options"

49 / 49 papers shown
The Horcrux: Mechanistically Interpretable Task Decomposition for Detecting and Mitigating Reward Hacking in Embodied AI Systems
Subramanyam Sahoo
Jared Junkin
172
0
0
22 Nov 2025
Taxonomy and Trends in Reinforcement Learning for Robotics and Control Systems: A Structured Review
Kumater Ter
Ore-Ofe Ajayi
Daniel Udekwe
294
0
0
11 Oct 2025
Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation
Xiaofeng Cao
Mingwei Xu
Xin Yu
Jiangchao Yao
Wei Ye
...
Minling Zhang
Ivor Tsang
Yew-Soon Ong
James T. Kwok
Heng Tao Shen
195
13
0
10 Oct 2025
Focused Skill Discovery: Learning to Control Specific State Variables while Minimizing Side Effects
Jonathan Colaço Carr
Qinyi Sun
Cameron Allen
187
0
0
06 Oct 2025
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches
Jiejun Tan
Zhicheng Dou
Yan Yu
Jiehan Cheng
Qiang Ju
Jian Xie
Ji-Rong Wen
161
5
0
11 Aug 2025
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Neural Information Processing Systems (NeurIPS), 2024
Zizhao Wang
Jiaheng Hu
Caleb Chuck
Stephen Chen
Roberto Martín-Martín
Amy Zhang
S. Niekum
Peter Stone
OffRL
326
11
0
24 Oct 2024
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments
Shu Ishida
João F. Henriques
273
1
0
26 Jul 2024
Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning
Yixiao Wang
Yifei Zhang
Mingxiao Huo
Ran Tian
Xiang Zhang
...
Chenfeng Xu
Pengliang Ji
Wei Zhan
Mingyu Ding
Masayoshi Tomizuka
MoE
302
52
0
01 Jul 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
173
0
0
26 Jun 2024
Hierarchical Decision Making Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
298
0
0
15 Apr 2024
Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks
Shaunak A. Mehta
Soheil Habibian
Dylan P. Losey
SSL
229
6
0
20 Mar 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRL, ALM
351
19
0
14 Jan 2024
Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Zhiming Zheng
421
0
0
30 Dec 2023
SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
Zhixuan Liang
Yao Mu
Hengbo Ma
Masayoshi Tomizuka
Mingyu Ding
Ping Luo
504
70
0
18 Dec 2023
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
316
161
0
08 Nov 2023
Learning to Discover Skills through Guidance
Neural Information Processing Systems (NeurIPS), 2023
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Sejik Park
Kyushik Min
Jaegul Choo
401
12
0
31 Oct 2023
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
LRM
405
8
0
27 Oct 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
362
28
0
27 Oct 2023
Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance
Jesse Zhang
Jiahui Zhang
Karl Pertsch
Ziyi Liu
Xiang Ren
Minsuk Chang
Shao-Hua Sun
Joseph J Lim
LLMAG, LM&Ro
490
87
0
16 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
International Conference on Learning Representations (ICLR), 2023
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
415
78
0
13 Oct 2023
Machine Learning Meets Advanced Robotic Manipulation
Information Fusion (Inf. Fusion), 2023
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
256
26
0
22 Sep 2023
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
Andrew Levy
Sreehari Rammohan
A. Allievi
S. Niekum
George Konidaris
259
6
0
06 Jul 2023
Partitioning Distributed Compute Jobs with Reinforcement Learning and Graph Neural Networks
Social Science Research Network (SSRN), 2023
Christopher W. F. Parsonson
Zacharaya Shabka
Alessandro Ottino
G. Zervas
260
0
0
31 Jan 2023
Self-Activating Neural Ensembles for Continual Reinforcement Learning
Sam Powers
Eliot Xing
Abhinav Gupta
KELM, CLL
296
7
0
31 Dec 2022
SHIRO: Soft Hierarchical Reinforcement Learning
Kandai Watanabe
Mathew Strong
Omer Eldar
210
1
0
24 Dec 2022
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
Minghuan Liu
Zhengbang Zhu
Menghui Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
225
0
0
18 Dec 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Jiabo He
Cong Wang
254
2
0
28 Nov 2022
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Giulia Vezzani
Dhruva Tirumala
Markus Wulfmeier
Dushyant Rao
A. Abdolmaleki
...
Tim Hertweck
Thomas Lampe
Fereshteh Sadeghi
N. Heess
Martin Riedmiller
OffRL
335
9
0
24 Nov 2022
Emergency action termination for immediate reaction in hierarchical reinforcement learning
Michal Bortkiewicz
Jakub Lyskawa
Pawel Wawrzyński
M. Ostaszewski
Artur Grudkowski
Tomasz Trzciński
174
0
0
11 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRL, OnRL, AI4CE
251
31
0
08 Nov 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
277
53
0
19 Sep 2022
Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning
Conference on Robot Learning (CoRL), 2022
Jan Achterhold
Markus Krimmel
Joerg Stueckler
352
11
0
11 Jul 2022
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Andrew C. Li
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRL, LRM
174
5
0
03 Jun 2022
Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning
Ugo Lecerf
Christelle Yemdji Tchassi
Pietro Michiardi
192
1
0
13 Apr 2022
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization
International Conference on Machine Learning (ICML), 2022
Minghuan Liu
Zhengbang Zhu
Yuzheng Zhuang
Weinan Zhang
Jianye Hao
Yong Yu
Jun Wang
367
13
0
04 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
Neural Information Processing Systems (NeurIPS), 2022
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&Ro, OffRL
692
35
0
28 Feb 2022
A Survey on Deep Reinforcement Learning-based Approaches for Adaptation and Generalization
Pamul Yadav
Ashutosh Mishra
Junyong Lee
Shiho Kim
OffRL, AI4CE
316
14
0
17 Feb 2022
Lipschitz-constrained Unsupervised Skill Discovery
International Conference on Learning Representations (ICLR), 2022
Seohong Park
Jongwook Choi
Jaekyeom Kim
Honglak Lee
Gunhee Kim
331
68
0
02 Feb 2022
Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Dushyant Rao
Fereshteh Sadeghi
Leonard Hasenclever
Markus Wulfmeier
Martina Zambelli
...
Dhruva Tirumala
Y. Aytar
J. Merel
N. Heess
R. Hadsell
285
33
0
09 Dec 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
International Conference on Learning Representations (ICLR), 2021
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
284
49
0
04 Nov 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
433
23
0
27 Oct 2021
Hierarchical Skills for Efficient Exploration
Neural Information Processing Systems (NeurIPS), 2021
Jonas Gehring
Gabriel Synnaeve
Andreas Krause
Nicolas Usunier
282
48
0
20 Oct 2021
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism
Zhiwei Xu
Yunpeng Bai
Bin Zhang
Dapeng Li
Guoliang Fan
377
44
0
14 Oct 2021
HAC Explore: Accelerating Exploration with Hierarchical Reinforcement Learning
Willie McClinton
Andrew Levy
George Konidaris
125
5
0
12 Aug 2021
Unsupervised Skill Discovery with Bottleneck Option Learning
International Conference on Machine Learning (ICML), 2021
Jaekyeom Kim
Seohong Park
Gunhee Kim
265
39
0
27 Jun 2021
TAAC: Temporally Abstract Actor-Critic for Continuous Control
Neural Information Processing Systems (NeurIPS), 2021
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
235
26
0
13 Apr 2021
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jinxin Liu
Xuetao Zhang
Qiangxing Tian
Ruihao Zhang
329
26
0
11 Apr 2021
Program Synthesis Guided Reinforcement Learning for Partially Observed Environments
Neural Information Processing Systems (NeurIPS), 2021
Yichen Yang
J. Inala
Osbert Bastani
Yewen Pu
Armando Solar-Lezama
Martin Rinard
310
12
0
22 Feb 2021
Dynamic Subgoal-based Exploration via Bayesian Optimization
Yijia Wang
Matthias Poloczek
Daniel R. Jiang
460
4
0
21 Oct 2019