ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.08294
  4. Cited By
Multi-Level Discovery of Deep Options

Multi-Level Discovery of Deep Options

24 March 2017
Roy Fox
S. Krishnan
Ion Stoica
Ken Goldberg
ArXivPDFHTML

Papers citing "Multi-Level Discovery of Deep Options"

50 / 79 papers shown
Title
Hierarchical Task Decomposition for Execution Monitoring and Error Recovery: Understanding the Rationale Behind Task Demonstrations
Hierarchical Task Decomposition for Execution Monitoring and Error Recovery: Understanding the Rationale Behind Task Demonstrations
Christoph Willibald
Dongheui Lee
47
0
0
07 May 2025
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning
Chak Lam Shek
Pratap Tokekar
53
0
0
24 Mar 2025
Data Augmentation for Instruction Following Policies via Trajectory Segmentation
Niklas Höpner
Ilaria Tiddi
H. V. Hoof
47
0
0
25 Feb 2025
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Amirhossein Mesbah
Reshad Hosseini
Seyed Pooya Shariatpanahi
M. N. Ahmadabadi
77
0
0
21 Dec 2024
Accelerating Task Generalisation with Multi-Level Skill Hierarchies
Accelerating Task Generalisation with Multi-Level Skill Hierarchies
Thomas P Cannon
Özgür Simsek
AI4CE
41
0
0
05 Nov 2024
Identifying Selections for Unsupervised Subtask Discovery
Identifying Selections for Unsupervised Subtask Discovery
Yiwen Qiu
Yujia Zheng
Kun Zhang
32
0
0
28 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
57
0
0
23 Oct 2024
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement
  Learning in POMDP Environments
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments
Shu Ishida
João F. Henriques
41
0
0
26 Jul 2024
Language-guided Skill Learning with Temporal Variational Inference
Language-guided Skill Learning with Temporal Variational Inference
Haotian Fu
Pratyusha Sharma
Elias Stengel-Eskin
George Konidaris
Nicolas Le Roux
Marc-Alexandre Côté
Xingdi Yuan
38
7
0
26 Feb 2024
LOTUS: Continual Imitation Learning for Robot Manipulation Through
  Unsupervised Skill Discovery
LOTUS: Continual Imitation Learning for Robot Manipulation Through Unsupervised Skill Discovery
Weikang Wan
Yifeng Zhu
Rutav Shah
Yuke Zhu
SSL
LM&Ro
VLM
CLL
36
23
0
03 Nov 2023
Diversity for Contingency: Learning Diverse Behaviors for Efficient
  Adaptation and Transfer
Diversity for Contingency: Learning Diverse Behaviors for Efficient Adaptation and Transfer
Finn Rietz
J. A. Stork
33
0
0
11 Oct 2023
Iterative Option Discovery for Planning, by Planning
Iterative Option Discovery for Planning, by Planning
Kenny Young
Richard S. Sutton
25
2
0
02 Oct 2023
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Xiaoyu Huang
Dhruv Batra
Akshara Rai
Andrew Szot
LM&Ro
38
21
0
19 Aug 2023
XSkill: Cross Embodiment Skill Discovery
XSkill: Cross Embodiment Skill Discovery
Mengda Xu
Zhenjia Xu
Cheng Chi
Manuela Veloso
Shuran Song
36
65
0
19 Jul 2023
Landmark Guided Active Exploration with State-specific Balance
  Coefficient
Landmark Guided Active Exploration with State-specific Balance Coefficient
Fei Cui
Jiaojiao Fang
Mengke Yang
Guizhong Liu
25
0
0
30 Jun 2023
Creating Multi-Level Skill Hierarchies in Reinforcement Learning
Creating Multi-Level Skill Hierarchies in Reinforcement Learning
Joshua B. Evans
Özgür Simsek
9
3
0
16 Jun 2023
PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical
  Reinforcement Learning
PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical Reinforcement Learning
Utsav Singh
Vinay P. Namboodiri
OffRL
36
3
0
10 Jun 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
31
3
0
07 Apr 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
35
1
0
23 Mar 2023
Reusable Options through Gradient-based Meta Learning
Reusable Options through Gradient-based Meta Learning
David Kuric
H. V. Hoof
32
0
0
22 Dec 2022
Learning Options via Compression
Learning Options via Compression
Yiding Jiang
E. Liu
Benjamin Eysenbach
Zico Kolter
Chelsea Finn
OffRL
25
13
0
08 Dec 2022
Dichotomy of Control: Separating What You Can Control from What You
  Cannot
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
25
42
0
24 Oct 2022
Learning Neuro-Symbolic Skills for Bilevel Planning
Learning Neuro-Symbolic Skills for Bilevel Planning
Tom Silver
Ashay Athalye
J. Tenenbaum
Tomas Lozano-Perez
L. Kaelbling
37
59
0
21 Jun 2022
Temporal Logic Imitation: Learning Plan-Satisficing Motion Policies from
  Demonstrations
Temporal Logic Imitation: Learning Plan-Satisficing Motion Policies from Demonstrations
Yanwei Wang
Nadia Figueroa
Shen Li
Ankit J. Shah
J. Shah
11
21
0
09 Jun 2022
A Versatile Agent for Fast Learning from Human Instructors
A Versatile Agent for Fast Learning from Human Instructors
Yiwen Chen
Zedong Zhang
Hao-Kang Liu
Jiayi Tan
C. Chew
Marcelo H. Ang Jr
12
0
0
01 Mar 2022
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill
  Acquisition
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Dylan Slack
Yinlam Chow
Bo Dai
Nevan Wichers
OffRL
24
7
0
10 Feb 2022
Bayesian Nonparametrics for Offline Skill Discovery
Bayesian Nonparametrics for Offline Skill Discovery
Valentin Villecroze
H. Braviner
Panteha Naderian
Chris J. Maddison
G. Loaiza-Ganem
BDL
OffRL
20
8
0
09 Feb 2022
State-Conditioned Adversarial Subgoal Generation
State-Conditioned Adversarial Subgoal Generation
V. Wang
Joni Pajarinen
Tinghuai Wang
Joni-Kristian Kämäräinen
55
11
0
24 Jan 2022
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon
  Reasoning
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
32
41
0
04 Nov 2021
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
Mengjiao Yang
Sergey Levine
Ofir Nachum
OffRL
41
42
0
27 Oct 2021
Skill Induction and Planning with Latent Language
Skill Induction and Planning with Latent Language
Pratyusha Sharma
Antonio Torralba
Jacob Andreas
LM&Ro
202
108
0
04 Oct 2021
Bottom-Up Skill Discovery from Unsegmented Demonstrations for
  Long-Horizon Robot Manipulation
Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation
Yifeng Zhu
Peter Stone
Yuke Zhu
45
61
0
28 Sep 2021
Hierarchical Representation Learning for Markov Decision Processes
Hierarchical Representation Learning for Markov Decision Processes
Lorenzo Steccanella
Simone Totaro
Anders Jonsson
22
4
0
03 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
31
129
0
25 May 2021
Online Baum-Welch algorithm for Hierarchical Imitation Learning
Online Baum-Welch algorithm for Hierarchical Imitation Learning
Vittorio Giammarino
I. Paschalidis
OffRL
16
2
0
22 Mar 2021
Learning Task Decomposition with Ordered Memory Policy Network
Learning Task Decomposition with Ordered Memory Policy Network
Yucheng Lu
Songlin Yang
Siyuan Zhou
Aaron Courville
J. Tenenbaum
Chuang Gan
19
15
0
19 Mar 2021
WFA-IRL: Inverse Reinforcement Learning of Autonomous Behaviors Encoded
  as Weighted Finite Automata
WFA-IRL: Inverse Reinforcement Learning of Autonomous Behaviors Encoded as Weighted Finite Automata
Tianyu Wang
Nikolay Atanasov
24
0
0
10 Mar 2021
Gesture Recognition in Robotic Surgery: a Review
Gesture Recognition in Robotic Surgery: a Review
Beatrice van Amsterdam
Matthew J. Clarkson
Danail Stoyanov
42
91
0
29 Jan 2021
Augmenting Policy Learning with Routines Discovered from a Single
  Demonstration
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
Zelin Zhao
Chuang Gan
Jiajun Wu
Xiaoxiao Guo
J. Tenenbaum
OffRL
11
5
0
23 Dec 2020
From Pixels to Legs: Hierarchical Learning of Quadruped Locomotion
From Pixels to Legs: Hierarchical Learning of Quadruped Locomotion
Deepali Jain
Atil Iscen
Ken Caluwaerts
26
35
0
23 Nov 2020
Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
Avi Singh
Huihan Liu
G. Zhou
Albert Yu
Nicholas Rhinehart
Sergey Levine
OffRL
OnRL
30
138
0
19 Nov 2020
Behavior Priors for Efficient Reinforcement Learning
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
Broadly-Exploring, Local-Policy Trees for Long-Horizon Task Planning
Broadly-Exploring, Local-Policy Trees for Long-Horizon Task Planning
Brian Ichter
P. Sermanet
Corey Lynch
19
38
0
13 Oct 2020
Provable Hierarchical Imitation Learning via EM
Provable Hierarchical Imitation Learning via EM
Zhiyu Zhang
I. Paschalidis
27
17
0
07 Oct 2020
Data-efficient Hindsight Off-policy Option Learning
Data-efficient Hindsight Off-policy Option Learning
Markus Wulfmeier
Dushyant Rao
Roland Hafner
Thomas Lampe
A. Abdolmaleki
...
Michael Neunert
Dhruva Tirumala
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
23
47
0
30 Jul 2020
Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot
  Transfer
Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot Transfer
Thomas J. Ringstrom
Mohammadhosein Hasanbeig
Alessandro Abate
13
3
0
06 Jul 2020
Model-based Reinforcement Learning: A Survey
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
27
47
0
30 Jun 2020
Learning Robot Skills with Temporal Variational Inference
Learning Robot Skills with Temporal Variational Inference
Tanmay Shankar
Abhinav Gupta
DRL
BDL
38
74
0
29 Jun 2020
SOAC: The Soft Option Actor-Critic Architecture
SOAC: The Soft Option Actor-Critic Architecture
Chenghao Li
Xiaoteng Ma
Chongjie Zhang
Jun Yang
L. Xia
Qianchuan Zhao
20
6
0
25 Jun 2020
Modeling Long-horizon Tasks as Sequential Interaction Landscapes
Modeling Long-horizon Tasks as Sequential Interaction Landscapes
Soren Pirk
Karol Hausman
Alexander Toshev
Mohi Khansari
22
27
0
08 Jun 2020
12
Next