ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.03012
  4. Cited By
Stochastic Neural Networks for Hierarchical Reinforcement Learning

Stochastic Neural Networks for Hierarchical Reinforcement Learning

10 April 2017
Carlos Florensa
Yan Duan
Pieter Abbeel
    BDL
ArXiv (abs)PDFHTML

Papers citing "Stochastic Neural Networks for Hierarchical Reinforcement Learning"

50 / 221 papers shown
Versatile Skill Control via Self-supervised Adversarial Imitation of
  Unlabeled Mixed Motions
Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed MotionsIEEE International Conference on Robotics and Automation (ICRA), 2022
Chenhao Li
Sebastian Blaes
Pavel Kolev
Marin Vlastelica
Jonas Frey
Georg Martius
SSL
248
38
0
16 Sep 2022
Learning Temporally Extended Skills in Continuous Domains as Symbolic
  Actions for Planning
Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for PlanningConference on Robot Learning (CoRL), 2022
Jan Achterhold
Markus Krimmel
Joerg Stueckler
319
11
0
11 Jul 2022
CompoSuite: A Compositional Reinforcement Learning Benchmark
CompoSuite: A Compositional Reinforcement Learning Benchmark
Jorge Armando Mendez Mendez
Marcel Hussing
Meghna Gummadi
Eric Eaton
CoGeOffRL
218
16
0
08 Jul 2022
Matching options to tasks using Option-Indexed Hierarchical
  Reinforcement Learning
Matching options to tasks using Option-Indexed Hierarchical Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Kushal Chauhan
Soumya Chatterjee
Akash Reddy
Balaraman Ravindran
Pradeep Shenoy
125
0
0
12 Jun 2022
Deep Hierarchical Planning from Pixels
Deep Hierarchical Planning from PixelsNeural Information Processing Systems (NeurIPS), 2022
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
223
118
0
08 Jun 2022
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Andrew C. Li
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRLLRM
135
5
0
03 Jun 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement
  Learning
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
367
6
0
26 May 2022
Developing cooperative policies for multi-stage reinforcement learning
  tasks
Developing cooperative policies for multi-stage reinforcement learning tasksIEEE Robotics and Automation Letters (RA-L), 2022
J. Erskine
Christopher F. Lehnert
176
12
0
11 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically
  Simulated Characters
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated CharactersACM Transactions on Graphics (TOG), 2022
Xue Bin Peng
Yunrong Guo
L. Halper
Sergey Levine
Sanja Fidler
188
15
0
04 May 2022
Retrieval-Augmented Reinforcement Learning
Retrieval-Augmented Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
406
66
0
17 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward FunctionsNeural Information Processing Systems (NeurIPS), 2022
Robert Meier
Asier Mujika
290
7
0
16 Feb 2022
ASC me to Do Anything: Multi-task Training for Embodied AI
ASC me to Do Anything: Multi-task Training for Embodied AI
Jiasen Lu
Jordi Salvador
Roozbeh Mottaghi
Aniruddha Kembhavi
186
3
0
14 Feb 2022
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill
  Acquisition
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Dylan Slack
Yinlam Chow
Bo Dai
Nevan Wichers
OffRL
330
7
0
10 Feb 2022
Bayesian Nonparametrics for Offline Skill Discovery
Bayesian Nonparametrics for Offline Skill DiscoveryInternational Conference on Machine Learning (ICML), 2022
Valentin Villecroze
H. Braviner
Panteha Naderian
Chris J. Maddison
Gabriel Loaiza-Ganem
BDLOffRL
266
9
0
09 Feb 2022
GrASP: Gradient-Based Affordance Selection for Planning
GrASP: Gradient-Based Affordance Selection for Planning
Vivek Veeriah
Zeyu Zheng
Richard L. Lewis
Satinder Singh
178
4
0
08 Feb 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin
Hao Liu
Xue Bin Peng
Denis Yarats
Aravind Rajeswaran
Pieter Abbeel
SSL
483
76
0
01 Feb 2022
Transfering Hierarchical Structure with Dual Meta Imitation Learning
Transfering Hierarchical Structure with Dual Meta Imitation Learning
Chongkai Gao
Yizhou Jiang
F. Chen
178
9
0
28 Jan 2022
Generative Planning for Temporally Coordinated Exploration in
  Reinforcement Learning
Generative Planning for Temporally Coordinated Exploration in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Haichao Zhang
Wei Xu
Haonan Yu
249
11
0
24 Jan 2022
Learning Transferable Motor Skills with Hierarchical Latent Mixture
  Policies
Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Dushyant Rao
Fereshteh Sadeghi
Leonard Hasenclever
Markus Wulfmeier
Martina Zambelli
...
Dhruva Tirumala
Y. Aytar
J. Merel
N. Heess
R. Hadsell
241
33
0
09 Dec 2021
Hierarchical Reinforcement Learning with Timed Subgoals
Hierarchical Reinforcement Learning with Timed SubgoalsNeural Information Processing Systems (NeurIPS), 2021
Nico Gürtler
Le Chen
Georg Martius
263
30
0
06 Dec 2021
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RANIEEE Access (IEEE Access), 2021
Peizheng Li
Jonathan D. Thomas
Xiaoyang Wang
Ahmed Khalil
A. Ahmad
...
S. Kapoor
Arjun Parekh
A. Doufexi
Arman Shojaeifard
Robert Piechocki
AI4TS
178
48
0
12 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon
  Reasoning
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon ReasoningInternational Conference on Learning Representations (ICLR), 2021
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
234
48
0
04 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Adjacency constraint for efficient hierarchical reinforcement learningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
443
22
0
30 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State
  Covering and Goal Reaching
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
365
21
0
27 Oct 2021
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement
  Learning
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
338
75
0
26 Oct 2021
Hierarchical Skills for Efficient Exploration
Hierarchical Skills for Efficient ExplorationNeural Information Processing Systems (NeurIPS), 2021
Jonas Gehring
Gabriel Synnaeve
Andreas Krause
Nicolas Usunier
233
47
0
20 Oct 2021
Socialbots on Fire: Modeling Adversarial Behaviors of Socialbots via
  Multi-Agent Hierarchical Reinforcement Learning
Socialbots on Fire: Modeling Adversarial Behaviors of Socialbots via Multi-Agent Hierarchical Reinforcement LearningThe Web Conference (WWW), 2021
Thai Le
Long Tran-Thanh
tql
AAML
188
9
0
20 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior
  Engineering beyond Reward Maximization
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
140
15
0
10 Oct 2021
Training Transition Policies via Distribution Matching for Complex Tasks
Training Transition Policies via Distribution Matching for Complex TasksInternational Conference on Learning Representations (ICLR), 2021
Ju-Seung Byun
Andrew Perrault
127
6
0
08 Oct 2021
Pick Your Battles: Interaction Graphs as Population-Level Objectives for
  Strategic Diversity
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic DiversityAdaptive Agents and Multi-Agent Systems (AAMAS), 2021
M. Garnelo
Wojciech M. Czarnecki
Siqi Liu
Dhruva Tirumala
Junhyuk Oh
Gauthier Gidel
H. V. Hasselt
David Balduzzi
230
25
0
08 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning
The Information Geometry of Unsupervised Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
SSLOffRL
271
38
0
06 Oct 2021
Self-supervised Reinforcement Learning with Independently Controllable
  Subgoals
Self-supervised Reinforcement Learning with Independently Controllable SubgoalsConference on Robot Learning (CoRL), 2021
Antonios Tragoudaras
Georg Martius
Fanny Yang
SSL
232
18
0
09 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning
  Algorithms
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
103
3
0
04 Sep 2021
Learning Meta Representations for Agents in Multi-Agent Reinforcement
  Learning
Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning
Shenao Zhang
Li Shen
Lei Han
Li Shen
229
8
0
30 Aug 2021
Collect & Infer -- a fresh look at data-efficient Reinforcement Learning
Collect & Infer -- a fresh look at data-efficient Reinforcement LearningConference on Robot Learning (CoRL), 2021
Martin Riedmiller
Jost Tobias Springenberg
Agrim Gupta
N. Heess
OffRL
174
21
0
23 Aug 2021
Unsupervised Skill-Discovery and Skill-Learning in Minecraft
Unsupervised Skill-Discovery and Skill-Learning in Minecraft
J. J. Nieto
Roger Creus
Xavier Giró-i-Nieto
SSLDRL
168
5
0
18 Jul 2021
Adaptable Agent Populations via a Generative Model of Policies
Adaptable Agent Populations via a Generative Model of PoliciesNeural Information Processing Systems (NeurIPS), 2021
Kenneth Derek
Phillip Isola
200
16
0
15 Jul 2021
Motion Planning by Learning the Solution Manifold in Trajectory
  Optimization
Motion Planning by Learning the Solution Manifold in Trajectory Optimization
Takayuki Osa
168
25
0
13 Jul 2021
Towards Autonomous Pipeline Inspection with Hierarchical Reinforcement
  Learning
Towards Autonomous Pipeline Inspection with Hierarchical Reinforcement LearningInternational Conference on Robot Intelligence Technology and Applications (RITA), 2021
N. Botteghi
L.J.L. Grefte
M. Poel
B. Sirmaçek
C. Brune
Edwin Dertien
Stefano Stramigioli
127
6
0
08 Jul 2021
Unsupervised Skill Discovery with Bottleneck Option Learning
Unsupervised Skill Discovery with Bottleneck Option LearningInternational Conference on Machine Learning (ICML), 2021
Jaekyeom Kim
Seohong Park
Gunhee Kim
209
38
0
27 Jun 2021
Discovering Generalizable Skills via Automated Generation of Diverse
  Tasks
Discovering Generalizable Skills via Automated Generation of Diverse Tasks
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
188
8
0
26 Jun 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via
  Relabeling Experience and Unsupervised Pre-training
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-trainingInternational Conference on Machine Learning (ICML), 2021
Kimin Lee
Laura M. Smith
Pieter Abbeel
OffRL
405
351
0
09 Jun 2021
DisTop: Discovering a Topological representation to learn diverse and
  rewarding skills
DisTop: Discovering a Topological representation to learn diverse and rewarding skillsIEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2021
A. Aubret
L. Matignon
S. Hassas
194
12
0
06 Jun 2021
Learning Routines for Effective Off-Policy Reinforcement Learning
Learning Routines for Effective Off-Policy Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Edoardo Cetin
Oya Celiktutan
113
1
0
05 Jun 2021
Variational Empowerment as Representation Learning for Goal-Based
  Reinforcement Learning
Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning
Jongwook Choi
Archit Sharma
Honglak Lee
Sergey Levine
S. Gu
DRL
211
23
0
02 Jun 2021
Reward is enough for convex MDPs
Reward is enough for convex MDPsNeural Information Processing Systems (NeurIPS), 2021
Tom Zahavy
Brendan O'Donoghue
Guillaume Desjardins
Satinder Singh
330
81
0
01 Jun 2021
Reducing the Deployment-Time Inference Control Costs of Deep
  Reinforcement Learning Agents via an Asymmetric Architecture
Reducing the Deployment-Time Inference Control Costs of Deep Reinforcement Learning Agents via an Asymmetric ArchitectureIEEE International Conference on Robotics and Automation (ICRA), 2021
Chin-Jui Chang
Yu-Wei Chu
Chao-Hsien Ting
Hao-Kang Liu
Zhang-Wei Hong
Chun-Yi Lee
AI4CE
108
2
0
30 May 2021
Composable Energy Policies for Reactive Motion Generation and
  Reinforcement Learning
Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Julen Urain
Anqi Li
Puze Liu
Carlo DÉramo
Jan Peters
202
31
0
11 May 2021
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods
  Inspired by Stigmergy and Ant Colonies
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods Inspired by Stigmergy and Ant Colonies
Austin Nguyen
126
2
0
08 May 2021
GAN-Based Interactive Reinforcement Learning from Demonstration and
  Human Evaluative Feedback
GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative FeedbackIEEE International Conference on Robotics and Automation (ICRA), 2021
Jie Huang
Rongshun Juan
R. Gomez
Keisuke Nakamura
Q. Sha
Bo He
Guangliang Li
195
13
0
14 Apr 2021
Previous
12345
Next