ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.03012
  4. Cited By
Stochastic Neural Networks for Hierarchical Reinforcement Learning

Stochastic Neural Networks for Hierarchical Reinforcement Learning

10 April 2017
Carlos Florensa
Yan Duan
Pieter Abbeel
    BDL
ArXiv (abs)PDFHTML

Papers citing "Stochastic Neural Networks for Hierarchical Reinforcement Learning"

50 / 221 papers shown
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary
  Rewards
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary RewardsNeural Information Processing Systems (NeurIPS), 2019
Siyuan Li
Rui Wang
Minxue Tang
Chongjie Zhang
158
91
0
10 Oct 2019
Imagined Value Gradients: Model-Based Policy Optimization with
  Transferable Latent Dynamics Models
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics ModelsConference on Robot Learning (CoRL), 2019
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Agrim Gupta
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
181
44
0
09 Oct 2019
Playing Atari Ball Games with Hierarchical Reinforcement Learning
Playing Atari Ball Games with Hierarchical Reinforcement Learning
Hua Huang
Adrian Barbu
85
0
0
27 Sep 2019
Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?
Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?
Ofir Nachum
Haoran Tang
Xingyu Lu
S. Gu
Honglak Lee
Sergey Levine
265
116
0
23 Sep 2019
Dynamics-aware Embeddings
Dynamics-aware EmbeddingsInternational Conference on Learning Representations (ICLR), 2019
William F. Whitney
Rajat Agarwal
Dong Wang
Abhinav Gupta
SSL
297
55
0
25 Aug 2019
A survey on intrinsic motivation in reinforcement learning
A survey on intrinsic motivation in reinforcement learning
A. Aubret
L. Matignon
S. Hassas
AI4CE
404
158
0
19 Aug 2019
Are You for Real? Detecting Identity Fraud via Dialogue Interactions
Are You for Real? Detecting Identity Fraud via Dialogue InteractionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Weikang Wang
Jiajun Zhang
Cunliang Kong
Chengqing Zong
Zhifei Li
197
19
0
19 Aug 2019
Learning to combine primitive skills: A step towards versatile robotic
  manipulation
Learning to combine primitive skills: A step towards versatile robotic manipulation
Robin Strudel
Alexander Pashevich
Igor Kalevatykh
Ivan Laptev
Josef Sivic
Cordelia Schmid
217
4
0
02 Aug 2019
Hybrid system identification using switching density networks
Hybrid system identification using switching density networksConference on Robot Learning (CoRL), 2019
Michael G. Burke
Yordan V. Hristov
S. Ramamoorthy
355
13
0
09 Jul 2019
Dynamics-Aware Unsupervised Discovery of Skills
Dynamics-Aware Unsupervised Discovery of SkillsInternational Conference on Learning Representations (ICLR), 2019
Archit Sharma
S. Gu
Sergey Levine
Vikash Kumar
Karol Hausman
350
454
0
02 Jul 2019
Reinforcement Learning with Competitive Ensembles of
  Information-Constrained Primitives
Reinforcement Learning with Competitive Ensembles of Information-Constrained PrimitivesInternational Conference on Learning Representations (ICLR), 2019
Anirudh Goyal
Shagun Sodhani
Jonathan Binas
Xue Bin Peng
Sergey Levine
Yoshua Bengio
134
54
0
25 Jun 2019
Language as an Abstraction for Hierarchical Deep Reinforcement Learning
Language as an Abstraction for Hierarchical Deep Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2019
Yiding Jiang
S. Gu
Kevin Patrick Murphy
Chelsea Finn
OffRL
255
238
0
18 Jun 2019
Goal-conditioned Imitation Learning
Goal-conditioned Imitation LearningNeural Information Processing Systems (NeurIPS), 2019
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
380
257
0
13 Jun 2019
Sub-policy Adaptation for Hierarchical Reinforcement Learning
Sub-policy Adaptation for Hierarchical Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2019
Alexander C. Li
Carlos Florensa
I. Clavera
Pieter Abbeel
362
82
0
13 Jun 2019
Transfer Learning by Modeling a Distribution over Policies
Transfer Learning by Modeling a Distribution over Policies
Disha Shrivastava
Eeshan Gunesh Dhekane
Riashat Islam
OODOffRL
114
0
0
09 Jun 2019
CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale
  Ride-Hailing Platforms
CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing PlatformsInternational Conference on Information and Knowledge Management (CIKM), 2019
Jiarui Jin
M. Zhou
Weinan Zhang
Minne Li
Zilong Guo
...
Xiaocheng Tang
Chenxi Wang
Jun Wang
Guobin Wu
Jieping Ye
142
106
0
27 May 2019
Composing Task-Agnostic Policies with Deep Reinforcement Learning
Composing Task-Agnostic Policies with Deep Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2019
A. H. Qureshi
Jacob J. Johnson
Yuzhe Qin
Taylor Henderson
Byron Boots
Michael C. Yip
OffRL
186
33
0
25 May 2019
MCP: Learning Composable Hierarchical Control with Multiplicative
  Compositional Policies
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional PoliciesNeural Information Processing Systems (NeurIPS), 2019
Xue Bin Peng
Michael Chang
Grace Zhang
Pieter Abbeel
Sergey Levine
179
216
0
23 May 2019
A Regularized Opponent Model with Maximum Entropy Objective
A Regularized Opponent Model with Maximum Entropy ObjectiveInternational Joint Conference on Artificial Intelligence (IJCAI), 2019
Zheng Tian
Ying Wen
Zhichen Gong
Faiz Punakkath
Shihao Zou
Jun Wang
148
35
0
17 May 2019
MaMiC: Macro and Micro Curriculum for Robotic Reinforcement Learning
MaMiC: Macro and Micro Curriculum for Robotic Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2019
Manan Tomar
Akhil Sathuluri
Balaraman Ravindran
127
4
0
17 May 2019
Meta Reinforcement Learning with Task Embedding and Shared Policy
Meta Reinforcement Learning with Task Embedding and Shared PolicyInternational Joint Conference on Artificial Intelligence (IJCAI), 2019
Lin Lan
Zhenguo Li
X. Guan
Peijie Wang
OffRL
288
52
0
16 May 2019
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards
Mega-Reward: Achieving Human-Level Play without Extrinsic RewardsAAAI Conference on Artificial Intelligence (AAAI), 2019
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Shangtong Zhang
Andrzej Wojcicki
Mai Xu
LRM
360
16
0
12 May 2019
Continual and Multi-task Reinforcement Learning With Shared Episodic
  Memory
Continual and Multi-task Reinforcement Learning With Shared Episodic Memory
A. Sorokin
Andrey Kravchenko
KELMCLL
116
5
0
07 May 2019
Hierarchical Policy Learning is Sensitive to Goal Space Design
Hierarchical Policy Learning is Sensitive to Goal Space Design
Zach Dwiel
Madhavun Candadai
Mariano Phielipp
Arjun K. Bansal
230
15
0
04 May 2019
Routing Networks and the Challenges of Modular and Compositional
  Computation
Routing Networks and the Challenges of Modular and Compositional Computation
Clemens Rosenbaum
Ignacio Cases
Matthew D Riemer
Tim Klinger
216
93
0
29 Apr 2019
DAC: The Double Actor-Critic Architecture for Learning Options
DAC: The Double Actor-Critic Architecture for Learning OptionsNeural Information Processing Systems (NeurIPS), 2019
Shangtong Zhang
Shimon Whiteson
509
84
0
29 Apr 2019
Multitask Soft Option Learning
Multitask Soft Option Learning
Maximilian Igl
Andrew Gambardella
Jinke He
Nantas Nardelli
N. Siddharth
Wendelin Bohmer
Shimon Whiteson
346
26
0
01 Apr 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
167
45
0
18 Mar 2019
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Vitchyr H. Pong
Murtaza Dalal
Steven Lin
Ashvin Nair
Shikhar Bahl
Sergey Levine
OffRLSSL
497
297
0
08 Mar 2019
The Termination Critic
The Termination CriticInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2019
Anna Harutyunyan
Will Dabney
Diana Borsa
N. Heess
Rémi Munos
Doina Precup
OffRL
162
54
0
26 Feb 2019
Self-organization of action hierarchy and compositionality by
  reinforcement learning with recurrent neural networks
Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks
Dongqi Han
Kenji Doya
Jun Tani
AI4CE
317
21
0
29 Jan 2019
Hierarchical Reinforcement Learning via Advantage-Weighted Information
  Maximization
Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
OffRL
202
32
0
05 Jan 2019
Self-supervised Learning of Image Embedding for Continuous Control
Self-supervised Learning of Image Embedding for Continuous Control
Carlos Florensa
Jonas Degrave
N. Heess
Jost Tobias Springenberg
Martin Riedmiller
SSL
155
55
0
03 Jan 2019
NADPEx: An on-policy temporally consistent exploration method for deep
  reinforcement learning
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
Sirui Xie
Junning Huang
Lanxin Lei
Chunxiao Liu
Zheng Ma
Wayne Zhang
Liang Lin
109
9
0
21 Dec 2018
CompILE: Compositional Imitation Learning and Execution
CompILE: Compositional Imitation Learning and Execution
Thomas Kipf
Yujia Li
H. Dai
V. Zambaldi
Alvaro Sanchez-Gonzalez
Edward Grefenstette
Pushmeet Kohli
Peter W. Battaglia
VLM
225
14
0
04 Dec 2018
Modulated Policy Hierarchies
Modulated Policy Hierarchies
Alexander Pashevich
Danijar Hafner
James Davidson
Rahul Sukthankar
Cordelia Schmid
88
6
0
30 Nov 2018
An Introduction to Deep Reinforcement Learning
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRLAI4CE
413
1,406
0
30 Nov 2018
Unsupervised Control Through Non-Parametric Discriminative Rewards
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRLOffRLSSL
218
187
0
28 Nov 2018
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Mai Xu
150
19
0
10 Nov 2018
Deep Reinforcement Learning
Deep Reinforcement Learning
Yuxi Li
VLMOffRL
361
143
0
15 Oct 2018
InfoSSM: Interpretable Unsupervised Learning of Nonparametric
  State-Space Model for Multi-modal Dynamics
InfoSSM: Interpretable Unsupervised Learning of Nonparametric State-Space Model for Multi-modal Dynamics
Young-Jin Park
Han-Lim Choi
99
0
0
19 Sep 2018
Variational Option Discovery Algorithms
Variational Option Discovery Algorithms
Joshua Achiam
Harrison Edwards
Dario Amodei
Pieter Abbeel
DRL
228
193
0
26 Jul 2018
Representational efficiency outweighs action efficiency in human program
  induction
Representational efficiency outweighs action efficiency in human program inductionAnnual Meeting of the Cognitive Science Society (CogSci), 2018
Sophia Sanborn
David D. Bourgin
Michael Chang
Tom Griffiths
137
8
0
18 Jul 2018
Visual Reinforcement Learning with Imagined Goals
Visual Reinforcement Learning with Imagined Goals
Ashvin Nair
Vitchyr H. Pong
Murtaza Dalal
Shikhar Bahl
Steven Lin
Sergey Levine
SSL
306
577
0
12 Jul 2018
VFunc: a Deep Generative Model for Functions
VFunc: a Deep Generative Model for Functions
Philip Bachman
Riashat Islam
Alessandro Sordoni
Zafarali Ahmed
VLMBDL
141
8
0
11 Jul 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
202
528
0
14 Jun 2018
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement
  Learning with Trajectory Embeddings
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
John D. Co-Reyes
YuXuan Liu
Abhishek Gupta
Benjamin Eysenbach
Pieter Abbeel
Sergey Levine
SSLBDLAIFin
149
153
0
07 Jun 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
156
75
0
25 May 2018
Data-Efficient Hierarchical Reinforcement Learning
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
565
915
0
21 May 2018
Latent Space Policies for Hierarchical Reinforcement Learning
Latent Space Policies for Hierarchical Reinforcement Learning
Tuomas Haarnoja
Kristian Hartikainen
Pieter Abbeel
Sergey Levine
BDL
189
202
0
09 Apr 2018
Previous
12345
Next
Page 4 of 5