Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1206.6473
Cited By
Compositional Planning Using Optimal Option Models
International Conference on Machine Learning (ICML), 2012
27 June 2012
David Silver
K. Ciosek
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Compositional Planning Using Optimal Option Models"
35 / 35 papers shown
Multi-layer Abstraction for Nested Generation of Options (MANGO) in Hierarchical Reinforcement Learning
Alessio Arcudi
Davide Sartor
Alberto Sinigaglia
Vincent François-Lavet
Gian Antonio Susto
83
0
0
25 Aug 2025
A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes
Thomas J. Ringstrom
Paul Schrater
160
0
0
11 Jun 2025
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Giuseppe Paolo
Khyati Khandelwal
Abdelhakim Benechehab
Albert Thomas
Jun Yao
387
4
0
21 Feb 2025
Contrastive Abstraction for Reinforcement Learning
Vihang Patil
M. Hofmarcher
Elisabeth Rumetshofer
Sepp Hochreiter
OffRL
SSL
305
4
0
01 Oct 2024
A General Theory for Compositional Generalization
Jingwen Fu
Zhizheng Zhang
Yan Lu
Nanning Zheng
AI4CE
CoGe
254
2
0
20 May 2024
Hierarchical Reinforcement Learning for Power Network Topology Control
Blazej Manczak
Jan Viebahn
H. V. Hoof
238
9
0
03 Nov 2023
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Mingde Zhao
Safa Alver
H. V. Seijen
Romain Laroche
Doina Precup
Yoshua Bengio
474
4
0
30 Sep 2023
Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills
Ben-ya Halevy
Y. Aperstein
Dotan Di Castro
GP
OffRL
173
1
0
23 Jun 2023
Hierarchical Imitation Learning with Vector Quantized Models
International Conference on Machine Learning (ICML), 2023
Kalle Kujanpää
Joni Pajarinen
Alexander Ilin
251
16
0
30 Jan 2023
Multi-Task Option Learning and Discovery for Stochastic Path Planning
Naman Shah
Siddharth Srivastava
331
2
0
30 Sep 2022
Generalised Policy Improvement with Geometric Policy Composition
International Conference on Machine Learning (ICML), 2022
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
186
10
0
17 Jun 2022
Reward-Respecting Subtasks for Model-Based Reinforcement Learning
Artificial Intelligence (AIJ), 2022
R. Sutton
Marlos C. Machado
G. Z. Holland
David Szepesvari Finbarr Timbers
Finbarr Timbers
B. Tanner
Adam White
340
27
0
07 Feb 2022
A First-Occupancy Representation for Reinforcement Learning
Theodore H. Moskovitz
S. Wilson
M. Sahani
283
16
0
28 Sep 2021
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Nan Rosemary Ke
Aniket Didolkar
Sarthak Mittal
Anirudh Goyal
Guillaume Lajoie
Stefan Bauer
Danilo Jimenez Rezende
Yoshua Bengio
Michael C. Mozer
C. Pal
CML
221
68
0
02 Jul 2021
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
Jie Ren
Yewen Li
Zihan Ding
Wei Pan
Hao Dong
BDL
MoE
136
33
0
19 Apr 2021
Solving Compositional Reinforcement Learning Problems via Task Reduction
International Conference on Learning Representations (ICLR), 2021
Yunfei Li
Yilin Wu
Huazhe Xu
Xiaolong Wang
Yi Wu
205
20
0
13 Mar 2021
Discovery of Options via Meta-Learned Subgoals
Neural Information Processing Systems (NeurIPS), 2021
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
H. V. Hasselt
David Silver
Satinder Singh
202
35
0
12 Feb 2021
Interpretable Reinforcement Learning Inspired by Piaget's Theory of Cognitive Development
Aref Hakimzadeh
Yanbo Xue
P. Setoodeh
123
7
0
01 Feb 2021
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Vihang Patil
M. Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
P. Blies
Johannes Brandstetter
Jose A. Arjona-Medina
Sepp Hochreiter
348
46
0
29 Sep 2020
Learning Compositional Neural Programs for Continuous Control
Thomas Pierrot
Nicolas Perrin
Feryal M. P. Behbahani
Alexandre Laterre
Olivier Sigaud
Karim Beguir
Nando de Freitas
CLL
270
4
0
27 Jul 2020
Planning with Abstract Learned Models While Learning Transferable Subtasks
AAAI Conference on Artificial Intelligence (AAAI), 2019
J. Winder
Stephanie Milani
Matthew Landen
Erebus Oh
Shane Parr
S. Squire
Marie desJardins
Cynthia Matuszek
140
10
0
16 Dec 2019
Learning from Trajectories via Subgoal Discovery
Neural Information Processing Systems (NeurIPS), 2019
S. Paul
J. Baar
Amit K. Roy-Chowdhury
260
49
0
03 Nov 2019
Combating the Compounding-Error Problem with a Multi-step Model
Kavosh Asadi
Dipendra Kumar Misra
Seungchan Kim
Michel L. Littman
LRM
256
61
0
30 May 2019
Learning Robust Options by Conditional Value at Risk Optimization
Neural Information Processing Systems (NeurIPS), 2019
Takuya Hiraoka
Takahisa Imagawa
Tatsuya Mori
Takashi Onishi
Yoshimasa Tsuruoka
338
29
0
22 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
Neural Information Processing Systems (NeurIPS), 2019
Shangtong Zhang
Shimon Whiteson
544
84
0
29 Apr 2019
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Mai Xu
164
19
0
10 Nov 2018
Finding Options that Minimize Planning Time
Yuu Jinnai
David Abel
D Ellis Hershkowitz
Michael Littman
George Konidaris
170
42
0
16 Oct 2018
Compositional planning in Markov decision processes: Temporal abstraction meets generalized logic composition
Xuan Liu
Jie Fu
116
5
0
05 Oct 2018
Improving On-policy Learning with Statistical Reward Accumulation
Yubin Deng
K. Yu
Dahua Lin
Xiaoou Tang
Chen Change Loy
OffRL
101
0
0
07 Sep 2018
Variational Intrinsic Control
Karol Gregor
Danilo Jimenez Rezende
Daan Wierstra
DRL
OffRL
310
457
0
22 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
459
1,277
0
16 Nov 2016
Principled Option Learning in Markov Decision Processes
Roy Fox
Michal Moshkovitz
Naftali Tishby
153
11
0
18 Sep 2016
The Option-Critic Architecture
Pierre-Luc Bacon
J. Harb
Doina Precup
OffRL
288
1,198
0
16 Sep 2016
Adaptive Skills, Adaptive Partitions (ASAP)
D. Mankowitz
Timothy A. Mann
Shie Mannor
221
61
0
10 Feb 2016
Value Iteration with Options and State Aggregation
K. Ciosek
David Silver
116
11
0
16 Jan 2015
1
Page 1 of 1