ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1206.6473
  4. Cited By
Compositional Planning Using Optimal Option Models

Compositional Planning Using Optimal Option Models

International Conference on Machine Learning (ICML), 2012
27 June 2012
David Silver
K. Ciosek
ArXiv (abs)PDFHTML

Papers citing "Compositional Planning Using Optimal Option Models"

35 / 35 papers shown
Multi-layer Abstraction for Nested Generation of Options (MANGO) in Hierarchical Reinforcement Learning
Multi-layer Abstraction for Nested Generation of Options (MANGO) in Hierarchical Reinforcement Learning
Alessio Arcudi
Davide Sartor
Alberto Sinigaglia
Vincent François-Lavet
Gian Antonio Susto
83
0
0
25 Aug 2025
A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes
A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes
Thomas J. Ringstrom
Paul Schrater
160
0
0
11 Jun 2025
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Giuseppe Paolo
Khyati Khandelwal
Abdelhakim Benechehab
Albert Thomas
Jun Yao
387
4
0
21 Feb 2025
Contrastive Abstraction for Reinforcement Learning
Contrastive Abstraction for Reinforcement Learning
Vihang Patil
M. Hofmarcher
Elisabeth Rumetshofer
Sepp Hochreiter
OffRLSSL
305
4
0
01 Oct 2024
A General Theory for Compositional Generalization
A General Theory for Compositional Generalization
Jingwen Fu
Zhizheng Zhang
Yan Lu
Nanning Zheng
AI4CECoGe
254
2
0
20 May 2024
Hierarchical Reinforcement Learning for Power Network Topology Control
Hierarchical Reinforcement Learning for Power Network Topology Control
Blazej Manczak
Jan Viebahn
H. V. Hoof
238
9
0
03 Nov 2023
Consciousness-Inspired Spatio-Temporal Abstractions for Better
  Generalization in Reinforcement Learning
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Mingde Zhao
Safa Alver
H. V. Seijen
Romain Laroche
Doina Precup
Yoshua Bengio
474
4
0
30 Sep 2023
Offline Skill Graph (OSG): A Framework for Learning and Planning using
  Offline Reinforcement Learning Skills
Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills
Ben-ya Halevy
Y. Aperstein
Dotan Di Castro
GPOffRL
173
1
0
23 Jun 2023
Hierarchical Imitation Learning with Vector Quantized Models
Hierarchical Imitation Learning with Vector Quantized ModelsInternational Conference on Machine Learning (ICML), 2023
Kalle Kujanpää
Joni Pajarinen
Alexander Ilin
251
16
0
30 Jan 2023
Multi-Task Option Learning and Discovery for Stochastic Path Planning
Multi-Task Option Learning and Discovery for Stochastic Path Planning
Naman Shah
Siddharth Srivastava
331
2
0
30 Sep 2022
Generalised Policy Improvement with Geometric Policy Composition
Generalised Policy Improvement with Geometric Policy CompositionInternational Conference on Machine Learning (ICML), 2022
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
186
10
0
17 Jun 2022
Reward-Respecting Subtasks for Model-Based Reinforcement Learning
Reward-Respecting Subtasks for Model-Based Reinforcement LearningArtificial Intelligence (AIJ), 2022
R. Sutton
Marlos C. Machado
G. Z. Holland
David Szepesvari Finbarr Timbers
Finbarr Timbers
B. Tanner
Adam White
340
27
0
07 Feb 2022
A First-Occupancy Representation for Reinforcement Learning
A First-Occupancy Representation for Reinforcement Learning
Theodore H. Moskovitz
S. Wilson
M. Sahani
283
16
0
28 Sep 2021
Systematic Evaluation of Causal Discovery in Visual Model Based
  Reinforcement Learning
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Nan Rosemary Ke
Aniket Didolkar
Sarthak Mittal
Anirudh Goyal
Guillaume Lajoie
Stefan Bauer
Danilo Jimenez Rezende
Yoshua Bengio
Michael C. Mozer
C. Pal
CML
221
68
0
02 Jul 2021
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement
  Learning
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
Jie Ren
Yewen Li
Zihan Ding
Wei Pan
Hao Dong
BDLMoE
136
33
0
19 Apr 2021
Solving Compositional Reinforcement Learning Problems via Task Reduction
Solving Compositional Reinforcement Learning Problems via Task ReductionInternational Conference on Learning Representations (ICLR), 2021
Yunfei Li
Yilin Wu
Huazhe Xu
Xiaolong Wang
Yi Wu
205
20
0
13 Mar 2021
Discovery of Options via Meta-Learned Subgoals
Discovery of Options via Meta-Learned SubgoalsNeural Information Processing Systems (NeurIPS), 2021
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
H. V. Hasselt
David Silver
Satinder Singh
202
35
0
12 Feb 2021
Interpretable Reinforcement Learning Inspired by Piaget's Theory of
  Cognitive Development
Interpretable Reinforcement Learning Inspired by Piaget's Theory of Cognitive Development
Aref Hakimzadeh
Yanbo Xue
P. Setoodeh
123
7
0
01 Feb 2021
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Vihang Patil
M. Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
P. Blies
Johannes Brandstetter
Jose A. Arjona-Medina
Sepp Hochreiter
348
46
0
29 Sep 2020
Learning Compositional Neural Programs for Continuous Control
Learning Compositional Neural Programs for Continuous Control
Thomas Pierrot
Nicolas Perrin
Feryal M. P. Behbahani
Alexandre Laterre
Olivier Sigaud
Karim Beguir
Nando de Freitas
CLL
270
4
0
27 Jul 2020
Planning with Abstract Learned Models While Learning Transferable
  Subtasks
Planning with Abstract Learned Models While Learning Transferable SubtasksAAAI Conference on Artificial Intelligence (AAAI), 2019
J. Winder
Stephanie Milani
Matthew Landen
Erebus Oh
Shane Parr
S. Squire
Marie desJardins
Cynthia Matuszek
140
10
0
16 Dec 2019
Learning from Trajectories via Subgoal Discovery
Learning from Trajectories via Subgoal DiscoveryNeural Information Processing Systems (NeurIPS), 2019
S. Paul
J. Baar
Amit K. Roy-Chowdhury
260
49
0
03 Nov 2019
Combating the Compounding-Error Problem with a Multi-step Model
Combating the Compounding-Error Problem with a Multi-step Model
Kavosh Asadi
Dipendra Kumar Misra
Seungchan Kim
Michel L. Littman
LRM
256
61
0
30 May 2019
Learning Robust Options by Conditional Value at Risk Optimization
Learning Robust Options by Conditional Value at Risk OptimizationNeural Information Processing Systems (NeurIPS), 2019
Takuya Hiraoka
Takahisa Imagawa
Tatsuya Mori
Takashi Onishi
Yoshimasa Tsuruoka
338
29
0
22 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
DAC: The Double Actor-Critic Architecture for Learning OptionsNeural Information Processing Systems (NeurIPS), 2019
Shangtong Zhang
Shimon Whiteson
544
84
0
29 Apr 2019
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Mai Xu
164
19
0
10 Nov 2018
Finding Options that Minimize Planning Time
Finding Options that Minimize Planning Time
Yuu Jinnai
David Abel
D Ellis Hershkowitz
Michael Littman
George Konidaris
170
42
0
16 Oct 2018
Compositional planning in Markov decision processes: Temporal
  abstraction meets generalized logic composition
Compositional planning in Markov decision processes: Temporal abstraction meets generalized logic composition
Xuan Liu
Jie Fu
116
5
0
05 Oct 2018
Improving On-policy Learning with Statistical Reward Accumulation
Improving On-policy Learning with Statistical Reward Accumulation
Yubin Deng
K. Yu
Dahua Lin
Xiaoou Tang
Chen Change Loy
OffRL
101
0
0
07 Sep 2018
Variational Intrinsic Control
Variational Intrinsic Control
Karol Gregor
Danilo Jimenez Rezende
Daan Wierstra
DRLOffRL
310
457
0
22 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
459
1,277
0
16 Nov 2016
Principled Option Learning in Markov Decision Processes
Principled Option Learning in Markov Decision Processes
Roy Fox
Michal Moshkovitz
Naftali Tishby
153
11
0
18 Sep 2016
The Option-Critic Architecture
The Option-Critic Architecture
Pierre-Luc Bacon
J. Harb
Doina Precup
OffRL
288
1,198
0
16 Sep 2016
Adaptive Skills, Adaptive Partitions (ASAP)
Adaptive Skills, Adaptive Partitions (ASAP)
D. Mankowitz
Timothy A. Mann
Shie Mannor
221
61
0
10 Feb 2016
Value Iteration with Options and State Aggregation
Value Iteration with Options and State Aggregation
K. Ciosek
David Silver
116
11
0
16 Jan 2015
1
Page 1 of 1