Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.13105
Cited By
The Option Keyboard: Combining Skills in Reinforcement Learning
24 June 2021
André Barreto
Diana Borsa
Shaobo Hou
Gheorghe Comanici
Eser Aygun
P. Hamel
Daniel Toyama
Jonathan J. Hunt
Shibl Mourad
David Silver
Doina Precup
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Option Keyboard: Combining Skills in Reinforcement Learning"
50 / 63 papers shown
Title
Constructing an Optimal Behavior Basis for the Option Keyboard
L. N. Alegre
A. Bazzan
André Barreto
Bruno C. da Silva
19
0
0
01 May 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt
OffRL
70
0
0
19 Mar 2025
Accelerating Task Generalisation with Multi-Level Skill Hierarchies
Thomas P Cannon
Özgür Simsek
AI4CE
36
0
0
05 Nov 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
Jiaheng Hu
Zizhao Wang
Peter Stone
Roberto Martín-Martín
45
1
0
15 Oct 2024
Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation
Paul Jansonnie
Bingbing Wu
Julien Perez
Jan Peters
SSL
20
0
0
07 Oct 2024
Hierarchical Average-Reward Linearly-solvable Markov Decision Processes
Guillermo Infante
Anders Jonsson
Vicenç Gómez
17
1
0
09 Jul 2024
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions
Zhening Li
Gabriel Poesia
Armando Solar-Lezama
OffRL
42
1
0
12 Jun 2024
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
23
0
0
03 Jun 2024
Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
Guillermo Infante
David Kuric
Anders Jonsson
Vicencc Gómez
H. V. Hoof
OffRL
32
2
0
22 Mar 2024
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
14
2
0
08 Feb 2024
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure
Tristan Bester
Benjamin Rosman
Steven D. James
Geraud Nangue Tasse
16
1
0
18 Dec 2023
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
28
11
0
31 Oct 2023
Combining Behaviors with the Successor Features Keyboard
Wilka Carvalho
Andre Saraiva
Angelos Filos
Andrew Kyle Lampinen
Loic Matthey
Richard L. Lewis
Honglak Lee
Satinder Singh
Danilo Jimenez Rezende
Daniel Zoran
13
3
0
24 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
43
1
0
16 Oct 2023
Policy composition in reinforcement learning via multi-objective policy optimization
Shruti Mishra
Ankit Anand
Jordan Hoffmann
N. Heess
Martin Riedmiller
A. Abdolmaleki
Doina Precup
20
0
0
29 Aug 2023
Diversifying AI: Towards Creative Chess with AlphaZero
Tom Zahavy
Vivek Veeriah
Shaobo Hou
Kevin Waugh
Matthew Lai
Edouard Leurent
Nenad Tomašev
Lisa Schut
Demis Hassabis
Satinder Singh
34
15
0
17 Aug 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
29
0
0
21 May 2023
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Wanqiao Xu
Shi Dong
Dilip Arumugam
Benjamin Van Roy
32
8
0
19 May 2023
Behavior Contrastive Learning for Unsupervised Skill Discovery
Rushuai Yang
Chenjia Bai
Hongyi Guo
Siyuan Li
Bin Zhao
Zhen Wang
Peng Liu
Xuelong Li
SSL
29
16
0
08 May 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition
Y. Liu
Aamir Ahmad
29
4
0
24 Mar 2023
Bounding the Optimal Value Function in Compositional Reinforcement Learning
Jacob Adamczyk
Volodymyr Makarenko
A. Arriojas
Stas Tiomkin
R. Kulkarni
OffRL
32
2
0
05 Mar 2023
Hierarchical Reinforcement Learning in Complex 3D Environments
Bernardo Avila-Pires
Feryal M. P. Behbahani
Hubert Soyer
Kyriacos Nikiforou
Thomas Keck
Satinder Singh
OffRL
16
0
0
28 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
11
2
0
09 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
26
2
0
02 Feb 2023
Composing Task Knowledge with Modular Successor Feature Approximators
Wilka Carvalho
Angelos Filos
Richard L. Lewis
Honglak Lee
Satinder Singh
17
7
0
28 Jan 2023
Reusable Options through Gradient-based Meta Learning
David Kuric
H. V. Hoof
26
0
0
22 Dec 2022
The Effectiveness of World Models for Continual Reinforcement Learning
Samuel Kessler
M. Ostaszewski
Michal Bortkiewicz
M. Żarski
Maciej Wołczyk
Jack Parker-Holder
Stephen J. Roberts
Piotr Milo's
KELM
OffRL
CLL
27
7
0
29 Nov 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
30
31
0
24 Nov 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
34
6
0
22 Oct 2022
ASPiRe:Adaptive Skill Priors for Reinforcement Learning
Mengda Xu
Manuela Veloso
Shuran Song
CLL
OffRL
16
10
0
30 Sep 2022
The Alberta Plan for AI Research
R. Sutton
Michael Bowling
P. Pilarski
13
24
0
23 Aug 2022
Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
J. MacGlashan
Evan Archer
A. Devlic
Takuma Seno
Craig Sherstan
Peter R. Wurman
AI PeterStoneSony
19
6
0
24 Jun 2022
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis
Shayegan Omidshafiei
A. Kapishnikov
Yannick Assogba
Lucas Dixon
Been Kim
OffRL
36
5
0
17 Jun 2022
Generalised Policy Improvement with Geometric Policy Composition
S. Thakoor
Mark Rowland
Diana Borsa
Will Dabney
Rémi Munos
André Barreto
OffRL
14
7
0
17 Jun 2022
Meta-Learning Parameterized Skills
Haotian Fu
Shangqun Yu
Saket Tiwari
Michael Littman
George Konidaris
35
6
0
07 Jun 2022
Goal-Space Planning with Subgoal Models
Chun-Ping Lo
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Gábor Mihucz
Farzane Aminmansour
Martha White
21
5
0
06 Jun 2022
Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning
Geraud Nangue Tasse
Devon Jarvis
Steven D. James
Benjamin Rosman
44
4
0
25 May 2022
Task Relabelling for Multi-task Transfer using Successor Features
Martin Balla
Diego Perez-Liebana
14
1
0
20 May 2022
Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act
Alexis Jacq
Johan Ferret
Olivier Pietquin
M. Geist
19
9
0
16 Mar 2022
A Versatile Agent for Fast Learning from Human Instructors
Yiwen Chen
Zedong Zhang
Hao-Kang Liu
Jiayi Tan
C. Chew
Marcelo H ANG Jr
12
0
0
01 Mar 2022
Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates
Souradeep Dutta
Kaustubh Sridhar
Osbert Bastani
Yan Sun
James Weimer
Insup Lee
J. Parish-Morris
22
2
0
25 Feb 2022
Continual Auxiliary Task Learning
Matt McLeod
Chun-Ping Lo
M. Schlegel
Andrew Jacobsen
Raksha Kumaraswamy
Martha White
Adam White
CLL
16
8
0
22 Feb 2022
Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates
Safa Alver
Doina Precup
OffRL
4
17
0
30 Dec 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng-Tao Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
29
41
0
04 Nov 2021
Successor Feature Representations
Chris Reinke
Xavier Alameda-Pineda
29
5
0
29 Oct 2021
Option Transfer and SMDP Abstraction with Successor Features
Dongge Han
Sebastian Tschiatschek
9
1
0
18 Oct 2021
Temporal Abstraction in Reinforcement Learning with the Successor Representation
Marlos C. Machado
André Barreto
Doina Precup
Michael Bowling
16
40
0
12 Oct 2021
When should agents explore?
Miruna Pislar
David Szepesvari
Georg Ostrovski
Diana Borsa
Tom Schaul
40
22
0
26 Aug 2021
A New Representation of Successor Features for Transfer across Dissimilar Environments
Majid Abdolshah
Hung Le
Thommen George Karimpanal
Sunil R. Gupta
Santu Rana
Svetha Venkatesh
10
18
0
18 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
58
103
0
14 Jul 2021
1
2
Next