The Option Keyboard: Combining Skills in Reinforcement Learning

24 June 2021

Daniel Toyama

David Silver

Papers citing "The Option Keyboard: Combining Skills in Reinforcement Learning"

50 / 63 papers shown

Title
Constructing an Optimal Behavior Basis for the Option Keyboard L. N. Alegre A. Bazzan André Barreto Bruno C. da Silva 19 0 0 01 May 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning Rishav Rishav Somjit Nath Vincent Michalski Samira Ebrahimi Kahou FAtt OffRL 70 0 0 19 Mar 2025
Accelerating Task Generalisation with Multi-Level Skill Hierarchies Thomas P Cannon Özgür Simsek AI4CE 36 0 0 05 Nov 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning Jiaheng Hu Zizhao Wang Peter Stone Roberto Martín-Martín 45 1 0 15 Oct 2024
Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation Paul Jansonnie Bingbing Wu Julien Perez Jan Peters SSL 20 0 0 07 Oct 2024
Hierarchical Average-Reward Linearly-solvable Markov Decision Processes Guillermo Infante Anders Jonsson Vicenç Gómez 17 1 0 09 Jul 2024
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions Zhening Li Gabriel Poesia Armando Solar-Lezama OffRL 42 1 0 12 Jun 2024
A New View on Planning in Online Reinforcement Learning Kevin Roice Parham Mohammad Panahi Scott M. Jordan Adam White Martha White OffRL 23 0 0 03 Jun 2024
Planning with a Learned Policy Basis to Optimally Solve Complex Tasks Guillermo Infante David Kuric Anders Jonsson Vicencc Gómez H. V. Hoof OffRL 32 2 0 22 Mar 2024
Learning Uncertainty-Aware Temporally-Extended Actions Joongkyu Lee Seung Joon Park Yunhao Tang Min-hwan Oh 14 2 0 08 Feb 2024
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure Tristan Bester Benjamin Rosman Steven D. James Geraud Nangue Tasse 16 1 0 18 Dec 2023
Contrastive Difference Predictive Coding Chongyi Zheng Ruslan Salakhutdinov Benjamin Eysenbach AI4TS OffRL 28 11 0 31 Oct 2023
Combining Behaviors with the Successor Features Keyboard Wilka Carvalho Andre Saraiva Angelos Filos Andrew Kyle Lampinen Loic Matthey Richard L. Lewis Honglak Lee Satinder Singh Danilo Jimenez Rezende Daniel Zoran 13 3 0 24 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning Parvin Malekzadeh Ming Hou Konstantinos N. Plataniotis 43 1 0 16 Oct 2023
Policy composition in reinforcement learning via multi-objective policy optimization Shruti Mishra Ankit Anand Jordan Hoffmann N. Heess Martin Riedmiller A. Abdolmaleki Doina Precup 20 0 0 29 Aug 2023
Diversifying AI: Towards Creative Chess with AlphaZero Tom Zahavy Vivek Veeriah Shaobo Hou Kevin Waugh Matthew Lai Edouard Leurent Nenad Tomašev Lisa Schut Demis Hassabis Satinder Singh 34 15 0 17 Aug 2023
Unsupervised Discovery of Continuous Skills on a Sphere Takahisa Imagawa Takuya Hiraoka Yoshimasa Tsuruoka 29 0 0 21 May 2023
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models Wanqiao Xu Shi Dong Dilip Arumugam Benjamin Van Roy 32 8 0 19 May 2023
Behavior Contrastive Learning for Unsupervised Skill Discovery Rushuai Yang Chenjia Bai Hongyi Guo Siyuan Li Bin Zhao Zhen Wang Peng Liu Xuelong Li SSL 29 16 0 08 May 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition Y. Liu Aamir Ahmad 29 4 0 24 Mar 2023
Bounding the Optimal Value Function in Compositional Reinforcement Learning Jacob Adamczyk Volodymyr Makarenko A. Arriojas Stas Tiomkin R. Kulkarni OffRL 32 2 0 05 Mar 2023
Hierarchical Reinforcement Learning in Complex 3D Environments Bernardo Avila-Pires Feryal M. P. Behbahani Hubert Soyer Kyriacos Nikiforou Thomas Keck Satinder Singh OffRL 16 0 0 28 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals Akhil Bagaria Ray Jiang Ramana Kumar Tom Schaul LRM 11 2 0 09 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition P. Sunehag A. Vezhnevets Edgar A. Duénez-Guzmán Igor Mordach Joel Z Leibo 26 2 0 02 Feb 2023
Composing Task Knowledge with Modular Successor Feature Approximators Wilka Carvalho Angelos Filos Richard L. Lewis Honglak Lee Satinder Singh 17 7 0 28 Jan 2023
Reusable Options through Gradient-based Meta Learning David Kuric H. V. Hoof 26 0 0 22 Dec 2022
The Effectiveness of World Models for Continual Reinforcement Learning Samuel Kessler M. Ostaszewski Michal Bortkiewicz M. Żarski Maciej Wołczyk Jack Parker-Holder Stephen J. Roberts Piotr Milo's KELM OffRL CLL 27 7 0 29 Nov 2022
Melting Pot 2.0 J. Agapiou A. Vezhnevets Edgar A. Duénez-Guzmán Jayd Matyas Yiran Mao ... Sukhdeep Singh Julia Haas Igor Mordatch D. Mobbs Joel Z Leibo 30 31 0 24 Nov 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering Andrei A. Rusu Sebastian Flennerhag Dushyant Rao Razvan Pascanu R. Hadsell 34 6 0 22 Oct 2022
ASPiRe:Adaptive Skill Priors for Reinforcement Learning Mengda Xu Manuela Veloso Shuran Song CLL OffRL 16 10 0 30 Sep 2022
The Alberta Plan for AI Research R. Sutton Michael Bowling P. Pilarski 13 24 0 23 Aug 2022
Value Function Decomposition for Iterative Design of Reinforcement Learning Agents J. MacGlashan Evan Archer A. Devlic Takuma Seno Craig Sherstan Peter R. Wurman AI PeterStoneSony 19 6 0 24 Jun 2022
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis Shayegan Omidshafiei A. Kapishnikov Yannick Assogba Lucas Dixon Been Kim OffRL 36 5 0 17 Jun 2022
Generalised Policy Improvement with Geometric Policy Composition S. Thakoor Mark Rowland Diana Borsa Will Dabney Rémi Munos André Barreto OffRL 14 7 0 17 Jun 2022
Meta-Learning Parameterized Skills Haotian Fu Shangqun Yu Saket Tiwari Michael Littman George Konidaris 35 6 0 07 Jun 2022
Goal-Space Planning with Subgoal Models Chun-Ping Lo Kevin Roice Parham Mohammad Panahi Scott M. Jordan Adam White Gábor Mihucz Farzane Aminmansour Martha White 21 5 0 06 Jun 2022
Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning Geraud Nangue Tasse Devon Jarvis Steven D. James Benjamin Rosman 44 4 0 25 May 2022
Task Relabelling for Multi-task Transfer using Successor Features Martin Balla Diego Perez-Liebana 14 1 0 20 May 2022
Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act Alexis Jacq Johan Ferret Olivier Pietquin M. Geist 19 9 0 16 Mar 2022
A Versatile Agent for Fast Learning from Human Instructors Yiwen Chen Zedong Zhang Hao-Kang Liu Jiayi Tan C. Chew Marcelo H ANG Jr 12 0 0 01 Mar 2022
Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates Souradeep Dutta Kaustubh Sridhar Osbert Bastani Yan Sun James Weimer Insup Lee J. Parish-Morris 22 2 0 25 Feb 2022
Continual Auxiliary Task Learning Matt McLeod Chun-Ping Lo M. Schlegel Andrew Jacobsen Raksha Kumaraswamy Martha White Adam White CLL 16 8 0 22 Feb 2022
Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates Safa Alver Doina Precup OffRL 4 17 0 30 Dec 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning Dhruv Shah Peng-Tao Xu Yao Lu Ted Xiao Alexander Toshev Sergey Levine Brian Ichter OffRL 29 41 0 04 Nov 2021
Successor Feature Representations Chris Reinke Xavier Alameda-Pineda 29 5 0 29 Oct 2021
Option Transfer and SMDP Abstraction with Successor Features Dongge Han Sebastian Tschiatschek 9 1 0 18 Oct 2021
Temporal Abstraction in Reinforcement Learning with the Successor Representation Marlos C. Machado André Barreto Doina Precup Michael Bowling 16 40 0 12 Oct 2021
When should agents explore? Miruna Pislar David Szepesvari Georg Ostrovski Diana Borsa Tom Schaul 40 22 0 26 Aug 2021
A New Representation of Successor Features for Transfer across Dissimilar Environments Majid Abdolshah Hung Le Thommen George Karimpanal Sunil R. Gupta Santu Rana Svetha Venkatesh 10 18 0 18 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot Joel Z Leibo Edgar A. Duénez-Guzmán A. Vezhnevets J. Agapiou P. Sunehag Raphael Köster Jayd Matyas Charlie Beattie Igor Mordatch T. Graepel OffRL 58 103 0 14 Jul 2021