Learning and Planning in Complex Action Spaces

Learning and Planning in Complex Action Spaces

13 April 2021

Julian Schrittwieser

Ioannis Antonoglou

David Silver

Papers citing "Learning and Planning in Complex Action Spaces"

18 / 18 papers shown

Title
Trust-Region Twisted Policy Improvement Joery A. de Vries Jinke He Yaniv Oren M. Spaan OffRL LRM 30 0 0 08 Apr 2025
OptionZero: Planning with Learned Options Po-Wei Huang Pei-Chiun Peng Hung Guei Ti-Rong Wu 55 0 0 23 Feb 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze Chunyu Xuan Yazhe Niu Yuan Pu Shuai Hu Yu Liu Jing Yang 65 0 0 03 Jan 2025
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning Jiayu Chen Wentse Chen Jeff Schneider OffRL 31 1 0 15 Oct 2024
Finding path and cycle counting formulae in graphs with Deep Reinforcement Learning Jason Piquenot Maxime Bérar Pierre Héroux Jean-Yves Ramel R. Raveaux Sébastien Adam 23 0 0 02 Oct 2024
A Survey on Self-play Methods in Reinforcement Learning Ruize Zhang Zelai Xu Chengdong Ma Chao Yu Weijuan Tu ... Deheng Ye Wenbo Ding Yaodong Yang Yu Wang Yu Wang SyDa SSL OnRL 51 8 0 02 Aug 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models Yuan Pu Yazhe Niu Jiyuan Ren Zhenjie Yang Hongsheng Li Yu Liu OffRL 49 1 0 15 Jun 2024
Policy-Based Self-Competition for Planning Problems Jonathan Pirnay Q. Göttl Jakob Burger D. G. Grimm 34 3 0 07 Jun 2023
Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning Volodymyr Tkachuk Seyed Alireza Bakhtiari Johannes Kirschner Matej Jusup Ilija Bogunovic Csaba Szepesvári 26 4 0 08 Feb 2023
Investigating the role of model-based learning in exploration and transfer Jacob Walker Eszter Vértes Yazhe Li Gabriel Dulac-Arnold Ankesh Anand T. Weber Jessica B. Hamrick OffRL 36 7 0 08 Feb 2023
Continuous Neural Algorithmic Planners Yu He Petar Velivcković Pietro Lio' Andreea Deac 26 5 0 29 Nov 2022
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search Ayush Raina Jonathan Cagan Christopher McComb AI4CE 25 9 0 28 Nov 2022
Planning for Sample Efficient Imitation Learning Zhao-Heng Yin Weirui Ye Qifeng Chen Yang Gao OffRL 28 21 0 18 Oct 2022
Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions Tian Tian K. Young R. Sutton 21 1 0 04 Jul 2022
Learning Large Neighborhood Search Policy for Integer Programming Yaoxin Wu Wen Song Zhiguang Cao Jie Zhang 21 40 0 01 Nov 2021
Mastering Atari Games with Limited Data Weirui Ye Shao-Wei Liu Thanard Kurutach Pieter Abbeel Yang Gao VLM 40 222 0 30 Oct 2021
Evaluating model-based planning and planner amortization for continuous control Arunkumar Byravan Leonard Hasenclever Piotr Trochim M. Berk Mirza Alessandro Davide Ialongo ... Jost Tobias Springenberg A. Abdolmaleki N. Heess J. Merel Martin Riedmiller 55 17 0 07 Oct 2021
Dream and Search to Control: Latent Space Planning for Continuous Control Anurag Koul Varun V. Kumar Alan Fern Somdeb Majumdar 19 6 0 19 Oct 2020