Learning and Planning in Complex Action Spaces

13 April 2021
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
M. Barekatain
Simon Schmitt
David Silver
arXiv: 2104.06303

Papers citing "Learning and Planning in Complex Action Spaces"

18 / 18 papers shown
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL
LRM
30
0
0
08 Apr 2025
OptionZero: Planning with Learned Options
Po-Wei Huang
Pei-Chiun Peng
Hung Guei
Ti-Rong Wu
55
0
0
23 Feb 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
65
0
0
03 Jan 2025
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Jiayu Chen
Wentse Chen
Jeff Schneider
OffRL
31
1
0
15 Oct 2024
Finding path and cycle counting formulae in graphs with Deep Reinforcement Learning
Jason Piquenot
Maxime Bérar
Pierre Héroux
Jean-Yves Ramel
R. Raveaux
Sébastien Adam
23
0
0
02 Oct 2024
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
51
8
0
02 Aug 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
49
1
0
15 Jun 2024
Policy-Based Self-Competition for Planning Problems
Jonathan Pirnay
Q. Göttl
Jakob Burger
D. G. Grimm
34
3
0
07 Jun 2023
Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Volodymyr Tkachuk
Seyed Alireza Bakhtiari
Johannes Kirschner
Matej Jusup
Ilija Bogunovic
Csaba Szepesvári
26
4
0
08 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Continuous Neural Algorithmic Planners
Yu He
Petar Veličković
Pietro Liò
Andreea Deac
26
5
0
29 Nov 2022
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search
Ayush Raina
Jonathan Cagan
Christopher McComb
AI4CE
25
9
0
28 Nov 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
28
21
0
18 Oct 2022
Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
Tian Tian
K. Young
R. Sutton
21
1
0
04 Jul 2022
Learning Large Neighborhood Search Policy for Integer Programming
Yaoxin Wu
Wen Song
Zhiguang Cao
Jie Zhang
21
40
0
01 Nov 2021
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
40
222
0
30 Oct 2021
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
Dream and Search to Control: Latent Space Planning for Continuous Control
Anurag Koul
Varun V. Kumar
Alan Fern
Somdeb Majumdar
19
6
0
19 Oct 2020