ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.09613
  4. Cited By
A0C: Alpha Zero in Continuous Action Space

A0C: Alpha Zero in Continuous Action Space

24 May 2018
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
ArXivPDFHTML

Papers citing "A0C: Alpha Zero in Continuous Action Space"

26 / 26 papers shown
Title
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search
  through State Occupancy Regularization
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm
Abdeslam Boularias
33
1
0
07 Jul 2024
ConstrainedZero: Chance-Constrained POMDP Planning using Learned
  Probabilistic Failure Surrogates and Adaptive Safety Constraints
ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints
Robert J. Moss
Arec Jamgochian
Johannes Fischer
Anthony Corso
Mykel J. Kochenderfer
45
4
0
01 May 2024
SPO: Sequential Monte Carlo Policy Optimisation
SPO: Sequential Monte Carlo Policy Optimisation
Matthew Macfarlane
Edan Toledo
Donal Byrne
Paul Duckworth
Alexandre Laterre
35
1
0
12 Feb 2024
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned
  Approximations
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations
Robert J. Moss
Anthony Corso
J. Caers
Mykel J. Kochenderfer
42
7
0
31 May 2023
Multi-Stage Monte Carlo Tree Search for Non-Monotone Object
  Rearrangement Planning in Narrow Confined Environments
Multi-Stage Monte Carlo Tree Search for Non-Monotone Object Rearrangement Planning in Narrow Confined Environments
Hanwen Ren
A. H. Qureshi
28
1
0
26 May 2023
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search
  Applications
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search Applications
Marco Kemmerling
Daniel Lutticke
Robert H. Schmitt
37
14
0
14 Mar 2023
Learning to design without prior data: Discovering generalizable design
  strategies using deep learning and tree search
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search
Ayush Raina
Jonathan Cagan
Christopher McComb
AI4CE
53
10
0
28 Nov 2022
Continuous Monte Carlo Graph Search
Continuous Monte Carlo Graph Search
Kalle Kujanpää
Amin Babadi
Yi Zhao
Arno Solin
Alexander Ilin
Joni Pajarinen
LRM
246
2
0
04 Oct 2022
Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel
  Test Environments
Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel Test Environments
John Tan Chong Min
Mehul Motani
37
1
0
13 Jul 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
65
101
0
19 Jun 2022
GrASP: Gradient-Based Affordance Selection for Planning
GrASP: Gradient-Based Affordance Selection for Planning
Vivek Veeriah
Zeyu Zheng
Richard L. Lewis
Satinder Singh
33
4
0
08 Feb 2022
Lyapunov Exponents for Diversity in Differentiable Games
Lyapunov Exponents for Diversity in Differentiable Games
Jonathan Lorraine
Paul Vicol
Jack Parker-Holder
Tal Kachman
Luke Metz
Jakob N. Foerster
35
7
0
24 Dec 2021
AlphaDDA: Strategies for Adjusting the Playing Strength of a Fully
  Trained AlphaZero System to a Suitable Human Training Partner
AlphaDDA: Strategies for Adjusting the Playing Strength of a Fully Trained AlphaZero System to a Suitable Human Training Partner
Kazuhisa Fujita
11
3
0
11 Nov 2021
High-Accuracy Model-Based Reinforcement Learning, a Survey
High-Accuracy Model-Based Reinforcement Learning, a Survey
Aske Plaat
W. Kosters
Mike Preuss
OffRL
27
37
0
17 Jul 2021
Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative
  Environments
Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative Environments
Benjamin Rivière
Wolfgang Hoenig
Matthew O. Anderson
Soon-Jo Chung
50
12
0
20 Apr 2021
Learning and Planning in Complex Action Spaces
Learning and Planning in Complex Action Spaces
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
M. Barekatain
Simon Schmitt
David Silver
35
78
0
13 Apr 2021
Combining Off and On-Policy Training in Model-Based Reinforcement
  Learning
Combining Off and On-Policy Training in Model-Based Reinforcement Learning
Alexandre Borges
Arlindo L. Oliveira
25
2
0
24 Feb 2021
Dream and Search to Control: Latent Space Planning for Continuous
  Control
Dream and Search to Control: Latent Space Planning for Continuous Control
Anurag Koul
Varun V. Kumar
Alan Fern
Somdeb Majumdar
30
6
0
19 Oct 2020
Local Search for Policy Iteration in Continuous Control
Local Search for Policy Iteration in Continuous Control
Jost Tobias Springenberg
N. Heess
D. Mankowitz
J. Merel
Arunkumar Byravan
...
Julian Schrittwieser
Yuval Tassa
J. Buchli
Dan Belov
Martin Riedmiller
OffRL
22
15
0
12 Oct 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a
  Survey
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
21
17
0
11 Aug 2020
Model-based Reinforcement Learning: A Survey
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
36
47
0
30 Jun 2020
Tackling Morpion Solitaire with AlphaZero-likeRanked Reward
  Reinforcement Learning
Tackling Morpion Solitaire with AlphaZero-likeRanked Reward Reinforcement Learning
Hui Wang
Mike Preuss
M. Emmerich
Aske Plaat
LRM
17
9
0
14 Jun 2020
Continuous Control for Searching and Planning with a Learned Model
Continuous Control for Searching and Planning with a Learned Model
Xuxi Yang
Werner Duvaud
Peng Wei
33
5
0
12 Jun 2020
Parallelization of Monte Carlo Tree Search in Continuous Domains
Parallelization of Monte Carlo Tree Search in Continuous Domains
Karl Kurzer
Christoph Hörtnagl
Johann Marius Zöllner
LRM
9
4
0
30 Mar 2020
Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement
  Planning
Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning
Yann Labbé
Sergey Zagoruyko
Igor Kalevatykh
Ivan Laptev
Justin Carpentier
Mathieu Aubry
Josef Sivic
OCL
16
69
0
23 Apr 2019
Ranked Reward: Enabling Self-Play Reinforcement Learning for
  Combinatorial Optimization
Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization
Alexandre Laterre
Yunguan Fu
Mohamed Khalil Jabri
A. Cohen
David Kas
Karl Hajjar
T. Dahl
Amine Kerkeni
Karim Beguir
6
77
0
04 Jul 2018
1