Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.04931
Cited By
Monte Carlo Tree Search: A Review of Recent Modifications and Applications
8 March 2021
M. Świechowski
Konrad Godlewski
B. Sawicki
Jacek Mañdziuk
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Monte Carlo Tree Search: A Review of Recent Modifications and Applications"
23 / 23 papers shown
Title
Adaptive Stress Testing Black-Box LLM Planners
Neeloy Chakraborty
John Pohovey
Melkior Ornik
Katherine Driggs-Campbell
28
0
0
08 May 2025
AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
Xiaoyu Tan
Tianchu Yao
C. Qu
Bin Li
Minghao Yang
...
Haozhe Wang
Xihe Qiu
Wei Chu
Yinghui Xu
Yuan Qi
OffRL
LRM
44
2
0
17 Feb 2025
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
Shuai Lyu
Haoran Luo
Zhonghong Ou
Yifan Zhu
Xiaoran Shang
Yang Qin
Meina Song
AI4TS
LRM
60
1
0
17 Feb 2025
Quantum Circuit Design using a Progressive Widening Enhanced Monte Carlo Tree Search
Vincenzo Lipardi
Domenica Dibenedetto
Georgios Stamoulis
Mark H.M. Winands
117
0
0
06 Feb 2025
Conversation Games and a Strategic View of the Turing Test
Kaveh Aryan
LRM
70
0
0
30 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
59
0
0
03 Jan 2025
Finding path and cycle counting formulae in graphs with Deep Reinforcement Learning
Jason Piquenot
Maxime Bérar
Pierre Héroux
Jean-Yves Ramel
R. Raveaux
Sébastien Adam
16
0
0
02 Oct 2024
Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Shengyu Feng
Xiang Kong
Shuang Ma
Aonan Zhang
Dong Yin
Chong-Jun Wang
Ruoming Pang
Yiming Yang
LRM
25
0
0
02 Oct 2024
QuantFactor REINFORCE: Mining Steady Formulaic Alpha Factors with Variance-bounded REINFORCE
Junjie Zhao
Chengxi Zhang
Min Qin
Peng Yang
OOD
31
3
0
08 Sep 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
41
1
0
15 Jun 2024
Reinforcement learning-based architecture search for quantum machine learning
Frederic Rapp
D. Kreplin
Marco F. Huber
M. Roth
AI4CE
27
5
0
04 Jun 2024
Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs
Michael Gimelfarb
Ayal Taitler
Scott Sanner
16
0
0
20 Jan 2024
Bayesian inference for data-efficient, explainable, and safe robotic motion planning: A review
Chengmin Zhou
Chao Wang
Haseeb Hassan
H. Shah
Bingding Huang
P. Fränti
3DV
25
3
0
16 Jul 2023
C-MCTS: Safe Planning with Monte Carlo Tree Search
Dinesh Parthasarathy
G. Kontes
Axel Plinge
Christopher Mutschler
32
3
0
25 May 2023
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search Applications
Marco Kemmerling
Daniel Lutticke
Robert H. Schmitt
16
14
0
14 Mar 2023
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search
Ayush Raina
Jonathan Cagan
Christopher McComb
AI4CE
18
9
0
28 Nov 2022
Machine Learning for K-adaptability in Two-stage Robust Optimization
Esther Julien
Krzysztof Postek
cS. .Ilker Birbil
31
2
0
20 Oct 2022
Developing a Successful Bomberman Agent
D. Kowalczyk
Jakub Kowalski
Hubert Obrzut
Michael Maras
Szymon Kosakowski
Radoslaw Miernik
24
1
0
17 Mar 2022
A Fast Evolutionary adaptation for MCTS in Pommerman
Harsh Panwar
Saswata Chatterjee
W. Dube
11
0
0
26 Nov 2021
Seriema: RDMA-based Remote Invocationwith a Case-Study on Monte-Carlo Tree Search
Hammurabi Mendes
Bryce Wiedenbeck
Aidan OÑeill
LRM
22
1
0
20 Sep 2021
Searching for More Efficient Dynamic Programs
Tim Vieira
Ryan Cotterell
Jason Eisner
19
3
0
14 Sep 2021
Learning compositional programs with arguments and sampling
Giovanni De Toni
L. Erculiani
Andrea Passerini
22
3
0
01 Sep 2021
Improving Hearthstone AI by Combining MCTS and Supervised Learning Algorithms
M. Świechowski
T. Tajmajer
Andrzej Janusz
BDL
52
59
0
14 Aug 2018
1