Monte Carlo Tree Search: A Review of Recent Modifications and Applications

8 March 2021

Papers citing "Monte Carlo Tree Search: A Review of Recent Modifications and Applications"

Adaptive Stress Testing Black-Box LLM Planners Neeloy Chakraborty John Pohovey Melkior Ornik Katherine Driggs-Campbell 28 0 0 08 May 2025
AURORA: Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification Xiaoyu Tan Tianchu Yao C. Qu Bin Li Minghao Yang ... Haozhe Wang Xihe Qiu Wei Chu Yinghui Xu Yuan Qi OffRL LRM 44 2 0 17 Feb 2025
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL Shuai Lyu Haoran Luo Zhonghong Ou Yifan Zhu Xiaoran Shang Yang Qin Meina Song AI4TS LRM 60 1 0 17 Feb 2025
Quantum Circuit Design using a Progressive Widening Enhanced Monte Carlo Tree Search Vincenzo Lipardi Domenica Dibenedetto Georgios Stamoulis Mark H.M. Winands 117 0 0 06 Feb 2025
Conversation Games and a Strategic View of the Turing Test Kaveh Aryan LRM 70 0 0 30 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze Chunyu Xuan Yazhe Niu Yuan Pu Shuai Hu Yu Liu Jing Yang 59 0 0 03 Jan 2025
Finding path and cycle counting formulae in graphs with Deep Reinforcement Learning Jason Piquenot Maxime Bérar Pierre Héroux Jean-Yves Ramel R. Raveaux Sébastien Adam 16 0 0 02 Oct 2024
Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo Shengyu Feng Xiang Kong Shuang Ma Aonan Zhang Dong Yin Chong-Jun Wang Ruoming Pang Yiming Yang LRM 25 0 0 02 Oct 2024
QuantFactor REINFORCE: Mining Steady Formulaic Alpha Factors with Variance-bounded REINFORCE Junjie Zhao Chengxi Zhang Min Qin Peng Yang OOD 31 3 0 08 Sep 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models Yuan Pu Yazhe Niu Jiyuan Ren Zhenjie Yang Hongsheng Li Yu Liu OffRL 41 1 0 15 Jun 2024
Reinforcement learning-based architecture search for quantum machine learning Frederic Rapp D. Kreplin Marco F. Huber M. Roth AI4CE 27 5 0 04 Jun 2024
Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs Michael Gimelfarb Ayal Taitler Scott Sanner 16 0 0 20 Jan 2024
Bayesian inference for data-efficient, explainable, and safe robotic motion planning: A review Chengmin Zhou Chao Wang Haseeb Hassan H. Shah Bingding Huang P. Fränti 3DV 25 3 0 16 Jul 2023
C-MCTS: Safe Planning with Monte Carlo Tree Search Dinesh Parthasarathy G. Kontes Axel Plinge Christopher Mutschler 32 3 0 25 May 2023
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search Applications Marco Kemmerling Daniel Lütticke Robert H. Schmitt 16 14 0 14 Mar 2023
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search Ayush Raina Jonathan Cagan Christopher McComb AI4CE 18 9 0 28 Nov 2022
Machine Learning for K-adaptability in Two-stage Robust Optimization Esther Julien Krzysztof Postek Ş. İlker Birbil 31 2 0 20 Oct 2022
Developing a Successful Bomberman Agent D. Kowalczyk Jakub Kowalski Hubert Obrzut Michael Maras Szymon Kosakowski Radoslaw Miernik 24 1 0 17 Mar 2022
A Fast Evolutionary adaptation for MCTS in Pommerman Harsh Panwar Saswata Chatterjee W. Dube 11 0 0 26 Nov 2021
Seriema: RDMA-based Remote Invocation with a Case-Study on Monte-Carlo Tree Search Hammurabi Mendes Bryce Wiedenbeck Aidan O'Neill LRM 22 1 0 20 Sep 2021
Searching for More Efficient Dynamic Programs Tim Vieira Ryan Cotterell Jason Eisner 19 3 0 14 Sep 2021
Learning compositional programs with arguments and sampling Giovanni De Toni L. Erculiani Andrea Passerini 22 3 0 01 Sep 2021
Improving Hearthstone AI by Combining MCTS and Supervised Learning Algorithms M. Świechowski T. Tajmajer Andrzej Janusz BDL 52 59 0 14 Aug 2018