Monte-Carlo Tree Search by Best Arm Identification

9 June 2017
E. Kaufmann
Wouter M. Koolen
arXiv:1706.02986

Papers citing "Monte-Carlo Tree Search by Best Arm Identification"

19 / 19 papers shown
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning
Conor F. Hayes
Mathieu Reymond
D. Roijers
Enda Howley
Patrick Mannion
54
4
0
23 Nov 2022
An Efficient Dynamic Sampling Policy For Monte Carlo Tree Search
Gongbo Zhang
Yijie Peng
Yilong Xu
51
9
0
26 Apr 2022
Best Arm Identification under Additive Transfer Bandits
Ojash Neopane
Aaditya Ramdas
Aarti Singh
52
2
0
08 Dec 2021
Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari
B. Ayton
Masataro Asai
VLM, OffRL
97
1
0
30 Sep 2021
On Effective Parallelization of Monte Carlo Tree Search
Hoang Trung-Dung
Yitao Liang
Ji Liu
Guy Van den Broeck
Jianshu Chen
23
5
0
15 Jun 2020
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Anders Jonsson
E. Kaufmann
Pierre Ménard
O. D. Domingues
Edouard Leurent
Michal Valko
68
35
0
10 Jun 2020
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
Weichao Mao
Kai Zhang
Qiaomin Xie
Tamer Basar
148
14
0
08 Jun 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
43
11
0
25 Feb 2020
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
231
1,228
0
24 Nov 2019
Combining No-regret and Q-learning
Ian A. Kash
Michael Sullins
Katja Hofmann
OffRL
105
17
0
07 Oct 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
34
9
0
14 Feb 2019
Pure Exploration with Multiple Correct Answers
Rémy Degenne
Wouter M. Koolen
87
91
0
09 Feb 2019
Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals
E. Kaufmann
Wouter M. Koolen
128
123
0
28 Nov 2018
Feature selection as Monte-Carlo Search in Growing Single Rooted Directed Acyclic Graph by Best Leaf Identification
A. Pelissier
Atsuyoshi Nakamura
Koji Tabata
15
6
0
19 Nov 2018
The Potential of the Return Distribution for Exploration in RL
Thomas M. Moerland
Joost Broekens
Catholijn M. Jonker
74
9
0
11 Jun 2018
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
E. Kaufmann
Wouter M. Koolen
Aurélien Garivier
89
27
0
04 Jun 2018
Feedback-Based Tree Search for Reinforcement Learning
Daniel R. Jiang
E. Ekwedike
Han Liu
125
29
0
15 May 2018
AutoML from Service Provider's Perspective: Multi-device, Multi-tenant Model Selection with GP-EI
Chen Yu
Bojan Karlas
Jie Zhong
Ce Zhang
Ji Liu
30
6
0
17 Mar 2018
Structured Best Arm Identification with Fixed Confidence
Ruitong Huang
Mohammad M. Ajallooeian
Csaba Szepesvári
Martin Müller
100
25
0
16 Jun 2017