Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.02986
Cited By
v1
v2 (latest)
Monte-Carlo Tree Search by Best Arm Identification
9 June 2017
E. Kaufmann
Wouter M. Koolen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Monte-Carlo Tree Search by Best Arm Identification"
19 / 19 papers shown
Title
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning
Conor F. Hayes
Mathieu Reymond
D. Roijers
Enda Howley
Patrick Mannion
54
4
0
23 Nov 2022
An Efficient Dynamic Sampling Policy For Monte Carlo Tree Search
Gongbo Zhang
Yijie Peng
Yilong Xu
51
9
0
26 Apr 2022
Best Arm Identification under Additive Transfer Bandits
Ojash Neopane
Aaditya Ramdas
Aarti Singh
52
2
0
08 Dec 2021
Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari
B. Ayton
Masataro Asai
VLM
OffRL
97
1
0
30 Sep 2021
On Effective Parallelization of Monte Carlo Tree Search
Hoang Trung-Dung
Yitao Liang
Ji Liu
Guy Van den Broeck
Jianshu Chen
23
5
0
15 Jun 2020
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Anders Jonsson
E. Kaufmann
Pierre Ménard
O. D. Domingues
Edouard Leurent
Michal Valko
68
35
0
10 Jun 2020
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
Weichao Mao
Kai Zhang
Qiaomin Xie
Tamer Basar
148
14
0
08 Jun 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
43
11
0
25 Feb 2020
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
231
1,228
0
24 Nov 2019
Combining No-regret and Q-learning
Ian A. Kash
Michael Sullins
Katja Hofmann
OffRL
105
17
0
07 Oct 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
34
9
0
14 Feb 2019
Pure Exploration with Multiple Correct Answers
Rémy Degenne
Wouter M. Koolen
87
91
0
09 Feb 2019
Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals
E. Kaufmann
Wouter M. Koolen
128
123
0
28 Nov 2018
Feature selection as Monte-Carlo Search in Growing Single Rooted Directed Acyclic Graph by Best Leaf Identification
A. Pelissier
Atsuyoshi Nakamura
Koji Tabata
15
6
0
19 Nov 2018
The Potential of the Return Distribution for Exploration in RL
Thomas M. Moerland
Joost Broekens
Catholijn M. Jonker
74
9
0
11 Jun 2018
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
E. Kaufmann
Wouter M. Koolen
Aurélien Garivier
89
27
0
04 Jun 2018
Feedback-Based Tree Search for Reinforcement Learning
Daniel R. Jiang
E. Ekwedike
Han Liu
125
29
0
15 May 2018
AutoML from Service Provider's Perspective: Multi-device, Multi-tenant Model Selection with GP-EI
Chen Yu
Bojan Karlas
Jie Zhong
Ce Zhang
Ji Liu
30
6
0
17 Mar 2018
Structured Best Arm Identification with Fixed Confidence
Ruitong Huang
Mohammad M. Ajallooeian
Csaba Szepesvári
Martin Müller
100
25
0
16 Jun 2017
1