Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.16073
Cited By
Optimal Best-arm Identification in Linear Bandits
29 June 2020
Yassir Jedra
Alexandre Proutiere
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Optimal Best-arm Identification in Linear Bandits"
50 / 53 papers shown
Title
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
Ruiquan Huang
Donghao Li
Chengshuai Shi
Cong Shen
Jing Yang
OffRL
104
0
0
01 Jul 2025
Experimental Design for Semiparametric Bandits
Seok-Jin Kim
Gi-Soo Kim
Min-hwan Oh
21
0
0
16 Jun 2025
Sample Efficient Demonstration Selection for In-Context Learning
Kiran Purohit
Venktesh V
Sourangshu Bhattacharya
Avishek Anand
41
0
0
10 Jun 2025
Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget
Jie Bian
Vincent Y. F. Tan
61
0
0
03 Jun 2025
Policy Testing in Markov Decision Processes
Kaito Ariu
Po-An Wang
Alexandre Proutiere
Kenshi Abe
OffRL
54
0
0
21 May 2025
Cost-Aware Optimal Pairwise Pure Exploration
Di Wu
Chengshuai Shi
Ruida Zhou
Cong Shen
71
0
0
10 Mar 2025
Pure Exploration with Feedback Graphs
Alessio Russo
Yichen Song
Aldo Pacchiano
72
2
0
10 Mar 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits
Elise Crépon
Aurélien Garivier
Wouter M. Koolen
87
5
0
29 Jan 2025
Online Clustering with Bandit Information
G Dhinesh Chandran
Srinivas Reddy Kota
Srikrishna Bhashyam
114
0
0
20 Jan 2025
Best-Arm Identification in Unimodal Bandits
Riccardo Poiani
Marc Jourdan
E. Kaufmann
Rémy Degenne
215
1
0
04 Nov 2024
Near Optimal Pure Exploration in Logistic Bandits
Eduardo Ochoa Rivera
Ambuj Tewari
101
0
0
28 Oct 2024
Optimal Batched Linear Bandits
Xuanfei Ren
Tianyuan Jin
Pan Xu
56
2
0
06 Jun 2024
Efficient Prompt Optimization Through the Lens of Best Arm Identification
Chengshuai Shi
Kun Yang
Zihan Chen
Jundong Li
Jing Yang
Cong Shen
81
10
0
15 Feb 2024
Optimal Thresholding Linear Bandit
Eduardo Ochoa Rivera
Ambuj Tewari
62
0
0
11 Feb 2024
Experiment Planning with Function Approximation
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
70
4
0
10 Jan 2024
Data-driven optimal stopping: A pure exploration analysis
Soren Christensen
Niklas Dexheimer
Claudia Strauch
66
2
0
10 Dec 2023
Fixed-Budget Best-Arm Identification in Sparse Linear Bandits
Recep Can Yavas
Vincent Y. F. Tan
57
2
0
01 Nov 2023
Towards Instance-Optimality in Online PAC Reinforcement Learning
Aymen Al Marjani
Andrea Tirinzoni
Emilie Kaufmann
OffRL
45
5
0
31 Oct 2023
Pure Exploration in Asynchronous Federated Bandits
Zichen Wang
Chuanhao Li
Chenyu Song
Lianghui Wang
Quanquan Gu
Huazheng Wang
FedML
73
3
0
17 Oct 2023
Optimal Exploration is no harder than Thompson Sampling
Zhaoqi Li
Kevin Jamieson
Lalit P. Jain
69
3
0
09 Oct 2023
Experimental Designs for Heteroskedastic Variance
Justin Weltz
Tanner Fiez
Alex Volfovsky
Eric B. Laber
Blake Mason
Houssam Nassif
Lalit P. Jain
84
5
0
06 Oct 2023
Thompson Exploration with Best Challenger Rule in Best Arm Identification
Jongyeong Lee
Junya Honda
Masashi Sugiyama
76
3
0
01 Oct 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
66
5
0
15 Sep 2023
Pure Exploration under Mediators' Feedback
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
50
1
0
29 Aug 2023
Certified Multi-Fidelity Zeroth-Order Optimization
Étienne de Montbrun
Sébastien Gerchinovitz
86
1
0
02 Aug 2023
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Zhihan Xiong
Romain Camilleri
Maryam Fazel
Lalit P. Jain
Kevin Jamieson
137
1
0
27 Jul 2023
Pure Exploration in Bandits with Linear Constraints
Emil Carlsson
Debabrota Basu
Fredrik D. Johansson
Devdatt Dubhashi
75
4
0
22 Jun 2023
Cooperative Thresholded Lasso for Sparse Linear Bandit
Haniyeh Barghi
Xiaotong Cheng
S. Maghsudi
80
0
0
30 May 2023
Best Arm Identification in Bandits with Limited Precision Sampling
Kota Srinivas Reddy
P. Karthik
Nikhil Karamchandani
Jayakrishnan Nair
64
2
0
10 May 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits
Jonathan Lee
Weihao Kong
Aldo Pacchiano
Vidya Muthukumar
Emma Brunskill
54
0
0
19 Feb 2023
Active learning for data streams: a survey
Davide Cacciarelli
M. Kulahci
83
49
0
17 Feb 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits
Yihan Du
Longbo Huang
Wen Sun
99
4
0
09 Feb 2023
Best Arm Identification in Stochastic Bandits: Beyond
β
−
β-
β
−
optimality
Arpan Mukherjee
A. Tajer
64
3
0
10 Jan 2023
Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning
Susan Athey
Undral Byambadalai
Vitor Hadad
Sanath Kumar Krishnamurthy
Weiwen Leung
Joseph Jay Williams
91
14
0
22 Nov 2022
Best Policy Identification in Linear MDPs
Jerome Taupin
Yassir Jedra
Alexandre Proutiere
100
4
0
11 Aug 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits
Arpan Mukherjee
A. Tajer
66
6
0
22 Jul 2022
Choosing Answers in
ε
\varepsilon
ε
-Best-Answer Identification for Linear Bandits
Marc Jourdan
Rémy Degenne
47
1
0
09 Jun 2022
Information-Directed Selection for Top-Two Algorithms
Wei You
Chao Qin
Zihao Wang
Shuoguang Yang
103
14
0
24 May 2022
On Elimination Strategies for Bandit Fixed-Confidence Identification
Andrea Tirinzoni
Rémy Degenne
88
7
0
22 May 2022
On the complexity of All
ε
\varepsilon
ε
-Best Arms Identification
Aymen Al Marjani
Tomás Kocák
Aurélien Garivier
100
4
0
13 Feb 2022
Optimal Clustering with Bandit Feedback
Junwen Yang
Zixin Zhong
Vincent Y. F. Tan
65
12
0
09 Feb 2022
Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach
Filippo Vannella
Alexandre Proutiere
Yassir Jedra
Jaeseong Jeong
114
7
0
06 Jan 2022
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
64
10
0
02 Nov 2021
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs
Han Zhong
Jiayi Huang
Lin F. Yang
Liwei Wang
56
9
0
26 Oct 2021
Design of Experiments for Stochastic Contextual Linear Bandits
Andrea Zanette
Kefan Dong
Jonathan Lee
Emma Brunskill
OffRL
75
18
0
21 Jul 2021
The Role of Contextual Information in Best Arm Identification
Masahiro Kato
Kaito Ariu
81
18
0
26 Jun 2021
Fixed-Budget Best-Arm Identification in Structured Bandits
Javad Azizi
Branislav Kveton
Mohammad Ghavamzadeh
167
26
0
09 Jun 2021
Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits
Junwen Yang
Vincent Y. F. Tan
67
26
0
27 May 2021
Pure Exploration with Structured Preference Feedback
Shubham Gupta
Aadirupa Saha
S. Katariya
86
0
0
12 Apr 2021
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
Zihan Zhang
Jiaqi Yang
Xiangyang Ji
S. Du
108
41
0
29 Jan 2021
1
2
Next