Optimal Best-arm Identification in Linear Bandits

29 June 2020

Papers citing "Optimal Best-arm Identification in Linear Bandits"

50 / 53 papers shown

Title
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis Ruiquan Huang Donghao Li Chengshuai Shi Cong Shen Jing Yang OffRL 104 0 0 01 Jul 2025
Experimental Design for Semiparametric Bandits Seok-Jin Kim Gi-Soo Kim Min-hwan Oh 21 0 0 16 Jun 2025
Sample Efficient Demonstration Selection for In-Context Learning Kiran Purohit Venktesh V Sourangshu Bhattacharya Avishek Anand 41 0 0 10 Jun 2025
Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget Jie Bian Vincent Y. F. Tan 61 0 0 03 Jun 2025
Policy Testing in Markov Decision Processes Kaito Ariu Po-An Wang Alexandre Proutiere Kenshi Abe OffRL 54 0 0 21 May 2025
Cost-Aware Optimal Pairwise Pure Exploration Di Wu Chengshuai Shi Ruida Zhou Cong Shen 71 0 0 10 Mar 2025
Pure Exploration with Feedback Graphs Alessio Russo Yichen Song Aldo Pacchiano 72 2 0 10 Mar 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits Elise Crépon Aurélien Garivier Wouter M. Koolen 87 5 0 29 Jan 2025
Online Clustering with Bandit Information G Dhinesh Chandran Srinivas Reddy Kota Srikrishna Bhashyam 114 0 0 20 Jan 2025
Best-Arm Identification in Unimodal Bandits Riccardo Poiani Marc Jourdan E. Kaufmann Rémy Degenne 215 1 0 04 Nov 2024
Near Optimal Pure Exploration in Logistic Bandits Eduardo Ochoa Rivera Ambuj Tewari 101 0 0 28 Oct 2024
Optimal Batched Linear Bandits Xuanfei Ren Tianyuan Jin Pan Xu 56 2 0 06 Jun 2024
Efficient Prompt Optimization Through the Lens of Best Arm Identification Chengshuai Shi Kun Yang Zihan Chen Jundong Li Jing Yang Cong Shen 81 10 0 15 Feb 2024
Optimal Thresholding Linear Bandit Eduardo Ochoa Rivera Ambuj Tewari 62 0 0 11 Feb 2024
Experiment Planning with Function Approximation Aldo Pacchiano Jonathan Lee Emma Brunskill OffRL 70 4 0 10 Jan 2024
Data-driven optimal stopping: A pure exploration analysis Soren Christensen Niklas Dexheimer Claudia Strauch 66 2 0 10 Dec 2023
Fixed-Budget Best-Arm Identification in Sparse Linear Bandits Recep Can Yavas Vincent Y. F. Tan 57 2 0 01 Nov 2023
Towards Instance-Optimality in Online PAC Reinforcement Learning Aymen Al Marjani Andrea Tirinzoni Emilie Kaufmann OffRL 45 5 0 31 Oct 2023
Pure Exploration in Asynchronous Federated Bandits Zichen Wang Chuanhao Li Chenyu Song Lianghui Wang Quanquan Gu Huazheng Wang FedML 73 3 0 17 Oct 2023
Optimal Exploration is no harder than Thompson Sampling Zhaoqi Li Kevin Jamieson Lalit P. Jain 69 3 0 09 Oct 2023
Experimental Designs for Heteroskedastic Variance Justin Weltz Tanner Fiez Alex Volfovsky Eric B. Laber Blake Mason Houssam Nassif Lalit P. Jain 84 5 0 06 Oct 2023
Thompson Exploration with Best Challenger Rule in Best Arm Identification Jongyeong Lee Junya Honda Masashi Sugiyama 76 3 0 01 Oct 2023
Price of Safety in Linear Best Arm Identification Xuedong Shang Igor Colin M. Barlier Hamza Cherkaoui LLMSV 66 5 0 15 Sep 2023
Pure Exploration under Mediators' Feedback Riccardo Poiani Alberto Maria Metelli Marcello Restelli 50 1 0 29 Aug 2023
Certified Multi-Fidelity Zeroth-Order Optimization Étienne de Montbrun Sébastien Gerchinovitz 86 1 0 02 Aug 2023
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity Zhihan Xiong Romain Camilleri Maryam Fazel Lalit P. Jain Kevin Jamieson 137 1 0 27 Jul 2023
Pure Exploration in Bandits with Linear Constraints Emil Carlsson Debabrota Basu Fredrik D. Johansson Devdatt Dubhashi 75 4 0 22 Jun 2023
Cooperative Thresholded Lasso for Sparse Linear Bandit Haniyeh Barghi Xiaotong Cheng S. Maghsudi 80 0 0 30 May 2023
Best Arm Identification in Bandits with Limited Precision Sampling Kota Srinivas Reddy P. Karthik Nikhil Karamchandani Jayakrishnan Nair 64 2 0 10 May 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits Jonathan Lee Weihao Kong Aldo Pacchiano Vidya Muthukumar Emma Brunskill 54 0 0 19 Feb 2023
Active learning for data streams: a survey Davide Cacciarelli M. Kulahci 83 49 0 17 Feb 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits Yihan Du Longbo Huang Wen Sun 99 4 0 09 Feb 2023
Best Arm Identification in Stochastic Bandits: Beyond $β-$ optimality Arpan Mukherjee A. Tajer 64 3 0 10 Jan 2023
Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning Susan Athey Undral Byambadalai Vitor Hadad Sanath Kumar Krishnamurthy Weiwen Leung Joseph Jay Williams 91 14 0 22 Nov 2022
Best Policy Identification in Linear MDPs Jerome Taupin Yassir Jedra Alexandre Proutiere 100 4 0 11 Aug 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits Arpan Mukherjee A. Tajer 66 6 0 22 Jul 2022
$Choosing Answers in $\varepsilon$-Best-Answer Identification for Linear Bandits$ Choosing Answers in $\varepsilon$ -Best-Answer Identification for Linear Bandits Marc Jourdan Rémy Degenne 47 1 0 09 Jun 2022
Information-Directed Selection for Top-Two Algorithms Wei You Chao Qin Zihao Wang Shuoguang Yang 103 14 0 24 May 2022
On Elimination Strategies for Bandit Fixed-Confidence Identification Andrea Tirinzoni Rémy Degenne 88 7 0 22 May 2022
$On the complexity of All $\varepsilon$-Best Arms Identification$ On the complexity of All $\varepsilon$ -Best Arms Identification Aymen Al Marjani Tomás Kocák Aurélien Garivier 100 4 0 13 Feb 2022
Optimal Clustering with Bandit Feedback Junwen Yang Zixin Zhong Vincent Y. F. Tan 65 12 0 09 Feb 2022
Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach Filippo Vannella Alexandre Proutiere Yassir Jedra Jaeseong Jeong 114 7 0 06 Jan 2022
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification Clémence Réda Andrea Tirinzoni Rémy Degenne 64 10 0 02 Nov 2021
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs Han Zhong Jiayi Huang Lin F. Yang Liwei Wang 56 9 0 26 Oct 2021
Design of Experiments for Stochastic Contextual Linear Bandits Andrea Zanette Kefan Dong Jonathan Lee Emma Brunskill OffRL 75 18 0 21 Jul 2021
The Role of Contextual Information in Best Arm Identification Masahiro Kato Kaito Ariu 81 18 0 26 Jun 2021
Fixed-Budget Best-Arm Identification in Structured Bandits Javad Azizi Branislav Kveton Mohammad Ghavamzadeh 167 26 0 09 Jun 2021
Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits Junwen Yang Vincent Y. F. Tan 67 26 0 27 May 2021
Pure Exploration with Structured Preference Feedback Shubham Gupta Aadirupa Saha S. Katariya 86 0 0 12 Apr 2021
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP Zihan Zhang Jiaqi Yang Xiangyang Ji S. Du 108 41 0 29 Jan 2021