An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits

21 June 2020

Papers citing "An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits"

44 / 44 papers shown

Title
Sample-Efficient Alignment for LLMs Zichen Liu Changyu Chen Chao Du Wee Sun Lee Min Lin 102 4 0 03 Nov 2024
AHA: Human-Assisted Out-of-Distribution Generalization and Detection Haoyue Bai Jifan Zhang Robert Nowak 139 7 0 10 Oct 2024
Optimal Design for Human Feedback Subhojyoti Mukherjee Anusha Lalitha Kousha Kalantari Aniket Deshmukh Ge Liu Yifei Ma Branislav Kveton 72 7 0 22 Apr 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits Nicolas Nguyen Imad Aouali András Gyorgy Claire Vernade 102 2 0 08 Feb 2024
Query-Efficient Correlation Clustering with Noisy Oracle Yuko Kuroki Atsushi Miyauchi Francesco Bonchi Wei Chen 63 2 0 02 Feb 2024
Improved Algorithm for Deep Active Learning under Imbalance via Optimal Separation Shyam Nuggehalli Jifan Zhang Lalit P. Jain Robert D. Nowak 109 9 0 14 Dec 2023
Fixed-Budget Best-Arm Identification in Sparse Linear Bandits Recep Can Yavas Vincent Y. F. Tan 49 2 0 01 Nov 2023
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits Subhojyoti Mukherjee Qiaomin Xie Josiah P. Hanna Robert D. Nowak 121 6 0 01 Nov 2023
Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit Shintaro Nakamura Masashi Sugiyama 51 1 0 24 Oct 2023
Efficient and Interpretable Bandit Algorithms Subhojyoti Mukherjee Ruihao Zhu Branislav Kveton FAtt 58 2 0 23 Oct 2023
Optimal Batched Best Arm Identification Tianyuan Jin Yu Yang Jing Tang Xiaokui Xiao Pan Xu 116 3 0 21 Oct 2023
Optimal Exploration is no harder than Thompson Sampling Zhaoqi Li Kevin Jamieson Lalit P. Jain 62 3 0 09 Oct 2023
Experimental Designs for Heteroskedastic Variance Justin Weltz Tanner Fiez Alex Volfovsky Eric B. Laber Blake Mason Houssam Nassif Lalit P. Jain 76 5 0 06 Oct 2023
Price of Safety in Linear Best Arm Identification Xuedong Shang Igor Colin M. Barlier Hamza Cherkaoui LLMSV 58 5 0 15 Sep 2023
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit Shintaro Nakamura Masashi Sugiyama 45 5 0 20 Aug 2023
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity Zhihan Xiong Romain Camilleri Maryam Fazel Lalit P. Jain Kevin Jamieson 123 1 0 27 Jul 2023
No-Regret Linear Bandits beyond Realizability Chong Liu Ming Yin Yu Wang 25 1 0 26 Feb 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits Yihan Du Longbo Huang Wen Sun 99 4 0 09 Feb 2023
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits Subhojyoti Mukherjee Qiaomin Xie Josiah P. Hanna R. Nowak OffRL 99 5 0 29 Jan 2023
Best Arm Identification in Stochastic Bandits: Beyond $β-$ optimality Arpan Mukherjee A. Tajer 64 3 0 10 Jan 2023
Non-Asymptotic Analysis of a UCB-based Top Two Algorithm Marc Jourdan Rémy Degenne 120 9 0 11 Oct 2022
Best Arm Identification with Contextual Information under a Small Gap Masahiro Kato Masaaki Imaizumi Takuya Ishihara T. Kitagawa 64 2 0 15 Sep 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits Arpan Mukherjee A. Tajer 66 6 0 22 Jul 2022
Contextual Bandits with Large Action Spaces: Made Practical Yinglun Zhu Dylan J. Foster John Langford Paul Mineiro 87 30 0 12 Jul 2022
Active Learning with Safety Constraints Romain Camilleri Andrew Wagenmaker Jamie Morgenstern Lalit P. Jain Kevin Jamieson 66 14 0 22 Jun 2022
$Choosing Answers in $\varepsilon$-Best-Answer Identification for Linear Bandits$ Choosing Answers in $\varepsilon$ -Best-Answer Identification for Linear Bandits Marc Jourdan Rémy Degenne 39 1 0 09 Jun 2022
Optimal Best Arm Identification in Two-Armed Bandits with a Fixed Budget under a Small Gap Masahiro Kato Kaito Ariu Masaaki Imaizumi and Masahiro Nomura Chao Qin 74 3 0 12 Jan 2022
Best Arm Identification under Additive Transfer Bandits Ojash Neopane Aaditya Ramdas Aarti Singh 38 2 0 08 Dec 2021
Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers Julian Katz-Samuels Blake Mason Kevin Jamieson R. Nowak 45 0 0 09 Nov 2021
Nearly Optimal Algorithms for Level Set Estimation Blake Mason Romain Camilleri Subhojyoti Mukherjee Kevin Jamieson Robert D. Nowak Lalit P. Jain 77 23 0 02 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification Clémence Réda Andrea Tirinzoni Rémy Degenne 64 10 0 02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit Yihan Du Wei Chen Yuko Kuroki Longbo Huang 104 12 0 29 Oct 2021
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits Yinglun Zhu Julian Katz-Samuels Robert D. Nowak 55 7 0 10 Sep 2021
Pure Exploration in Kernel and Neural Bandits Yinglun Zhu Dongruo Zhou Ruoxi Jiang Quanquan Gu Rebecca Willett Robert D. Nowak 67 16 0 22 Jun 2021
Fixed-Budget Best-Arm Identification in Structured Bandits Javad Azizi Branislav Kveton Mohammad Ghavamzadeh 145 26 0 09 Jun 2021
Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits Junwen Yang Vincent Y. F. Tan 67 26 0 27 May 2021
Improved Algorithms for Agnostic Pool-based Active Classification Julian Katz-Samuels Jifan Zhang Lalit P. Jain Kevin Jamieson 43 23 0 13 May 2021
High-Dimensional Experimental Design and Kernel Bandits Romain Camilleri Julian Katz-Samuels Kevin Jamieson 80 57 0 12 May 2021
Pure Exploration with Structured Preference Feedback Shubham Gupta Aadirupa Saha S. Katariya 72 0 0 12 Apr 2021
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP Zihan Zhang Jiaqi Yang Xiangyang Ji S. Du 108 41 0 29 Jan 2021
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation Yuko Kuroki Junya Honda Masashi Sugiyama OffRL 50 1 0 31 Dec 2020
Improved Confidence Bounds for the Linear Logistic Model and Applications to Linear Bandits Kwang-Sung Jun Lalit P. Jain Blake Mason Houssam Nassif 77 20 0 23 Nov 2020
Experimental Design for Regret Minimization in Linear Bandits Andrew Wagenmaker Julian Katz-Samuels Kevin Jamieson 114 16 0 01 Nov 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality Kwang-Sung Jun Chicheng Zhang 393 10 0 15 Jun 2020