2006.11685
Cited By
An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits
21 June 2020
Julian Katz-Samuels
Lalit P. Jain
Zohar Karnin
Kevin Jamieson
Papers citing
"An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits"
44 / 44 papers shown
Title
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min Lin
102
4
0
03 Nov 2024
AHA: Human-Assisted Out-of-Distribution Generalization and Detection
Haoyue Bai
Jifan Zhang
Robert Nowak
139
7
0
10 Oct 2024
Optimal Design for Human Feedback
Subhojyoti Mukherjee
Anusha Lalitha
Kousha Kalantari
Aniket Deshmukh
Ge Liu
Yifei Ma
Branislav Kveton
72
7
0
22 Apr 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
102
2
0
08 Feb 2024
Query-Efficient Correlation Clustering with Noisy Oracle
Yuko Kuroki
Atsushi Miyauchi
Francesco Bonchi
Wei Chen
63
2
0
02 Feb 2024
Improved Algorithm for Deep Active Learning under Imbalance via Optimal Separation
Shyam Nuggehalli
Jifan Zhang
Lalit P. Jain
Robert D. Nowak
109
9
0
14 Dec 2023
Fixed-Budget Best-Arm Identification in Sparse Linear Bandits
Recep Can Yavas
Vincent Y. F. Tan
49
2
0
01 Nov 2023
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
Robert D. Nowak
121
6
0
01 Nov 2023
Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Shintaro Nakamura
Masashi Sugiyama
51
1
0
24 Oct 2023
Efficient and Interpretable Bandit Algorithms
Subhojyoti Mukherjee
Ruihao Zhu
Branislav Kveton
FAtt
58
2
0
23 Oct 2023
Optimal Batched Best Arm Identification
Tianyuan Jin
Yu Yang
Jing Tang
Xiaokui Xiao
Pan Xu
116
3
0
21 Oct 2023
Optimal Exploration is no harder than Thompson Sampling
Zhaoqi Li
Kevin Jamieson
Lalit P. Jain
62
3
0
09 Oct 2023
Experimental Designs for Heteroskedastic Variance
Justin Weltz
Tanner Fiez
Alex Volfovsky
Eric B. Laber
Blake Mason
Houssam Nassif
Lalit P. Jain
76
5
0
06 Oct 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
58
5
0
15 Sep 2023
Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
Shintaro Nakamura
Masashi Sugiyama
45
5
0
20 Aug 2023
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Zhihan Xiong
Romain Camilleri
Maryam Fazel
Lalit P. Jain
Kevin Jamieson
123
1
0
27 Jul 2023
No-Regret Linear Bandits beyond Realizability
Chong Liu
Ming Yin
Yu Wang
25
1
0
26 Feb 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits
Yihan Du
Longbo Huang
Wen Sun
99
4
0
09 Feb 2023
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
99
5
0
29 Jan 2023
Best Arm Identification in Stochastic Bandits: Beyond β-optimality
Arpan Mukherjee
A. Tajer
64
3
0
10 Jan 2023
Non-Asymptotic Analysis of a UCB-based Top Two Algorithm
Marc Jourdan
Rémy Degenne
120
9
0
11 Oct 2022
Best Arm Identification with Contextual Information under a Small Gap
Masahiro Kato
Masaaki Imaizumi
Takuya Ishihara
T. Kitagawa
64
2
0
15 Sep 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits
Arpan Mukherjee
A. Tajer
66
6
0
22 Jul 2022
Contextual Bandits with Large Action Spaces: Made Practical
Yinglun Zhu
Dylan J. Foster
John Langford
Paul Mineiro
87
30
0
12 Jul 2022
Active Learning with Safety Constraints
Romain Camilleri
Andrew Wagenmaker
Jamie Morgenstern
Lalit P. Jain
Kevin Jamieson
66
14
0
22 Jun 2022
Choosing Answers in ε-Best-Answer Identification for Linear Bandits
Marc Jourdan
Rémy Degenne
39
1
0
09 Jun 2022
Optimal Best Arm Identification in Two-Armed Bandits with a Fixed Budget under a Small Gap
Masahiro Kato
Kaito Ariu
Masaaki Imaizumi
Masahiro Nomura
Chao Qin
74
3
0
12 Jan 2022
Best Arm Identification under Additive Transfer Bandits
Ojash Neopane
Aaditya Ramdas
Aarti Singh
38
2
0
08 Dec 2021
Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers
Julian Katz-Samuels
Blake Mason
Kevin Jamieson
R. Nowak
45
0
0
09 Nov 2021
Nearly Optimal Algorithms for Level Set Estimation
Blake Mason
Romain Camilleri
Subhojyoti Mukherjee
Kevin Jamieson
Robert D. Nowak
Lalit P. Jain
77
23
0
02 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
64
10
0
02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit
Yihan Du
Wei Chen
Yuko Kuroki
Longbo Huang
104
12
0
29 Oct 2021
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits
Yinglun Zhu
Julian Katz-Samuels
Robert D. Nowak
55
7
0
10 Sep 2021
Pure Exploration in Kernel and Neural Bandits
Yinglun Zhu
Dongruo Zhou
Ruoxi Jiang
Quanquan Gu
Rebecca Willett
Robert D. Nowak
67
16
0
22 Jun 2021
Fixed-Budget Best-Arm Identification in Structured Bandits
Javad Azizi
Branislav Kveton
Mohammad Ghavamzadeh
145
26
0
09 Jun 2021
Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits
Junwen Yang
Vincent Y. F. Tan
67
26
0
27 May 2021
Improved Algorithms for Agnostic Pool-based Active Classification
Julian Katz-Samuels
Jifan Zhang
Lalit P. Jain
Kevin Jamieson
43
23
0
13 May 2021
High-Dimensional Experimental Design and Kernel Bandits
Romain Camilleri
Julian Katz-Samuels
Kevin Jamieson
80
57
0
12 May 2021
Pure Exploration with Structured Preference Feedback
Shubham Gupta
Aadirupa Saha
S. Katariya
72
0
0
12 Apr 2021
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
Zihan Zhang
Jiaqi Yang
Xiangyang Ji
S. Du
108
41
0
29 Jan 2021
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation
Yuko Kuroki
Junya Honda
Masashi Sugiyama
OffRL
50
1
0
31 Dec 2020
Improved Confidence Bounds for the Linear Logistic Model and Applications to Linear Bandits
Kwang-Sung Jun
Lalit P. Jain
Blake Mason
Houssam Nassif
77
20
0
23 Nov 2020
Experimental Design for Regret Minimization in Linear Bandits
Andrew Wagenmaker
Julian Katz-Samuels
Kevin Jamieson
114
16
0
01 Nov 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
393
10
0
15 Jun 2020