Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.06996
Cited By
Adaptive Exploration in Linear Contextual Bandit
15 October 2019
Botao Hao
Tor Lattimore
Csaba Szepesvári
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adaptive Exploration in Linear Contextual Bandit"
47 / 47 papers shown
Title
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
28
0
0
06 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
J. Gornet
Yilin Mo
Bruno Sinopoli
30
0
0
04 Apr 2025
Sparse Nonparametric Contextual Bandits
Hamish Flynn
Julia Olkhovskaya
Paul Rognon-Vael
53
0
0
20 Mar 2025
Information maximization for a broad variety of multi-armed bandit games
Alex Barbier-Chebbah
Christian L. Vestergaard
Jean-Baptiste Masson
61
0
0
20 Mar 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits
Elise Crépon
Aurélien Garivier
Wouter M. Koolen
47
1
0
29 Jan 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Zhuohua Li
Maoli Liu
Xiangxiang Dai
John C. S. Lui
36
0
0
03 Jan 2025
Optimal Batched Linear Bandits
Xuanfei Ren
Tianyuan Jin
Pan Xu
34
2
0
06 Jun 2024
On Bits and Bandits: Quantifying the Regret-Information Trade-off
Itai Shufaro
Nadav Merlis
Nir Weinberger
Shie Mannor
38
0
0
26 May 2024
A Functional Model Method for Nonconvex Nonsmooth Conditional Stochastic Optimization
Andrzej Ruszczyñski
Shangzhe Yang
31
0
0
17 May 2024
ε
ε
ε
-Policy Gradient for Online Pricing
Lukasz Szpruch
Tanut Treetanthiploet
Yufei Zhang
OffRL
60
1
0
06 May 2024
Differentially Private High Dimensional Bandits
Apurv Shukla
26
0
0
06 Feb 2024
Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach
Wen Huang
Xintao Wu
OffRL
CML
18
0
0
20 Dec 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo
Mohsen Bayati
21
1
0
26 Jun 2023
Federated Linear Contextual Bandits with User-level Differential Privacy
Ruiquan Huang
Huanyu Zhang
Luca Melis
Milan Shen
Meisam Hajzinia
J. Yang
FedML
23
11
0
08 Jun 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Weitong Zhang
Jiafan He
Zhiyuan Fan
Q. Gu
102
5
0
16 Mar 2023
Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications
Johannes Kirschner
Tor Lattimore
Andreas Krause
36
8
0
07 Feb 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle
Hyunwook Kang
P. R. Kumar
OffRL
33
1
0
29 Jan 2023
On the Complexity of Representation Learning in Contextual Linear Bandits
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
32
1
0
19 Dec 2022
On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure
Alessio Russo
Alexandre Proutière
28
2
0
28 Nov 2022
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Andrea Tirinzoni
Matteo Papini
Ahmed Touati
A. Lazaric
Matteo Pirotta
28
4
0
24 Oct 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
27
9
0
23 Jul 2022
Instance-optimal PAC Algorithms for Contextual Bandits
Zhao Li
Lillian J. Ratliff
Houssam Nassif
Kevin G. Jamieson
Lalit P. Jain
12
17
0
05 Jul 2022
Asymptotic Instance-Optimal Algorithms for Interactive Decision Making
Kefan Dong
Tengyu Ma
18
9
0
06 Jun 2022
Residual Bootstrap Exploration for Stochastic Linear Bandit
Shuang Wu
ChiHua Wang
Yuantong Li
Guang Cheng
11
8
0
23 Feb 2022
Truncated LinUCB for Stochastic Linear Bandits
Yanglei Song
Meng zhou
49
0
0
23 Feb 2022
Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach
Filippo Vannella
Alexandre Proutière
Yassir Jedra
Jaeseong Jeong
23
7
0
06 Jan 2022
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Matteo Papini
Andrea Tirinzoni
Aldo Pacchiano
Marcello Restelli
A. Lazaric
Matteo Pirotta
19
18
0
27 Oct 2021
Sequential Estimation under Multiple Resources: a Bandit Point of View
Alireza Masoumian
Shayan Kiyani
Mohammad Hossein Yassaee
31
1
0
29 Sep 2021
A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
13
16
0
24 Jun 2021
Fair Exploration via Axiomatic Bargaining
Jackie Baek
Vivek F. Farias
FaML
16
28
0
04 Jun 2021
Information Directed Sampling for Sparse Linear Bandits
Botao Hao
Tor Lattimore
Wei Deng
19
19
0
29 May 2021
Leveraging Good Representations in Linear Contextual Bandits
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
30
26
0
08 Apr 2021
Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously
Chung-Wei Lee
Haipeng Luo
Chen-Yu Wei
Mengxiao Zhang
Xiaojin Zhang
15
46
0
11 Feb 2021
Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
Kefan Dong
Jiaqi Yang
Tengyu Ma
24
32
0
08 Feb 2021
Asymptotically Optimal Information-Directed Sampling
Johannes Kirschner
Tor Lattimore
Claire Vernade
Csaba Szepesvári
11
33
0
11 Nov 2020
High-Dimensional Sparse Linear Bandits
Botao Hao
Tor Lattimore
Mengdi Wang
6
60
0
08 Nov 2020
Experimental Design for Regret Minimization in Linear Bandits
Andrew Wagenmaker
Julian Katz-Samuels
Kevin G. Jamieson
9
15
0
01 Nov 2020
Regret in Online Recommendation Systems
Kaito Ariu
Narae Ryu
Seyoung Yun
Alexandre Proutière
9
6
0
23 Oct 2020
An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits
Andrea Tirinzoni
Matteo Pirotta
Marcello Restelli
A. Lazaric
14
34
0
23 Oct 2020
Thresholded Lasso Bandit
Kaito Ariu
Kenshi Abe
Alexandre Proutière
21
17
0
22 Oct 2020
Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective
Dylan J. Foster
Alexander Rakhlin
D. Simchi-Levi
Yunzong Xu
21
75
0
07 Oct 2020
Diversity-Preserving K-Armed Bandits, Revisited
Hédi Hadiji
Sébastien Gerchinovitz
Jean-Michel Loubes
Gilles Stoltz
26
2
0
05 Oct 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
15
10
0
15 Jun 2020
A Novel Confidence-Based Algorithm for Structured Bandits
Andrea Tirinzoni
A. Lazaric
Marcello Restelli
13
11
0
23 May 2020
Information Directed Sampling for Linear Partial Monitoring
Johannes Kirschner
Tor Lattimore
Andreas Krause
21
46
0
25 Feb 2020
The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms
Mohsen Bayati
N. Hamidi
Ramesh Johari
Khashayar Khosravi
37
28
0
24 Feb 2020
A General Theory of the Stochastic Linear Bandit and Its Applications
N. Hamidi
Mohsen Bayati
10
3
0
12 Feb 2020
1