Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.10040
Cited By
v1
v2
v3
v4 (latest)
OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits
24 May 2019
Niladri S. Chatterji
Vidya Muthukumar
Peter L. Bartlett
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits"
17 / 17 papers shown
Title
Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback
Tanmay Goyal
Gaurav Sinha
20
0
0
16 Jun 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Zhuohua Li
Maoli Liu
Xiangxiang Dai
John C. S. Lui
80
2
0
03 Jan 2025
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
Emma Brunskill
OffRL
83
13
0
03 Nov 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
68
10
0
23 Jul 2022
Best of Both Worlds Model Selection
Aldo Pacchiano
Christoph Dann
Claudio Gentile
89
10
0
29 Jun 2022
Breaking the
T
\sqrt{T}
T
Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits
Avishek Ghosh
Abishek Sankararaman
52
4
0
19 May 2022
On Dynamic Pricing with Covariates
Hanrui Wang
Kalyan Talluri
Xiaocheng Li
56
11
0
25 Dec 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
64
10
0
02 Nov 2021
Model Selection for Generic Contextual Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
76
6
0
07 Jul 2021
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL
Weitong Zhang
Jiafan He
Dongruo Zhou
Amy Zhang
Quanquan Gu
OffRL
72
11
0
22 Jun 2021
Leveraging Good Representations in Linear Contextual Bandits
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
73
27
0
08 Apr 2021
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
Aldo Pacchiano
Christoph Dann
Claudio Gentile
Peter L. Bartlett
100
49
0
24 Dec 2020
Corralling Stochastic Bandit Algorithms
R. Arora
T. V. Marinov
M. Mohri
115
35
0
16 Jun 2020
Regret Balancing for Bandit and RL Model Selection
Yasin Abbasi-Yadkori
Aldo Pacchiano
My Phan
87
26
0
09 Jun 2020
Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
80
34
0
04 Jun 2020
Model Selection in Contextual Stochastic Bandit Problems
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
203
94
0
03 Mar 2020
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
216
90
0
03 Jun 2019
1