OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits

24 May 2019
Niladri S. Chatterji
Vidya Muthukumar
Peter L. Bartlett

Papers citing "OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits"

17 / 17 papers shown
Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback
Tanmay Goyal
Gaurav Sinha
20
0
0
16 Jun 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Zhuohua Li
Maoli Liu
Xiangxiang Dai
John C. S. Lui
80
2
0
03 Jan 2025
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
Emma Brunskill
OffRL
83
13
0
03 Nov 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
68
10
0
23 Jul 2022
Best of Both Worlds Model Selection
Aldo Pacchiano
Christoph Dann
Claudio Gentile
89
10
0
29 Jun 2022
Breaking the $\sqrt{T}$ Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits
Avishek Ghosh
Abishek Sankararaman
52
4
0
19 May 2022
On Dynamic Pricing with Covariates
Hanrui Wang
Kalyan Talluri
Xiaocheng Li
56
11
0
25 Dec 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
64
10
0
02 Nov 2021
Model Selection for Generic Contextual Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
76
6
0
07 Jul 2021
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL
Weitong Zhang
Jiafan He
Dongruo Zhou
Amy Zhang
Quanquan Gu
OffRL
72
11
0
22 Jun 2021
Leveraging Good Representations in Linear Contextual Bandits
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
73
27
0
08 Apr 2021
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
Aldo Pacchiano
Christoph Dann
Claudio Gentile
Peter L. Bartlett
100
49
0
24 Dec 2020
Corralling Stochastic Bandit Algorithms
R. Arora
T. V. Marinov
M. Mohri
115
35
0
16 Jun 2020
Regret Balancing for Bandit and RL Model Selection
Yasin Abbasi-Yadkori
Aldo Pacchiano
My Phan
87
26
0
09 Jun 2020
Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
80
34
0
04 Jun 2020
Model Selection in Contextual Stochastic Bandit Problems
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
203
94
0
03 Mar 2020
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
216
90
0
03 Jun 2019