2003.01704
Cited By
v1
v2
v3 (latest)
Model Selection in Contextual Stochastic Bandit Problems
Neural Information Processing Systems (NeurIPS), 2020
3 March 2020
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
Papers citing
"Model Selection in Contextual Stochastic Bandit Problems"
50 / 92 papers shown
Improved Training Mechanism for Reinforcement Learning via Online Model Selection
Aida Afshar
Aldo Pacchiano
88
0
0
01 Dec 2025
UCB-type Algorithm for Budget-Constrained Expert Learning
Ilgam Latypov
A. Suvorikova
Alexey Kroshnin
Alexander Gasnikov
Yuriy Dorn
149
0
0
26 Oct 2025
Offline-to-online hyperparameter transfer for stochastic bandits
AAAI Conference on Artificial Intelligence (AAAI), 2025
Dravyansh Sharma
Arun Sai Suggala
OffRL
371
8
0
06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
International Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei
Christoph Dann
Julian Zimmert
402
51
0
31 Dec 2024
State-free Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Mingyu Chen
Aldo Pacchiano
Xuezhou Zhang
368
0
0
27 Sep 2024
Stochastic Bandits Robust to Adversarial Attacks
Xuchuang Wang
Jinhang Zuo
Xutong Liu
John C. S. Lui
Mohammad Hajiesmaili
AAML
183
2
0
16 Aug 2024
Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives
Aida Afshar
Aldo Pacchiano
234
0
0
07 Aug 2024
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu
Idan Attias
Daniel M. Roy
CML
255
2
0
01 Jul 2024
Efficient Sequential Decision Making with Large Language Models
Dingyang Chen
Qi Zhang
Yinglun Zhu
LRM
464
12
0
17 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
330
1
0
06 Jun 2024
Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Tianyuan Jin
Kyoungseok Jang
Nicolò Cesa-Bianchi
273
1
0
03 Jun 2024
Online Bandits with (Biased) Offline Data: Adaptive Learning under Distribution Mismatch
International Conference on Machine Learning (ICML), 2024
Wang Chi Cheung
Lixing Lyu
OffRL
477
12
0
04 May 2024
Neural Active Learning Beyond Bandits
Yikun Ban
Ishika Agarwal
Ziwei Wu
Yada Zhu
Kommy Weldemariam
Hanghang Tong
Jingrui He
303
14
0
18 Apr 2024
The SMART approach to instance-optimal online learning
Siddhartha Banerjee
Alankrita Bhatt
Chao Yu
238
0
0
27 Feb 2024
Understanding Model Selection For Learning In Strategic Environments
Neural Information Processing Systems (NeurIPS), 2024
Tinashe Handina
Eric Mazumdar
272
2
0
12 Feb 2024
Budgeted Online Model Selection and Fine-Tuning via Federated Learning
P. M. Ghari
Yanning Shen
FedML
373
2
0
19 Jan 2024
Experiment Planning with Function Approximation
Neural Information Processing Systems (NeurIPS), 2024
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
239
6
0
10 Jan 2024
Best-of-Both-Worlds Algorithms for Linear Contextual Bandits
Yuko Kuroki
Alberto Rumi
Taira Tsuchiya
Fabio Vitale
Nicolò Cesa-Bianchi
338
13
0
24 Dec 2023
Online Clustering of Bandits with Misspecified User Models
Neural Information Processing Systems (NeurIPS), 2023
Zhiyong Wang
Jize Xie
Xutong Liu
Shuai Li
J. C. Lui
394
15
0
04 Oct 2023
Anytime Model Selection in Linear Bandits
Neural Information Processing Systems (NeurIPS), 2023
Parnian Kassraie
N. Emmenegger
Andreas Krause
Aldo Pacchiano
407
7
0
24 Jul 2023
Active Policy Improvement from Multiple Black-box Oracles
International Conference on Machine Learning (ICML), 2023
Xuefeng Liu
Takuma Yoneda
Simon Mahns
Matthew R. Walter
Yuxin Chen
450
13
0
17 Jun 2023
Data-Driven Online Model Selection With Regret Guarantees
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Aldo Pacchiano
Christoph Dann
Claudio Gentile
OffRL
453
11
0
05 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions
Neural Information Processing Systems (NeurIPS), 2023
Yue Kang
Cho-Jui Hsieh
T. C. Lee
AAML
283
15
0
29 May 2023
Adaptation to Misspecified Kernel Regularity in Kernelised Bandits
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yusha Liu
Aarti Singh
356
3
0
26 Apr 2023
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds
Annual Conference on Computational Learning Theory (COLT), 2023
Shinji Ito
Kei Takemura
192
15
0
24 Feb 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits
Jonathan Lee
Weihao Kong
Aldo Pacchiano
Vidya Muthukumar
Emma Brunskill
283
0
0
19 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
254
3
0
16 Feb 2023
Learning Complementary Policies for Human-AI Teams
Ruijiang Gao
M. Saar-Tsechansky
Maria De-Arteaga
364
11
0
06 Feb 2023
On the Complexity of Representation Learning in Contextual Linear Bandits
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
258
1
0
19 Dec 2022
Stochastic Rising Bandits
International Conference on Machine Learning (ICML), 2022
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
200
19
0
07 Dec 2022
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
Annual Conference on Computational Learning Theory (COLT), 2022
Osama A. Hanna
Lin F. Yang
Christina Fragouli
380
20
0
08 Nov 2022
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
Emma Brunskill
OffRL
391
14
0
03 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
397
3
0
27 Oct 2022
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Neural Information Processing Systems (NeurIPS), 2022
Andrea Tirinzoni
Matteo Papini
Ahmed Touati
A. Lazaric
Matteo Pirotta
306
6
0
24 Oct 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Neural Information Processing Systems (NeurIPS), 2022
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
261
100
0
18 Oct 2022
Unsupervised Model Selection for Time-series Anomaly Detection
International Conference on Learning Representations (ICLR), 2022
Mononito Goswami
Cristian Challu
Laurent Callot
Lenon Minorics
Andrey Kan
OOD
499
40
0
03 Oct 2022
Neural Design for Genetic Perturbation Experiments
International Conference on Learning Representations (ICLR), 2022
Aldo Pacchiano
Drausin Wulsin
Robert A. Barton
L. Voloch
327
7
0
26 Jul 2022
Model Selection in Reinforcement Learning with General Function Approximations
Avishek Ghosh
Sayak Ray Chowdhury
193
3
0
06 Jul 2022
Best of Both Worlds Model Selection
Neural Information Processing Systems (NeurIPS), 2022
Aldo Pacchiano
Christoph Dann
Claudio Gentile
250
11
0
29 Jun 2022
Adversarial Bandits against Arbitrary Strategies
Jung-hun Kim
Se-Young Yun
454
1
0
30 May 2022
Leveraging Initial Hints for Free in Stochastic Linear Bandits
International Conference on Algorithmic Learning Theory (ALT), 2022
Ashok Cutkosky
Christoph Dann
Abhimanyu Das
Qiuyi Zhang
184
6
0
08 Mar 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Annual Conference on Computational Learning Theory (COLT), 2022
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi Zhou
261
22
0
12 Feb 2022
Model Selection in Batch Policy Optimization
International Conference on Machine Learning (ICML), 2021
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
OffRL
252
12
0
23 Dec 2021
Neural Pseudo-Label Optimism for the Bank Loan Problem
Neural Information Processing Systems (NeurIPS), 2021
Aldo Pacchiano
Shaun Singh
Edward Chou
Alexander C. Berg
Jakob N. Foerster
180
8
0
03 Dec 2021
Misspecified Gaussian Process Bandit Optimization
Neural Information Processing Systems (NeurIPS), 2021
Ilija Bogunovic
Andreas Krause
266
57
0
09 Nov 2021
Universal and data-adaptive algorithms for model selection in linear contextual bandits
Vidya Muthukumar
A. Krishnamurthy
316
5
0
08 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Neural Information Processing Systems (NeurIPS), 2021
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
241
11
0
02 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
258
28
0
25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions
Heyang Zhao
Dongruo Zhou
Quanquan Gu
AAML
257
25
0
25 Oct 2021
Distribution-free Contextual Dynamic Pricing
Yiyun Luo
W. Sun
Yufeng Liu
471
44
0
15 Sep 2021