Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2107.05745
Cited By
Adapting to Misspecification in Contextual Bandits
12 July 2021
Dylan J. Foster
Claudio Gentile
M. Mohri
Julian Zimmert
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adapting to Misspecification in Contextual Bandits"
50 / 61 papers shown
Improved Training Mechanism for Reinforcement Learning via Online Model Selection
Aida Afshar
Aldo Pacchiano
62
0
0
01 Dec 2025
A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker Conditions
Annual Conference Computational Learning Theory (COLT), 2025
Junfan Li
Shizhong Liao
Zenglin Xu
L. Nie
100
0
0
31 Oct 2025
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback
Orin Levy
Liad Erez
Alon Cohen
Yishay Mansour
115
0
0
10 Oct 2025
Non-Linear Model-Based Sequential Decision-Making in Agriculture
Sakshi Arya
Wentao Lin
168
0
0
02 Sep 2025
Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient?
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Hwanwoo Kim
Chong Liu
Yuxin Chen
230
2
0
13 Jun 2025
Offline-to-online hyperparameter transfer for stochastic bandits
AAAI Conference on Artificial Intelligence (AAAI), 2025
Dravyansh Sharma
Arun Sai Suggala
OffRL
300
8
0
06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
International Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei
Christoph Dann
Julian Zimmert
307
49
0
31 Dec 2024
Symmetric Linear Bandits with Hidden Symmetry
Nam-Phuong Tran
T. Ta
Debmalya Mandal
Long Tran-Thanh
339
1
0
22 May 2024
Diffusion Models Meet Contextual Bandits
Imad Aouali
DiffM
306
5
0
15 Feb 2024
Robust Causal Bandits for Linear Models
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2023
Zirui Yan
Arpan Mukherjee
Burak Varici
A. Tajer
CML
274
4
0
30 Oct 2023
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
Neural Information Processing Systems (NeurIPS), 2023
Chen Ye
Rui Yang
Quanquan Gu
Tong Zhang
OffRL
417
30
0
23 Oct 2023
Bayesian Design Principles for Frequentist Sequential Learning
International Conference on Machine Learning (ICML), 2023
Yunbei Xu
A. Zeevi
540
17
0
01 Oct 2023
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
International Conference on Algorithmic Learning Theory (ALT), 2023
Shubhada Agrawal
Timothée Mathieu
D. Basu
Odalric-Ambrym Maillard
234
3
0
28 Sep 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits
Neural Information Processing Systems (NeurIPS), 2023
Haolin Liu
Chen-Yu Wei
Julian Zimmert
250
11
0
02 Sep 2023
On the Model-Misspecification in Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yunfan Li
Lin F. Yang
291
6
0
19 Jun 2023
Does Sparsity Help in Learning Misspecified Linear Bandits?
International Conference on Machine Learning (ICML), 2023
Jialin Dong
Lin F. Yang
253
2
0
29 Mar 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
International Conference on Machine Learning (ICML), 2023
Weitong Zhang
Jiafan He
Zhiyuan Fan
Q. Gu
238
6
0
16 Mar 2023
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds
Annual Conference Computational Learning Theory (COLT), 2023
Shinji Ito
Kei Takemura
164
14
0
24 Feb 2023
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
Annual Conference Computational Learning Theory (COLT), 2023
Christoph Dann
Chen-Yu Wei
Julian Zimmert
244
29
0
20 Feb 2023
Practical Contextual Bandits with Feedback Graphs
Neural Information Processing Systems (NeurIPS), 2023
Mengxiao Zhang
Yuheng Zhang
Olga Vrousgou
Haipeng Luo
Paul Mineiro
345
9
0
17 Feb 2023
Infinite Action Contextual Bandits with Reusable Data Exhaust
International Conference on Machine Learning (ICML), 2023
Mark Rucker
Yinglun Zhu
Paul Mineiro
OffRL
315
2
0
16 Feb 2023
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits
Annals of Statistics (Ann. Stat.), 2023
Nived Rajaraman
Yanjun Han
Jiantao Jiao
Kannan Ramchandran
489
4
0
12 Feb 2023
Leveraging User-Triggered Supervision in Contextual Bandits
Alekh Agarwal
Claudio Gentile
T. V. Marinov
181
0
0
07 Feb 2023
Learning to Generate All Feasible Actions
IEEE Access (IEEE Access), 2023
Mirco Theile
Daniele Bernardini
Raphael Trumpp
C. Piazza
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
169
3
0
26 Jan 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
International Conference on Machine Learning (ICML), 2022
Chen Ye
Wei Xiong
Quanquan Gu
Tong Zhang
536
37
0
12 Dec 2022
Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression
Annual Conference Computational Learning Theory (COLT), 2022
Aleksandrs Slivkins
Xingyu Zhou
Karthik Abinav Sankararaman
Dylan J. Foster
314
28
0
14 Nov 2022
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
Annual Conference Computational Learning Theory (COLT), 2022
Osama A. Hanna
Lin F. Yang
Christina Fragouli
344
17
0
08 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
328
3
0
27 Oct 2022
Robust Contextual Linear Bandits
Rong Zhu
Branislav Kveton
215
3
0
26 Oct 2022
Deploying a Steered Query Optimizer in Production at Microsoft
Wangda Zhang
Matteo Interlandi
Paul Mineiro
S. Qiao
Nasim Ghazanfari
Marc T. Friedman
Rafah Hosn
Hiren Patel
Alekh Jindal
142
27
0
24 Oct 2022
Conditionally Risk-Averse Contextual Bandits
Mónika Farsang
Paul Mineiro
Wangda Zhang
264
2
0
24 Oct 2022
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
302
13
0
21 Oct 2022
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces
International Conference on Machine Learning (ICML), 2022
Yinglun Zhu
Paul Mineiro
224
18
0
12 Jul 2022
Best of Both Worlds Model Selection
Neural Information Processing Systems (NeurIPS), 2022
Aldo Pacchiano
Christoph Dann
Claudio Gentile
220
11
0
29 Jun 2022
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions
Neural Information Processing Systems (NeurIPS), 2022
Jiafan He
Dongruo Zhou
Tong Zhang
Quanquan Gu
265
53
0
13 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
238
1
0
30 Mar 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Runzhe Wan
Linjuan Ge
Rui Song
221
13
0
26 Feb 2022
Damped Online Newton Step for Portfolio Selection
Annual Conference Computational Learning Theory (COLT), 2022
Zakaria Mhammedi
Alexander Rakhlin
130
16
0
15 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Annual Conference Computational Learning Theory (COLT), 2022
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi Zhou
228
20
0
12 Feb 2022
Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum States
Annual Conference Computational Learning Theory (COLT), 2022
Julian Zimmert
Naman Agarwal
Satyen Kale
149
19
0
06 Feb 2022
Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification
Saptarshi Chakraborty
Debolina Paul
Swagatam Das
OOD
256
0
0
06 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Aadirupa Saha
A. Krishnamurthy
284
41
0
24 Nov 2021
Misspecified Gaussian Process Bandit Optimization
Neural Information Processing Systems (NeurIPS), 2021
Ilija Bogunovic
Andreas Krause
209
54
0
09 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Neural Information Processing Systems (NeurIPS), 2021
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
195
10
0
02 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
234
25
0
25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions
Heyang Zhao
Dongruo Zhou
Quanquan Gu
AAML
220
24
0
25 Oct 2021
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning
Tong Zhang
191
74
0
02 Oct 2021
Distribution-free Contextual Dynamic Pricing
Yiyun Luo
W. Sun
Yufeng Liu
388
43
0
15 Sep 2021
Improved Algorithms for Misspecified Linear Markov Decision Processes
Daniel Vial
Advait Parulekar
Sanjay Shakkottai
R. Srikant
201
7
0
12 Sep 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Neural Information Processing Systems (NeurIPS), 2021
Runzhe Wan
Linjuan Ge
Rui Song
218
31
0
13 Aug 2021
1
2
Next
Page 1 of 2