2107.05745
Cited By
Adapting to Misspecification in Contextual Bandits
12 July 2021
Dylan J. Foster
Claudio Gentile
M. Mohri
Julian Zimmert
Papers citing
"Adapting to Misspecification in Contextual Bandits"
50 / 61 papers shown
Improved Training Mechanism for Reinforcement Learning via Online Model Selection
Aida Afshar
Aldo Pacchiano
40
0
0
01 Dec 2025
A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker Conditions
Annual Conference Computational Learning Theory (COLT), 2025
Junfan Li
Shizhong Liao
Zenglin Xu
L. Nie
80
0
0
31 Oct 2025
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback
Orin Levy
Liad Erez
Alon Cohen
Yishay Mansour
80
0
0
10 Oct 2025
Non-Linear Model-Based Sequential Decision-Making in Agriculture
Sakshi Arya
Wentao Lin
124
0
0
02 Sep 2025
Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient?
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Hwanwoo Kim
Chong Liu
Yuxin Chen
185
2
0
13 Jun 2025
Offline-to-online hyperparameter transfer for stochastic bandits
AAAI Conference on Artificial Intelligence (AAAI), 2025
Dravyansh Sharma
Arun Sai Suggala
OffRL
279
8
0
06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
International Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei
Christoph Dann
Julian Zimmert
281
48
0
31 Dec 2024
Symmetric Linear Bandits with Hidden Symmetry
Nam-Phuong Tran
T. Ta
Debmalya Mandal
Long Tran-Thanh
306
1
0
22 May 2024
Diffusion Models Meet Contextual Bandits
Imad Aouali
DiffM
256
5
0
15 Feb 2024
Robust Causal Bandits for Linear Models
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2023
Zirui Yan
Arpan Mukherjee
Burak Varici
A. Tajer
CML
227
4
0
30 Oct 2023
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
Neural Information Processing Systems (NeurIPS), 2023
Chen Ye
Rui Yang
Quanquan Gu
Tong Zhang
OffRL
380
29
0
23 Oct 2023
Bayesian Design Principles for Frequentist Sequential Learning
International Conference on Machine Learning (ICML), 2023
Yunbei Xu
A. Zeevi
474
16
0
01 Oct 2023
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
International Conference on Algorithmic Learning Theory (ALT), 2023
Shubhada Agrawal
Timothée Mathieu
D. Basu
Odalric-Ambrym Maillard
185
3
0
28 Sep 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits
Neural Information Processing Systems (NeurIPS), 2023
Haolin Liu
Chen-Yu Wei
Julian Zimmert
236
11
0
02 Sep 2023
On the Model-Misspecification in Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yunfan Li
Lin F. Yang
262
6
0
19 Jun 2023
Does Sparsity Help in Learning Misspecified Linear Bandits?
International Conference on Machine Learning (ICML), 2023
Jialin Dong
Lin F. Yang
213
2
0
29 Mar 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
International Conference on Machine Learning (ICML), 2023
Weitong Zhang
Jiafan He
Zhiyuan Fan
Q. Gu
217
6
0
16 Mar 2023
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds
Annual Conference Computational Learning Theory (COLT), 2023
Shinji Ito
Kei Takemura
142
13
0
24 Feb 2023
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
Annual Conference Computational Learning Theory (COLT), 2023
Christoph Dann
Chen-Yu Wei
Julian Zimmert
213
28
0
20 Feb 2023
Practical Contextual Bandits with Feedback Graphs
Neural Information Processing Systems (NeurIPS), 2023
Mengxiao Zhang
Yuheng Zhang
Olga Vrousgou
Haipeng Luo
Paul Mineiro
277
9
0
17 Feb 2023
Infinite Action Contextual Bandits with Reusable Data Exhaust
International Conference on Machine Learning (ICML), 2023
Mark Rucker
Yinglun Zhu
Paul Mineiro
OffRL
274
2
0
16 Feb 2023
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits
Annals of Statistics (Ann. Stat.), 2023
Nived Rajaraman
Yanjun Han
Jiantao Jiao
Kannan Ramchandran
405
3
0
12 Feb 2023
Leveraging User-Triggered Supervision in Contextual Bandits
Alekh Agarwal
Claudio Gentile
T. V. Marinov
157
0
0
07 Feb 2023
Learning to Generate All Feasible Actions
IEEE Access (IEEE Access), 2023
Mirco Theile
Daniele Bernardini
Raphael Trumpp
C. Piazza
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
142
3
0
26 Jan 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
International Conference on Machine Learning (ICML), 2022
Chen Ye
Wei Xiong
Quanquan Gu
Tong Zhang
468
37
0
12 Dec 2022
Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression
Annual Conference Computational Learning Theory (COLT), 2022
Aleksandrs Slivkins
Xingyu Zhou
Karthik Abinav Sankararaman
Dylan J. Foster
263
28
0
14 Nov 2022
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
Annual Conference Computational Learning Theory (COLT), 2022
Osama A. Hanna
Lin F. Yang
Christina Fragouli
268
17
0
08 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
282
3
0
27 Oct 2022
Robust Contextual Linear Bandits
Rong Zhu
Branislav Kveton
192
3
0
26 Oct 2022
Deploying a Steered Query Optimizer in Production at Microsoft
Wangda Zhang
Matteo Interlandi
Paul Mineiro
S. Qiao
Nasim Ghazanfari
Marc T. Friedman
Rafah Hosn
Hiren Patel
Alekh Jindal
128
27
0
24 Oct 2022
Conditionally Risk-Averse Contextual Bandits
Mónika Farsang
Paul Mineiro
Wangda Zhang
204
2
0
24 Oct 2022
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
277
13
0
21 Oct 2022
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces
International Conference on Machine Learning (ICML), 2022
Yinglun Zhu
Paul Mineiro
190
18
0
12 Jul 2022
Best of Both Worlds Model Selection
Neural Information Processing Systems (NeurIPS), 2022
Aldo Pacchiano
Christoph Dann
Claudio Gentile
192
11
0
29 Jun 2022
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions
Neural Information Processing Systems (NeurIPS), 2022
Jiafan He
Dongruo Zhou
Tong Zhang
Quanquan Gu
233
53
0
13 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
217
1
0
30 Mar 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Runzhe Wan
Linjuan Ge
Rui Song
207
13
0
26 Feb 2022
Damped Online Newton Step for Portfolio Selection
Annual Conference Computational Learning Theory (COLT), 2022
Zakaria Mhammedi
Alexander Rakhlin
110
16
0
15 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Annual Conference Computational Learning Theory (COLT), 2022
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi Zhou
198
20
0
12 Feb 2022
Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum States
Annual Conference Computational Learning Theory (COLT), 2022
Julian Zimmert
Naman Agarwal
Satyen Kale
132
19
0
06 Feb 2022
Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification
Saptarshi Chakraborty
Debolina Paul
Swagatam Das
OOD
216
0
0
06 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Aadirupa Saha
A. Krishnamurthy
259
41
0
24 Nov 2021
Misspecified Gaussian Process Bandit Optimization
Neural Information Processing Systems (NeurIPS), 2021
Ilija Bogunovic
Andreas Krause
181
53
0
09 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Neural Information Processing Systems (NeurIPS), 2021
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
165
10
0
02 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
211
25
0
25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions
Heyang Zhao
Dongruo Zhou
Quanquan Gu
AAML
203
24
0
25 Oct 2021
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning
Tong Zhang
174
72
0
02 Oct 2021
Distribution-free Contextual Dynamic Pricing
Yiyun Luo
W. Sun
Yufeng Liu
351
41
0
15 Sep 2021
Improved Algorithms for Misspecified Linear Markov Decision Processes
Daniel Vial
Advait Parulekar
Sanjay Shakkottai
R. Srikant
172
7
0
12 Sep 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Neural Information Processing Systems (NeurIPS), 2021
Runzhe Wan
Linjuan Ge
Rui Song
193
31
0
13 Aug 2021