ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.05745
  4. Cited By
Adapting to Misspecification in Contextual Bandits

Adapting to Misspecification in Contextual Bandits

12 July 2021
Dylan J. Foster
Claudio Gentile
M. Mohri
Julian Zimmert
ArXiv (abs)PDFHTML

Papers citing "Adapting to Misspecification in Contextual Bandits"

50 / 61 papers shown
Improved Training Mechanism for Reinforcement Learning via Online Model Selection
Improved Training Mechanism for Reinforcement Learning via Online Model Selection
Aida Afshar
Aldo Pacchiano
62
0
0
01 Dec 2025
A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker Conditions
A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker ConditionsAnnual Conference Computational Learning Theory (COLT), 2025
Junfan Li
Shizhong Liao
Zenglin Xu
L. Nie
100
0
0
31 Oct 2025
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback
Orin Levy
Liad Erez
Alon Cohen
Yishay Mansour
115
0
0
10 Oct 2025
Non-Linear Model-Based Sequential Decision-Making in Agriculture
Non-Linear Model-Based Sequential Decision-Making in Agriculture
Sakshi Arya
Wentao Lin
168
0
0
02 Sep 2025
Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient?
Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient?Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Hwanwoo Kim
Chong Liu
Yuxin Chen
230
2
0
13 Jun 2025
Offline-to-online hyperparameter transfer for stochastic banditsAAAI Conference on Artificial Intelligence (AAAI), 2025
Dravyansh Sharma
Arun Sai Suggala
OffRL
300
8
0
06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
A Model Selection Approach for Corruption Robust Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei
Christoph Dann
Julian Zimmert
307
49
0
31 Dec 2024
Symmetric Linear Bandits with Hidden Symmetry
Symmetric Linear Bandits with Hidden Symmetry
Nam-Phuong Tran
T. Ta
Debmalya Mandal
Long Tran-Thanh
339
1
0
22 May 2024
Diffusion Models Meet Contextual Bandits
Diffusion Models Meet Contextual Bandits
Imad Aouali
DiffM
306
5
0
15 Feb 2024
Robust Causal Bandits for Linear Models
Robust Causal Bandits for Linear ModelsIEEE Journal on Selected Areas in Information Theory (JSAIT), 2023
Zirui Yan
Arpan Mukherjee
Burak Varici
A. Tajer
CML
274
4
0
30 Oct 2023
Corruption-Robust Offline Reinforcement Learning with General Function
  Approximation
Corruption-Robust Offline Reinforcement Learning with General Function ApproximationNeural Information Processing Systems (NeurIPS), 2023
Chen Ye
Rui Yang
Quanquan Gu
Tong Zhang
OffRL
417
30
0
23 Oct 2023
Bayesian Design Principles for Frequentist Sequential Learning
Bayesian Design Principles for Frequentist Sequential LearningInternational Conference on Machine Learning (ICML), 2023
Yunbei Xu
A. Zeevi
540
17
0
01 Oct 2023
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded
  Stochastic Corruption
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic CorruptionInternational Conference on Algorithmic Learning Theory (ALT), 2023
Shubhada Agrawal
Timothée Mathieu
D. Basu
Odalric-Ambrym Maillard
234
3
0
28 Sep 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual
  Bandits
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual BanditsNeural Information Processing Systems (NeurIPS), 2023
Haolin Liu
Chen-Yu Wei
Julian Zimmert
250
11
0
02 Sep 2023
On the Model-Misspecification in Reinforcement Learning
On the Model-Misspecification in Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yunfan Li
Lin F. Yang
291
6
0
19 Jun 2023
Does Sparsity Help in Learning Misspecified Linear Bandits?
Does Sparsity Help in Learning Misspecified Linear Bandits?International Conference on Machine Learning (ICML), 2023
Jialin Dong
Lin F. Yang
253
2
0
29 Mar 2023
On the Interplay Between Misspecification and Sub-optimality Gap in
  Linear Contextual Bandits
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual BanditsInternational Conference on Machine Learning (ICML), 2023
Weitong Zhang
Jiafan He
Zhiyuan Fan
Q. Gu
238
6
0
16 Mar 2023
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive
  Regret Bounds
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret BoundsAnnual Conference Computational Learning Theory (COLT), 2023
Shinji Ito
Kei Takemura
164
14
0
24 Feb 2023
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
A Blackbox Approach to Best of Both Worlds in Bandits and BeyondAnnual Conference Computational Learning Theory (COLT), 2023
Christoph Dann
Chen-Yu Wei
Julian Zimmert
244
29
0
20 Feb 2023
Practical Contextual Bandits with Feedback Graphs
Practical Contextual Bandits with Feedback GraphsNeural Information Processing Systems (NeurIPS), 2023
Mengxiao Zhang
Yuheng Zhang
Olga Vrousgou
Haipeng Luo
Paul Mineiro
345
9
0
17 Feb 2023
Infinite Action Contextual Bandits with Reusable Data Exhaust
Infinite Action Contextual Bandits with Reusable Data ExhaustInternational Conference on Machine Learning (ICML), 2023
Mark Rucker
Yinglun Zhu
Paul Mineiro
OffRL
315
2
0
16 Feb 2023
Statistical Complexity and Optimal Algorithms for Non-linear Ridge
  Bandits
Statistical Complexity and Optimal Algorithms for Non-linear Ridge BanditsAnnals of Statistics (Ann. Stat.), 2023
Nived Rajaraman
Yanjun Han
Jiantao Jiao
Kannan Ramchandran
489
4
0
12 Feb 2023
Leveraging User-Triggered Supervision in Contextual Bandits
Leveraging User-Triggered Supervision in Contextual Bandits
Alekh Agarwal
Claudio Gentile
T. V. Marinov
181
0
0
07 Feb 2023
Learning to Generate All Feasible Actions
Learning to Generate All Feasible ActionsIEEE Access (IEEE Access), 2023
Mirco Theile
Daniele Bernardini
Raphael Trumpp
C. Piazza
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
169
3
0
26 Jan 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear
  Contextual Bandits and Markov Decision Processes
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022
Chen Ye
Wei Xiong
Quanquan Gu
Tong Zhang
536
37
0
12 Dec 2022
Contextual Bandits with Packing and Covering Constraints: A Modular
  Lagrangian Approach via Regression
Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via RegressionAnnual Conference Computational Learning Theory (COLT), 2022
Aleksandrs Slivkins
Xingyu Zhou
Karthik Abinav Sankararaman
Dylan J. Foster
314
28
0
14 Nov 2022
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear
  Bandit Algorithms
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit AlgorithmsAnnual Conference Computational Learning Theory (COLT), 2022
Osama A. Hanna
Lin F. Yang
Christina Fragouli
344
17
0
08 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Lifelong Bandit Optimization: No Prior and No RegretConference on Uncertainty in Artificial Intelligence (UAI), 2022
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
328
3
0
27 Oct 2022
Robust Contextual Linear Bandits
Robust Contextual Linear Bandits
Rong Zhu
Branislav Kveton
215
3
0
26 Oct 2022
Deploying a Steered Query Optimizer in Production at Microsoft
Deploying a Steered Query Optimizer in Production at Microsoft
Wangda Zhang
Matteo Interlandi
Paul Mineiro
S. Qiao
Nasim Ghazanfari
Marc T. Friedman
Rafah Hosn
Hiren Patel
Alekh Jindal
142
27
0
24 Oct 2022
Conditionally Risk-Averse Contextual Bandits
Conditionally Risk-Averse Contextual Bandits
Mónika Farsang
Paul Mineiro
Wangda Zhang
264
2
0
24 Oct 2022
Optimal Contextual Bandits with Knapsacks under Realizability via
  Regression Oracles
Optimal Contextual Bandits with Knapsacks under Realizability via Regression OraclesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
302
13
0
21 Oct 2022
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous
  Action Spaces
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action SpacesInternational Conference on Machine Learning (ICML), 2022
Yinglun Zhu
Paul Mineiro
224
18
0
12 Jul 2022
Best of Both Worlds Model Selection
Best of Both Worlds Model SelectionNeural Information Processing Systems (NeurIPS), 2022
Aldo Pacchiano
Christoph Dann
Claudio Gentile
220
11
0
29 Jun 2022
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial
  Corruptions
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial CorruptionsNeural Information Processing Systems (NeurIPS), 2022
Jiafan He
Dongruo Zhou
Tong Zhang
Quanquan Gu
265
53
0
13 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment
  Effect Oracles
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect OraclesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
238
1
0
30 Mar 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning
  Framework
Towards Scalable and Robust Structured Bandits: A Meta-Learning FrameworkInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Runzhe Wan
Linjuan Ge
Rui Song
221
13
0
26 Feb 2022
Damped Online Newton Step for Portfolio Selection
Damped Online Newton Step for Portfolio SelectionAnnual Conference Computational Learning Theory (COLT), 2022
Zakaria Mhammedi
Alexander Rakhlin
130
16
0
15 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret
  for Linear Bandits
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear BanditsAnnual Conference Computational Learning Theory (COLT), 2022
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi Zhou
228
20
0
12 Feb 2022
Pushing the Efficiency-Regret Pareto Frontier for Online Learning of
  Portfolios and Quantum States
Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum StatesAnnual Conference Computational Learning Theory (COLT), 2022
Julian Zimmert
Naman Agarwal
Satyen Kale
149
19
0
06 Feb 2022
Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates
  and Model Misspecification
Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification
Saptarshi Chakraborty
Debolina Paul
Swagatam Das
OOD
256
0
0
06 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under
  Realizability
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Aadirupa Saha
A. Krishnamurthy
284
41
0
24 Nov 2021
Misspecified Gaussian Process Bandit Optimization
Misspecified Gaussian Process Bandit OptimizationNeural Information Processing Systems (NeurIPS), 2021
Ilija Bogunovic
Andreas Krause
209
54
0
09 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m
  Identification
Dealing With Misspecification In Fixed-Confidence Linear Top-m IdentificationNeural Information Processing Systems (NeurIPS), 2021
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
195
10
0
02 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
234
25
0
25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions
Linear Contextual Bandits with Adversarial Corruptions
Heyang Zhao
Dongruo Zhou
Quanquan Gu
AAML
220
24
0
25 Oct 2021
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement
  Learning
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning
Tong Zhang
191
74
0
02 Oct 2021
Distribution-free Contextual Dynamic Pricing
Distribution-free Contextual Dynamic Pricing
Yiyun Luo
W. Sun
Yufeng Liu
388
43
0
15 Sep 2021
Improved Algorithms for Misspecified Linear Markov Decision Processes
Improved Algorithms for Misspecified Linear Markov Decision Processes
Daniel Vial
Advait Parulekar
Sanjay Shakkottai
R. Srikant
201
7
0
12 Sep 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Metadata-based Multi-Task Bandits with Bayesian Hierarchical ModelsNeural Information Processing Systems (NeurIPS), 2021
Runzhe Wan
Linjuan Ge
Rui Song
218
31
0
13 Aug 2021
12
Next
Page 1 of 2