Adapting to Misspecification in Contextual Bandits

12 July 2021

Papers citing "Adapting to Misspecification in Contextual Bandits"

50 / 61 papers shown

Improved Training Mechanism for Reinforcement Learning via Online Model Selection

Aida Afshar

Aldo Pacchiano

01 Dec 2025

A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker ConditionsAnnual Conference Computational Learning Theory (COLT), 2025

100

31 Oct 2025

Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback

115

10 Oct 2025

Non-Linear Model-Based Sequential Decision-Making in Agriculture

Sakshi Arya

Wentao Lin

168

02 Sep 2025

Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient?Conference on Uncertainty in Artificial Intelligence (UAI), 2025

Hwanwoo Kim

Chong Liu

Yuxin Chen

230

13 Jun 2025

Offline-to-online hyperparameter transfer for stochastic banditsAAAI Conference on Artificial Intelligence (AAAI), 2025

Dravyansh Sharma

Arun Sai Suggala

OffRL

300

06 Jan 2025

A Model Selection Approach for Corruption Robust Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2021

Chen-Yu Wei

Christoph Dann

Julian Zimmert

307

31 Dec 2024

Symmetric Linear Bandits with Hidden Symmetry

339

22 May 2024

Diffusion Models Meet Contextual Bandits

Imad Aouali

DiffM

306

15 Feb 2024

Robust Causal Bandits for Linear ModelsIEEE Journal on Selected Areas in Information Theory (JSAIT), 2023

274

30 Oct 2023

Corruption-Robust Offline Reinforcement Learning with General Function ApproximationNeural Information Processing Systems (NeurIPS), 2023

Chen Ye

Rui Yang

Quanquan Gu

Tong Zhang

OffRL

417

23 Oct 2023

Bayesian Design Principles for Frequentist Sequential LearningInternational Conference on Machine Learning (ICML), 2023

Yunbei Xu

A. Zeevi

540

01 Oct 2023

CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic CorruptionInternational Conference on Algorithmic Learning Theory (ALT), 2023

Shubhada Agrawal

Timothée Mathieu

D. Basu

Odalric-Ambrym Maillard

234

28 Sep 2023

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual BanditsNeural Information Processing Systems (NeurIPS), 2023

Haolin Liu

Chen-Yu Wei

Julian Zimmert

250

02 Sep 2023

On the Model-Misspecification in Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Yunfan Li

Lin F. Yang

291

19 Jun 2023

Does Sparsity Help in Learning Misspecified Linear Bandits?International Conference on Machine Learning (ICML), 2023

Jialin Dong

Lin F. Yang

253

29 Mar 2023

On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual BanditsInternational Conference on Machine Learning (ICML), 2023

238

16 Mar 2023

Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret BoundsAnnual Conference Computational Learning Theory (COLT), 2023

Shinji Ito

Kei Takemura

164

24 Feb 2023

A Blackbox Approach to Best of Both Worlds in Bandits and BeyondAnnual Conference Computational Learning Theory (COLT), 2023

Christoph Dann

Chen-Yu Wei

Julian Zimmert

244

20 Feb 2023

Practical Contextual Bandits with Feedback GraphsNeural Information Processing Systems (NeurIPS), 2023

345

17 Feb 2023

Infinite Action Contextual Bandits with Reusable Data ExhaustInternational Conference on Machine Learning (ICML), 2023

315

16 Feb 2023

Statistical Complexity and Optimal Algorithms for Non-linear Ridge BanditsAnnals of Statistics (Ann. Stat.), 2023

489

12 Feb 2023

Leveraging User-Triggered Supervision in Contextual Bandits

Alekh Agarwal

Claudio Gentile

T. V. Marinov

181

07 Feb 2023

Learning to Generate All Feasible ActionsIEEE Access (IEEE Access), 2023

Alberto L. Sangiovanni-Vincentelli

169

26 Jan 2023

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022

Chen Ye

Wei Xiong

Quanquan Gu

Tong Zhang

536

12 Dec 2022

Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via RegressionAnnual Conference Computational Learning Theory (COLT), 2022

Aleksandrs Slivkins

Xingyu Zhou

Karthik Abinav Sankararaman

Dylan J. Foster

314

14 Nov 2022

Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit AlgorithmsAnnual Conference Computational Learning Theory (COLT), 2022

Osama A. Hanna

Lin F. Yang

Christina Fragouli

344

08 Nov 2022

Lifelong Bandit Optimization: No Prior and No RegretConference on Uncertainty in Artificial Intelligence (UAI), 2022

328

27 Oct 2022

Robust Contextual Linear Bandits

Rong Zhu

Branislav Kveton

215

26 Oct 2022

Deploying a Steered Query Optimizer in Production at Microsoft

142

24 Oct 2022

Conditionally Risk-Averse Contextual Bandits

Mónika Farsang

Paul Mineiro

Wangda Zhang

264

24 Oct 2022

Optimal Contextual Bandits with Knapsacks under Realizability via Regression OraclesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

302

21 Oct 2022

Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action SpacesInternational Conference on Machine Learning (ICML), 2022

Yinglun Zhu

Paul Mineiro

224

12 Jul 2022

Best of Both Worlds Model SelectionNeural Information Processing Systems (NeurIPS), 2022

Aldo Pacchiano

Christoph Dann

Claudio Gentile

220

29 Jun 2022

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial CorruptionsNeural Information Processing Systems (NeurIPS), 2022

Jiafan He

Dongruo Zhou

Tong Zhang

Quanquan Gu

265

13 May 2022

Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect OraclesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Aldo G. Carranza

Sanath Kumar Krishnamurthy

Susan Athey

238

30 Mar 2022

Towards Scalable and Robust Structured Bandits: A Meta-Learning FrameworkInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Runzhe Wan

Linjuan Ge

Rui Song

221

26 Feb 2022

Damped Online Newton Step for Portfolio SelectionAnnual Conference Computational Learning Theory (COLT), 2022

Zakaria Mhammedi

Alexander Rakhlin

130

15 Feb 2022

Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear BanditsAnnual Conference Computational Learning Theory (COLT), 2022

228

12 Feb 2022

Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum StatesAnnual Conference Computational Learning Theory (COLT), 2022

Julian Zimmert

Naman Agarwal

Satyen Kale

149

06 Feb 2022

Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification

Saptarshi Chakraborty

Debolina Paul

Swagatam Das

OOD

256

06 Jan 2022

Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability

Aadirupa Saha

A. Krishnamurthy

284

24 Nov 2021

Misspecified Gaussian Process Bandit OptimizationNeural Information Processing Systems (NeurIPS), 2021

Ilija Bogunovic

Andreas Krause

209

09 Nov 2021

Dealing With Misspecification In Fixed-Confidence Linear Top-m IdentificationNeural Information Processing Systems (NeurIPS), 2021

Clémence Réda

Andrea Tirinzoni

Rémy Degenne

195

02 Nov 2021

The Pareto Frontier of model selection for general Contextual Bandits

T. V. Marinov

Julian Zimmert

234

25 Oct 2021

Linear Contextual Bandits with Adversarial Corruptions

Heyang Zhao

Dongruo Zhou

Quanquan Gu

AAML

220

25 Oct 2021

Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning

Tong Zhang

191

02 Oct 2021

Distribution-free Contextual Dynamic Pricing

Yiyun Luo

W. Sun

Yufeng Liu

388

15 Sep 2021

Improved Algorithms for Misspecified Linear Markov Decision Processes

201

12 Sep 2021

Metadata-based Multi-Task Bandits with Bayesian Hierarchical ModelsNeural Information Processing Systems (NeurIPS), 2021

Runzhe Wan

Linjuan Ge

Rui Song

218

13 Aug 2021