Papers citing 'Adapting to Misspecification in Contextual Bandits'

Title
Improved Training Mechanism for Reinforcement Learning via Online Model Selection Aida Afshar Aldo Pacchiano 40 0 0 01 Dec 2025
A Polynomial-time Algorithm for Online Sparse Linear Regression with Improved Regret Bound under Weaker ConditionsAnnual Conference Computational Learning Theory (COLT), 2025 Junfan Li Shizhong Liao Zenglin Xu L. Nie 80 0 0 31 Oct 2025
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback Orin Levy Liad Erez Alon Cohen Yishay Mansour 80 0 0 10 Oct 2025
Non-Linear Model-Based Sequential Decision-Making in Agriculture Sakshi Arya Wentao Lin 124 0 0 02 Sep 2025
Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient?Conference on Uncertainty in Artificial Intelligence (UAI), 2025 Hwanwoo Kim Chong Liu Yuxin Chen 185 2 0 13 Jun 2025
Offline-to-online hyperparameter transfer for stochastic banditsAAAI Conference on Artificial Intelligence (AAAI), 2025 Dravyansh Sharma Arun Sai Suggala OffRL 279 8 0 06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2021 Chen-Yu Wei Christoph Dann Julian Zimmert 281 48 0 31 Dec 2024
Symmetric Linear Bandits with Hidden Symmetry Nam-Phuong Tran T. Ta Debmalya Mandal Long Tran-Thanh 306 1 0 22 May 2024
Diffusion Models Meet Contextual Bandits Imad Aouali DiffM 256 5 0 15 Feb 2024
Robust Causal Bandits for Linear ModelsIEEE Journal on Selected Areas in Information Theory (JSAIT), 2023 Zirui Yan Arpan Mukherjee Burak Varici A. Tajer CML 227 4 0 30 Oct 2023
Corruption-Robust Offline Reinforcement Learning with General Function ApproximationNeural Information Processing Systems (NeurIPS), 2023 Chen Ye Rui Yang Quanquan Gu Tong Zhang OffRL 380 29 0 23 Oct 2023
Bayesian Design Principles for Frequentist Sequential LearningInternational Conference on Machine Learning (ICML), 2023 Yunbei Xu A. Zeevi 474 16 0 01 Oct 2023
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic CorruptionInternational Conference on Algorithmic Learning Theory (ALT), 2023 Shubhada Agrawal Timothée Mathieu D. Basu Odalric-Ambrym Maillard 185 3 0 28 Sep 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual BanditsNeural Information Processing Systems (NeurIPS), 2023 Haolin Liu Chen-Yu Wei Julian Zimmert 236 11 0 02 Sep 2023
On the Model-Misspecification in Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Yunfan Li Lin F. Yang 262 6 0 19 Jun 2023
Does Sparsity Help in Learning Misspecified Linear Bandits?International Conference on Machine Learning (ICML), 2023 Jialin Dong Lin F. Yang 213 2 0 29 Mar 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual BanditsInternational Conference on Machine Learning (ICML), 2023 Weitong Zhang Jiafan He Zhiyuan Fan Q. Gu 217 6 0 16 Mar 2023
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret BoundsAnnual Conference Computational Learning Theory (COLT), 2023 Shinji Ito Kei Takemura 142 13 0 24 Feb 2023
A Blackbox Approach to Best of Both Worlds in Bandits and BeyondAnnual Conference Computational Learning Theory (COLT), 2023 Christoph Dann Chen-Yu Wei Julian Zimmert 213 28 0 20 Feb 2023
Practical Contextual Bandits with Feedback GraphsNeural Information Processing Systems (NeurIPS), 2023 Mengxiao Zhang Yuheng Zhang Olga Vrousgou Haipeng Luo Paul Mineiro 277 9 0 17 Feb 2023
Infinite Action Contextual Bandits with Reusable Data ExhaustInternational Conference on Machine Learning (ICML), 2023 Mark Rucker Yinglun Zhu Paul Mineiro OffRL 274 2 0 16 Feb 2023
Statistical Complexity and Optimal Algorithms for Non-linear Ridge BanditsAnnals of Statistics (Ann. Stat.), 2023 Nived Rajaraman Yanjun Han Jiantao Jiao Kannan Ramchandran 405 3 0 12 Feb 2023
Leveraging User-Triggered Supervision in Contextual Bandits Alekh Agarwal Claudio Gentile T. V. Marinov 157 0 0 07 Feb 2023
Learning to Generate All Feasible ActionsIEEE Access (IEEE Access), 2023 Mirco Theile Daniele Bernardini Raphael Trumpp C. Piazza Marco Caccamo Alberto L. Sangiovanni-Vincentelli 142 3 0 26 Jan 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022 Chen Ye Wei Xiong Quanquan Gu Tong Zhang 468 37 0 12 Dec 2022
Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via RegressionAnnual Conference Computational Learning Theory (COLT), 2022 Aleksandrs Slivkins Xingyu Zhou Karthik Abinav Sankararaman Dylan J. Foster 263 28 0 14 Nov 2022
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit AlgorithmsAnnual Conference Computational Learning Theory (COLT), 2022 Osama A. Hanna Lin F. Yang Christina Fragouli 268 17 0 08 Nov 2022
Lifelong Bandit Optimization: No Prior and No RegretConference on Uncertainty in Artificial Intelligence (UAI), 2022 Felix Schur Parnian Kassraie Jonas Rothfuss Andreas Krause 282 3 0 27 Oct 2022
Robust Contextual Linear Bandits Rong Zhu Branislav Kveton 192 3 0 26 Oct 2022
Deploying a Steered Query Optimizer in Production at Microsoft Wangda Zhang Matteo Interlandi Paul Mineiro S. Qiao Nasim Ghazanfari Marc T. Friedman Rafah Hosn Hiren Patel Alekh Jindal 128 27 0 24 Oct 2022
Conditionally Risk-Averse Contextual Bandits Mónika Farsang Paul Mineiro Wangda Zhang 204 2 0 24 Oct 2022
Optimal Contextual Bandits with Knapsacks under Realizability via Regression OraclesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022 Yuxuan Han Jialin Zeng Yang Wang Yangzhen Xiang Jiheng Zhang 277 13 0 21 Oct 2022
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action SpacesInternational Conference on Machine Learning (ICML), 2022 Yinglun Zhu Paul Mineiro 190 18 0 12 Jul 2022
Best of Both Worlds Model SelectionNeural Information Processing Systems (NeurIPS), 2022 Aldo Pacchiano Christoph Dann Claudio Gentile 192 11 0 29 Jun 2022
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial CorruptionsNeural Information Processing Systems (NeurIPS), 2022 Jiafan He Dongruo Zhou Tong Zhang Quanquan Gu 233 53 0 13 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect OraclesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022 Aldo G. Carranza Sanath Kumar Krishnamurthy Susan Athey 217 1 0 30 Mar 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning FrameworkInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022 Runzhe Wan Linjuan Ge Rui Song 207 13 0 26 Feb 2022
Damped Online Newton Step for Portfolio SelectionAnnual Conference Computational Learning Theory (COLT), 2022 Zakaria Mhammedi Alexander Rakhlin 110 16 0 15 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear BanditsAnnual Conference Computational Learning Theory (COLT), 2022 Haipeng Luo Mengxiao Zhang Peng Zhao Zhi Zhou 198 20 0 12 Feb 2022
Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum StatesAnnual Conference Computational Learning Theory (COLT), 2022 Julian Zimmert Naman Agarwal Satyen Kale 132 19 0 06 Feb 2022
Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification Saptarshi Chakraborty Debolina Paul Swagatam Das OOD 216 0 0 06 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability Aadirupa Saha A. Krishnamurthy 259 41 0 24 Nov 2021
Misspecified Gaussian Process Bandit OptimizationNeural Information Processing Systems (NeurIPS), 2021 Ilija Bogunovic Andreas Krause 181 53 0 09 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m IdentificationNeural Information Processing Systems (NeurIPS), 2021 Clémence Réda Andrea Tirinzoni Rémy Degenne 165 10 0 02 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 211 25 0 25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions Heyang Zhao Dongruo Zhou Quanquan Gu AAML 203 24 0 25 Oct 2021
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning Tong Zhang 174 72 0 02 Oct 2021
Distribution-free Contextual Dynamic Pricing Yiyun Luo W. Sun Yufeng Liu 351 41 0 15 Sep 2021
Improved Algorithms for Misspecified Linear Markov Decision Processes Daniel Vial Advait Parulekar Sanjay Shakkottai R. Srikant 172 7 0 12 Sep 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical ModelsNeural Information Processing Systems (NeurIPS), 2021 Runzhe Wan Linjuan Ge Rui Song 193 31 0 13 Aug 2021