Efficient Contextual Bandits in Non-stationary Worlds

5 August 2017

Papers citing "Efficient Contextual Bandits in Non-stationary Worlds"

50 / 75 papers shown

Title
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits Shaoang Li Jian Li 4 0 0 18 Sep 2025
Non-stationary Bandit Convex Optimization: A Comprehensive Study Xiaoqi Liu Dorian Baudry Julian Zimmert Patrick Rebeschini Arya Akhavan 104 1 0 03 Jun 2025
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Yunlong Hou Fengzhuo Zhang Cunxiao Du Xuan Zhang Jiachun Pan Tianyu Pang Chao Du Vincent Y. F. Tan Zhuoran Yang OffRL 180 2 0 21 May 2025
Beyond IID: data-driven decision-making in heterogeneous environments Omar Besbes Will Ma Omar Mouchtaki 183 9 0 03 Jan 2025
Improved Regret Bounds for Bandits with Expert Advice Nicolò Cesa-Bianchi Khaled Eldowa Emmanuel Esposito Julia Olkhovskaya 93 0 0 24 Jun 2024
A Parametric Contextual Online Learning Theory of Brokerage François Bachoc Tommaso Cesari Roberto Colomboni 84 3 0 22 May 2024
Mitigating Biases in Collective Decision-Making: Enhancing Performance in the Face of Fake News Axel Abels Elias Fernández Domingos Ann Nowé Tom Lenaerts 145 2 0 11 Mar 2024
Near-optimal Per-Action Regret Bounds for Sleeping Bandits Quan Nguyen Nishant A. Mehta 131 1 0 02 Mar 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change Aishwarya Mandyam Matthew Joerke William Denton Barbara E. Engelhardt Emma Brunskill 130 1 0 16 Nov 2023
An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits Kiarash Banihashem Mohammadtaghi Hajiaghayi Suho Shin Max Springer 154 2 0 29 Oct 2023
A Stability Principle for Learning under Non-Stationarity Chengpiao Huang Kaizheng Wang 203 4 0 27 Oct 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling Zheqing Zhu Yueyang Liu Xu Kuang Benjamin Van Roy AI4TS 89 0 0 11 Oct 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits Haolin Liu Chen-Yu Wei Julian Zimmert 92 11 0 02 Sep 2023
Online Learning with Costly Features in Non-stationary Environments Saeed Ghoorchian E. Kortukov S. Maghsudi OffRL 98 1 0 18 Jul 2023
Tracking Most Significant Shifts in Nonparametric Contextual Bandits Joe Suk Samory Kpotufe 148 7 0 11 Jul 2023
Meta-Learning Adversarial Bandit Algorithms M. Khodak Ilya Osadchiy Keegan Harris Maria-Florina Balcan Kfir Y. Levy Ron Meir Zhiwei Steven Wu FedML 148 4 0 05 Jul 2023
Non-stationary Reinforcement Learning under General Function Approximation Songtao Feng Ming Yin Ruiquan Huang Yu Wang J. Yang Yitao Liang 87 9 0 01 Jun 2023
Energy Regularized RNNs for Solving Non-Stationary Bandit Problems Michael Rotman Lior Wolf 82 1 0 12 Mar 2023
MNL-Bandit in non-stationary environments Ayoub Foussoul Vineet Goyal Varun Gupta 144 3 0 04 Mar 2023
A Definition of Non-Stationary Bandits Yueyang Liu Kuang Xu Benjamin Van Roy 141 11 0 23 Feb 2023
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 91 3 0 16 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints Yuan Deng Negin Golrezaei Patrick Jaillet Jason Cheuk Nam Liang Vahab Mirrokni 153 28 0 03 Feb 2023
Quantum contextual bandits and recommender systems for quantum data Shrigyan Brahmachari Josep Lumbreras Marco Tomamichel 87 8 0 31 Jan 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle Hyunwook Kang P. R. Kumar OffRL 97 1 0 29 Jan 2023
Smooth Non-Stationary Bandits S. Jia Qian Xie Nathan Kallus P. Frazier 220 12 0 29 Jan 2023
Contextual Bandits and Optimistically Universal Learning Moise Blanchard Steve Hanneke Patrick Jaillet OffRL 119 2 0 31 Dec 2022
Learning to Price Supply Chain Contracts against a Learning Retailer Xuejun Zhao Ruihao Zhu W. Haskell OffRL 108 1 0 02 Nov 2022
ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits Thomas Kleine Buening Aadirupa Saha 98 8 0 25 Oct 2022
Extending Open Bandit Pipeline to Simulate Industry Challenges Bram van den Akker N. Weber Felipe Moraes Dmitri Goldenberg OffRL 82 1 0 09 Sep 2022
Decentralized Competing Bandits in Non-Stationary Matching Markets Avishek Ghosh Abishek Sankararaman Kannan Ramchandran T. Javidi A. Mazumdar 100 6 0 31 May 2022
Non-Stationary Bandit Learning via Predictive Sampling Yueyang Liu Kuang Xu Benjamin Van Roy 226 19 0 04 May 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits Haipeng Luo Mengxiao Zhang Peng Zhao Zhi Zhou 114 18 0 12 Feb 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit Yi Xiong Shuoguang Yang Hailun Zhang AAML 139 4 0 05 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability Aadirupa Saha A. Krishnamurthy 143 40 0 24 Nov 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits Aadirupa Saha Shubham Gupta 106 11 0 06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 123 24 0 25 Oct 2021
On Slowly-varying Non-stationary Bandits Ramakrishnan Krishnamurthy Médéric Fourmy 87 9 0 25 Oct 2021
Towards the D-Optimal Online Experiment Design for Recommender Selection Madina Abdrakhmanova Saniya Abushakimova Evren Körpeoglu H. A. Varol Kannan Achan 143 3 0 23 Oct 2021
Adapting to Misspecification in Contextual Bandits Dylan J. Foster Claudio Gentile M. Mohri Julian Zimmert 137 90 0 12 Jul 2021
Periodic-GP: Learning Periodic World with Gaussian Process Bandits Hengrui Cai Zhihao Cen Ling Leng Rui Song AI4TS 191 6 0 30 May 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution Chuanhao Li Qingyun Wu Hongning Wang 113 6 0 14 Apr 2021
Dynamic Pricing and Learning under the Bass Model Shipra Agrawal Steven Yin A. Zeevi 95 12 0 09 Mar 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach Chen-Yu Wei Haipeng Luo OffRL 221 116 0 10 Feb 2021
Learning User Preferences in Non-Stationary Environments Wasim Huleihel S. Pal O. Shayevitz 172 13 0 29 Jan 2021
Adversarial Linear Contextual Bandits with Graph-Structured Side Observations Lingda Wang Bingcong Li Huozhi Zhou G. Giannakis Lav Varshney Zhizhen Zhao 96 8 0 10 Dec 2020
Non-Stationary Latent Bandits Joey Hong Branislav Kveton Manzil Zaheer Yinlam Chow Amr Ahmed Mohammad Ghavamzadeh Craig Boutilier OffRL 165 14 0 01 Dec 2020
Adversarial Dueling Bandits Aadirupa Saha Tomer Koren Yishay Mansour 142 28 0 27 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization Mack Sweeney M. Adelsberg Kathryn B. Laskey C. Domeniconi 91 1 0 07 Oct 2020
Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control Weichao Mao Jianchao Tan Ruihao Zhu D. Simchi-Levi Tamer Başar 133 14 0 07 Oct 2020
Learning Product Rankings Robust to Fake Users Negin Golrezaei Vahideh H. Manshadi Jon Schneider S. Sekar 93 30 0 10 Sep 2020