
A Simple Approach for Non-stationary Linear Bandits

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020

9 March 2021

Papers citing "A Simple Approach for Non-stationary Linear Bandits"

50 / 58 papers shown

Title
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits Shaoang Li Jian Li 104 0 0 18 Sep 2025
Finite-Time Guarantees for Multi-Agent Combinatorial Bandits with Nonstationary Rewards Katherine Adams J. Boutilier Qinyang He Yonatan Dov Mintz OffRL 80 0 0 28 Aug 2025
Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of Plasticity. Knowledge Discovery and Data Mining (KDD), 2025 Zhiyuan Su Sunhao Dai Xiao Zhang 202 0 0 14 Jun 2025
Quick-Draw Bandits: Quickly Optimizing in Nonstationary Environments with Extremely Many Arms. Knowledge Discovery and Data Mining (KDD), 2025 Derek Everett Fred Lu Edward Raff Fernando Camacho James Holt 266 0 0 30 May 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System J. Gornet Yilin Mo Bruno Sinopoli 157 1 0 04 Apr 2025
Heavy-Tailed Linear Bandits: Huber Regression with One-Pass Update Jing Wang Yu Zhang Peng Zhao Zhi Zhou 313 1 0 01 Mar 2025
Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization Linfeng Cao Ming Shi Ness B. Shroff 157 0 0 19 Feb 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits Joe Suk Jung-hun Kim 320 1 0 31 Jan 2025
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs. Neural Information Processing Systems (NeurIPS), 2024 Long-Fei Li Peng Zhao Zhi Zhou 224 2 0 05 Nov 2024
Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits Kuan-Ta Li Ping-Chun Hsieh Yu-Chih Huang 149 2 0 08 Oct 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift Seongho Son William Bankes Sayak Ray Chowdhury Brooks Paige Ilija Bogunovic 381 8 0 26 Jul 2024
Causally Abstracted Multi-armed Bandits Fabio Massimo Zennaro Nicholas Bishop Joel Dyer Yorgos Felekis Anisoara Calinescu Michael Wooldridge Theodoros Damoulas 287 6 0 26 Apr 2024
Adaptive Memory Replay for Continual Learning James Seale Smith Lazar Valkov Shaunak Halbe V. Gutta Rogerio Feris Z. Kira Leonid Karlinsky 167 15 0 18 Apr 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits Zhiyong Wang Jize Xie Yi Chen J. C. Lui Dongruo Zhou 253 1 0 15 Mar 2024
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling Zheqing Zhu Yueyang Liu Xu Kuang Benjamin Van Roy AI4TS 217 1 0 11 Oct 2023
Dynamic Embedding Size Search with Minimum Regret for Streaming Recommender System. International Conference on Information and Knowledge Management (CIKM), 2023 Bowei He Xu He Renrui Zhang Yingxue Zhang Ruiming Tang Chen Ma AI4TS 130 13 0 15 Aug 2023
Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits Nicklas Werge Yi-Shan Wu Abdullah Akgul M. Kandemir 301 0 0 07 Jul 2023
A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning. International Conference on Learning Representations (ICLR), 2023 Haozhe Jiang Qiwen Cui Zhihan Xiong Maryam Fazel S. Du 195 6 0 12 Jun 2023
Non-stationary Reinforcement Learning under General Function Approximation. International Conference on Machine Learning (ICML), 2023 Songtao Feng Ming Yin Ruiquan Huang Yu Wang J. Yang Yitao Liang 135 9 0 01 Jun 2023
Learning to Seek: Multi-Agent Online Source Seeking Against Non-Stochastic Disturbances Bin Du Kun Qian Christian G. Claudel Dengfeng Sun 281 0 0 29 Apr 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits. International Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Jing Wang Peng Zhao Zhihong Zhou 193 9 0 05 Mar 2023
MNL-Bandit in non-stationary environments Ayoub Foussoul Vineet Goyal Varun Gupta 261 3 0 04 Mar 2023
A Definition of Non-Stationary Bandits Yueyang Liu Kuang Xu Benjamin Van Roy 273 11 0 23 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Yue Kang Cho-Jui Hsieh T. C. Lee 236 2 0 18 Feb 2023
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 135 3 0 16 Feb 2023
Adapting to Continuous Covariate Shift via Online Density Ratio Estimation. Neural Information Processing Systems (NeurIPS), 2023 Yu Zhang Zhenyu Zhang Peng Zhao Masashi Sugiyama OOD 265 16 0 06 Feb 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle. Allerton Conference on Communication, Control, and Computing (Allerton), 2023 Hyunwook Kang P. R. Kumar OffRL 157 1 0 29 Jan 2023
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints. Conference on Learning for Dynamics & Control (L4DC), 2022 Heng Guo Qi Zhu Xin Liu 249 15 0 27 Nov 2022
Non-stationary Risk-sensitive Reinforcement Learning: Near-optimal Dynamic Regret, Adaptive Detection, and Separation Design. AAAI Conference on Artificial Intelligence (AAAI), 2022 Yuhao Ding Ming Jin Javad Lavaei 116 7 0 19 Nov 2022
Competing Bandits in Time Varying Matching Markets. Conference on Learning for Dynamics & Control (L4DC), 2022 Deepan Muthirayan C. Maheshwari Pramod P. Khargonekar S. Shankar Sastry 203 5 0 21 Oct 2022
Dynamic Regret of Online Markov Decision Processes. International Conference on Machine Learning (ICML), 2022 Peng Zhao Longfei Li Zhi Zhou OffRL 184 19 0 26 Aug 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning C. Steinparz Thomas Schmied Fabian Paischer Marius-Constantin Dinu Vihang Patil Angela Bitto-Nemling Hamid Eghbalzadeh Sepp Hochreiter CLL 224 16 0 12 Jul 2022
Recent Advances in Bayesian Optimization. ACM Computing Surveys (ACM CSUR), 2022 Xilu Wang Yaochu Jin Sebastian Schmitt Markus Olhofer 227 377 0 07 Jun 2022
Open-environment Machine Learning. National Science Review (NSR), 2022 Zhi Zhou VLM 332 168 0 01 Jun 2022
Non-Stationary Bandit Learning via Predictive Sampling. International Conference on Artificial Intelligence and Statistics (AISTATS), 2022 Yueyang Liu Kuang Xu Benjamin Van Roy 342 21 0 04 May 2022
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems Saeed Ghoorchian E. Kortukov S. Maghsudi OffRL 174 0 0 07 Feb 2022
Rotting Infinitely Many-armed Bandits. International Conference on Machine Learning (ICML), 2022 Jung-hun Kim Milan Vojnović Se-Young Yun 212 5 0 31 Jan 2022
Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization. Journal of Machine Learning Research (JMLR), 2021 Peng Zhao Yu Zhang Lijun Zhang Zhi Zhou 289 75 0 29 Dec 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 203 25 0 25 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs Han Zhong Zhuoran Yang Zhaoran Wang Csaba Szepesvári 295 21 0 18 Oct 2021
Distribution-free Contextual Dynamic Pricing Yiyun Luo W. Sun Yufeng Liu 335 41 0 15 Sep 2021
Weighted Gaussian Process Bandits for Non-stationary Environments Yuntian Deng Xingyu Zhou Baekjin Kim Ambuj Tewari Abhishek Gupta Ness B. Shroff 208 29 0 06 Jul 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution. Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021 Chuanhao Li Qingyun Wu Hongning Wang 161 6 0 14 Apr 2021
Regret Bounds for Generalized Linear Bandits under Parameter Drift Louis Faury Yoan Russac Marc Abeille Clément Calauzènes 150 12 0 09 Mar 2021
No-Regret Algorithms for Time-Varying Bayesian Optimization. Annual Conference on Information Sciences and Systems (CISS), 2021 Xingyu Zhou Ness B. Shroff 127 24 0 11 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach. Annual Conference Computational Learning Theory (COLT), 2021 Chen-Yu Wei Haipeng Luo OffRL 385 121 0 10 Feb 2021
Non-stationary Online Learning with Memory and Non-stochastic Control. International Conference on Artificial Intelligence and Statistics (AISTATS), 2021 Peng Zhao Yu-Hu Yan Yu Wang Zhi Zhou 460 50 0 07 Feb 2021
Learning User Preferences in Non-Stationary Environments. International Conference on Artificial Intelligence and Statistics (AISTATS), 2021 Wasim Huleihel S. Pal O. Shayevitz 228 13 0 29 Jan 2021
Non-Stationary Latent Bandits Joey Hong Branislav Kveton Manzil Zaheer Yinlam Chow Amr Ahmed Mohammad Ghavamzadeh Craig Boutilier OffRL 307 15 0 01 Dec 2020
Self-Concordant Analysis of Generalized Linear Bandits with Forgetting. International Conference on Artificial Intelligence and Statistics (AISTATS), 2020 Yoan Russac Louis Faury Olivier Cappé Aurélien Garivier 234 19 0 02 Nov 2020