Learning Contextual Bandits in a Non-stationary Environment

23 May 2018

Papers citing "Learning Contextual Bandits in a Non-stationary Environment"

Influencing Bandits: Arm Selection for Preference Shaping Viraj Nadkarni D. Manjunath Sharayu Moharir 45 0 0 29 Feb 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change Aishwarya Mandyam Matthew Joerke William Denton Barbara E. Engelhardt Emma Brunskill 130 1 0 16 Nov 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits Nicklas Werge Abdullah Akgul M. Kandemir 145 0 0 07 Jul 2023
Discounted Thompson Sampling for Non-Stationary Bandit Problems Han Qi Yue Wang Li Zhu 77 3 0 18 May 2023
Disentangled Representation for Diversified Recommendations Xiaoying Zhang Hongning Wang Hang Li CML 92 14 0 13 Jan 2023
Contextual Bandits and Optimistically Universal Learning Moise Blanchard Steve Hanneke Patrick Jaillet OffRL 119 2 0 31 Dec 2022
ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits Thomas Kleine Buening Aadirupa Saha 98 8 0 25 Oct 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning C. Steinparz Thomas Schmied Fabian Paischer Marius-Constantin Dinu Vihang Patil Angela Bitto-Nemling Hamid Eghbalzadeh Sepp Hochreiter CLL 129 14 0 12 Jul 2022
Dynamic Causal Bayesian Optimization Virginia Aglietti Neil Dhir Javier I. González Theodoros Damoulas 92 28 0 26 Oct 2021
On Limited-Memory Subsampling Strategies for Bandits Dorian Baudry Yoan Russac Olivier Cappé 143 8 0 21 Jun 2021
Periodic-GP: Learning Periodic World with Gaussian Process Bandits Hengrui Cai Zhihao Cen Ling Leng Rui Song AI4TS 191 6 0 30 May 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution Chuanhao Li Qingyun Wu Hongning Wang 113 6 0 14 Apr 2021
Lifelong Learning in Multi-Armed Bandits Matthieu Jedor Jonathan Louëdec Vianney Perchet 97 2 0 28 Dec 2020
Non-Stationary Latent Bandits Joey Hong Branislav Kveton Manzil Zaheer Yinlam Chow Amr Ahmed Mohammad Ghavamzadeh Craig Boutilier OffRL 165 14 0 01 Dec 2020
Unifying Clustered and Non-stationary Bandits Chuanhao Li Qingyun Wu Hongning Wang 123 13 0 05 Sep 2020
Self-Tuning Bandits over Unknown Covariate-Shifts Joe Suk Samory Kpotufe 209 10 0 16 Jul 2020
Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users Shijun Li Wenqiang Lei Qingyun Wu Xiangnan He Peng Jiang Tat-Seng Chua 243 127 0 23 May 2020
A Linear Bandit for Seasonal Environments Giuseppe Di Benedetto Vito Bellini Giovanni Zappella 58 7 0 28 Apr 2020
Algorithms for Non-Stationary Generalized Linear Bandits Yoan Russac Olivier Cappé Aurélien Garivier 122 25 0 23 Mar 2020
Contextual-Bandit Based Personalized Recommendation with Time-Varying User Interests X. Xu Fang Dong Yanghua Li Shaojian He Xuzhao Li 75 40 0 29 Feb 2020
Multiscale Non-stationary Stochastic Bandits Qin Ding Cho-Jui Hsieh James Sharpnack 51 0 0 13 Feb 2020
Fair Contextual Multi-Armed Bandits: Theory and Experiments Yifang Chen Alex Cuellar Haipeng Luo Jignesh Modi Heramb Nemlekar Stefanos Nikolaidis FaML 121 62 0 13 Dec 2019
Randomized Exploration for Non-Stationary Stochastic Linear Bandits Baekjin Kim Ambuj Tewari 228 19 0 11 Dec 2019
Weighted Linear Bandits for Non-Stationary Environments Yoan Russac Claire Vernade Olivier Cappé 215 108 0 19 Sep 2019
Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model Chang Li Maarten de Rijke 101 17 0 29 May 2019
Deep reinforcement learning for search, recommendation, and online advertising: a survey Xiangyu Zhao Long Xia Jiliang Tang D. Yin OffRL 148 89 0 18 Dec 2018