Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
1805.09365
Cited By
Learning Contextual Bandits in a Non-stationary Environment
23 May 2018
Qingyun Wu
Naveen Iyer
Hongning Wang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Contextual Bandits in a Non-stationary Environment"
26 / 26 papers shown
Title
Influencing Bandits: Arm Selection for Preference Shaping
Viraj Nadkarni
D. Manjunath
Sharayu Moharir
45
0
0
29 Feb 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
130
1
0
16 Nov 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
145
0
0
07 Jul 2023
Discounted Thompson Sampling for Non-Stationary Bandit Problems
Han Qi
Yue Wang
Li Zhu
77
3
0
18 May 2023
Disentangled Representation for Diversified Recommendations
Xiaoying Zhang
Hongning Wang
Hang Li
CML
92
14
0
13 Jan 2023
Contextual Bandits and Optimistically Universal Learning
Moise Blanchard
Steve Hanneke
Patrick Jaillet
OffRL
119
2
0
31 Dec 2022
ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits
Thomas Kleine Buening
Aadirupa Saha
98
8
0
25 Oct 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
129
14
0
12 Jul 2022
Dynamic Causal Bayesian Optimization
Virginia Aglietti
Neil Dhir
Javier I. González
Theodoros Damoulas
92
28
0
26 Oct 2021
On Limited-Memory Subsampling Strategies for Bandits
Dorian Baudry
Yoan Russac
Olivier Cappé
143
8
0
21 Jun 2021
Periodic-GP: Learning Periodic World with Gaussian Process Bandits
Hengrui Cai
Zhihao Cen
Ling Leng
Rui Song
AI4TS
191
6
0
30 May 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution
Chuanhao Li
Qingyun Wu
Hongning Wang
113
6
0
14 Apr 2021
Lifelong Learning in Multi-Armed Bandits
Matthieu Jedor
Jonathan Louëdec
Vianney Perchet
97
2
0
28 Dec 2020
Non-Stationary Latent Bandits
Joey Hong
Branislav Kveton
Manzil Zaheer
Yinlam Chow
Amr Ahmed
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
165
14
0
01 Dec 2020
Unifying Clustered and Non-stationary Bandits
Chuanhao Li
Qingyun Wu
Hongning Wang
123
13
0
05 Sep 2020
Self-Tuning Bandits over Unknown Covariate-Shifts
Joe Suk
Samory Kpotufe
209
10
0
16 Jul 2020
Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users
Shijun Li
Wenqiang Lei
Qingyun Wu
Xiangnan He
Peng Jiang
Tat-Seng Chua
243
127
0
23 May 2020
A Linear Bandit for Seasonal Environments
Giuseppe Di Benedetto
Vito Bellini
Giovanni Zappella
58
7
0
28 Apr 2020
Algorithms for Non-Stationary Generalized Linear Bandits
Yoan Russac
Olivier Cappé
Aurélien Garivier
122
25
0
23 Mar 2020
Contextual-Bandit Based Personalized Recommendation with Time-Varying User Interests
X. Xu
Fang Dong
Yanghua Li
Shaojian He
Xuzhao Li
75
40
0
29 Feb 2020
Multiscale Non-stationary Stochastic Bandits
Qin Ding
Cho-Jui Hsieh
James Sharpnack
51
0
0
13 Feb 2020
Fair Contextual Multi-Armed Bandits: Theory and Experiments
Yifang Chen
Alex Cuellar
Haipeng Luo
Jignesh Modi
Heramb Nemlekar
Stefanos Nikolaidis
FaML
121
62
0
13 Dec 2019
Randomized Exploration for Non-Stationary Stochastic Linear Bandits
Baekjin Kim
Ambuj Tewari
228
19
0
11 Dec 2019
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
215
108
0
19 Sep 2019
Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model
Chang Li
Maarten de Rijke
101
17
0
29 May 2019
Deep reinforcement learning for search, recommendation, and online advertising: a survey
Xiangyu Zhao
Long Xia
Jiliang Tang
D. Yin
OffRL
148
89
0
18 Dec 2018
1