Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
1708.01799
Cited By
v1
v2
v3
v4 (latest)
Efficient Contextual Bandits in Non-stationary Worlds
5 August 2017
Haipeng Luo
Chen-Yu Wei
Alekh Agarwal
John Langford
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Efficient Contextual Bandits in Non-stationary Worlds"
50 / 75 papers shown
Title
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits
Shaoang Li
Jian Li
4
0
0
18 Sep 2025
Non-stationary Bandit Convex Optimization: A Comprehensive Study
Xiaoqi Liu
Dorian Baudry
Julian Zimmert
Patrick Rebeschini
Arya Akhavan
104
1
0
03 Jun 2025
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Yunlong Hou
Fengzhuo Zhang
Cunxiao Du
Xuan Zhang
Jiachun Pan
Tianyu Pang
Chao Du
Vincent Y. F. Tan
Zhuoran Yang
OffRL
180
2
0
21 May 2025
Beyond IID: data-driven decision-making in heterogeneous environments
Omar Besbes
Will Ma
Omar Mouchtaki
183
9
0
03 Jan 2025
Improved Regret Bounds for Bandits with Expert Advice
Nicolò Cesa-Bianchi
Khaled Eldowa
Emmanuel Esposito
Julia Olkhovskaya
93
0
0
24 Jun 2024
A Parametric Contextual Online Learning Theory of Brokerage
François Bachoc
Tommaso Cesari
Roberto Colomboni
84
3
0
22 May 2024
Mitigating Biases in Collective Decision-Making: Enhancing Performance in the Face of Fake News
Axel Abels
Elias Fernández Domingos
Ann Nowé
Tom Lenaerts
145
2
0
11 Mar 2024
Near-optimal Per-Action Regret Bounds for Sleeping Bandits
Quan Nguyen
Nishant A. Mehta
131
1
0
02 Mar 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
130
1
0
16 Nov 2023
An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits
Kiarash Banihashem
Mohammadtaghi Hajiaghayi
Suho Shin
Max Springer
154
2
0
29 Oct 2023
A Stability Principle for Learning under Non-Stationarity
Chengpiao Huang
Kaizheng Wang
203
4
0
27 Oct 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
89
0
0
11 Oct 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits
Haolin Liu
Chen-Yu Wei
Julian Zimmert
92
11
0
02 Sep 2023
Online Learning with Costly Features in Non-stationary Environments
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
98
1
0
18 Jul 2023
Tracking Most Significant Shifts in Nonparametric Contextual Bandits
Joe Suk
Samory Kpotufe
148
7
0
11 Jul 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
148
4
0
05 Jul 2023
Non-stationary Reinforcement Learning under General Function Approximation
Songtao Feng
Ming Yin
Ruiquan Huang
Yu Wang
J. Yang
Yitao Liang
87
9
0
01 Jun 2023
Energy Regularized RNNs for Solving Non-Stationary Bandit Problems
Michael Rotman
Lior Wolf
82
1
0
12 Mar 2023
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
144
3
0
04 Mar 2023
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
141
11
0
23 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
91
3
0
16 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints
Yuan Deng
Negin Golrezaei
Patrick Jaillet
Jason Cheuk Nam Liang
Vahab Mirrokni
153
28
0
03 Feb 2023
Quantum contextual bandits and recommender systems for quantum data
Shrigyan Brahmachari
Josep Lumbreras
Marco Tomamichel
87
8
0
31 Jan 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle
Hyunwook Kang
P. R. Kumar
OffRL
97
1
0
29 Jan 2023
Smooth Non-Stationary Bandits
S. Jia
Qian Xie
Nathan Kallus
P. Frazier
220
12
0
29 Jan 2023
Contextual Bandits and Optimistically Universal Learning
Moise Blanchard
Steve Hanneke
Patrick Jaillet
OffRL
119
2
0
31 Dec 2022
Learning to Price Supply Chain Contracts against a Learning Retailer
Xuejun Zhao
Ruihao Zhu
W. Haskell
OffRL
108
1
0
02 Nov 2022
ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits
Thomas Kleine Buening
Aadirupa Saha
98
8
0
25 Oct 2022
Extending Open Bandit Pipeline to Simulate Industry Challenges
Bram van den Akker
N. Weber
Felipe Moraes
Dmitri Goldenberg
OffRL
82
1
0
09 Sep 2022
Decentralized Competing Bandits in Non-Stationary Matching Markets
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
T. Javidi
A. Mazumdar
100
6
0
31 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
226
19
0
04 May 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi Zhou
114
18
0
12 Feb 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit
Yi Xiong
Shuoguang Yang
Hailun Zhang
AAML
139
4
0
05 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Aadirupa Saha
A. Krishnamurthy
143
40
0
24 Nov 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
106
11
0
06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
123
24
0
25 Oct 2021
On Slowly-varying Non-stationary Bandits
Ramakrishnan Krishnamurthy
Médéric Fourmy
87
9
0
25 Oct 2021
Towards the D-Optimal Online Experiment Design for Recommender Selection
Madina Abdrakhmanova
Saniya Abushakimova
Evren Körpeoglu
H. A. Varol
Kannan Achan
143
3
0
23 Oct 2021
Adapting to Misspecification in Contextual Bandits
Dylan J. Foster
Claudio Gentile
M. Mohri
Julian Zimmert
137
90
0
12 Jul 2021
Periodic-GP: Learning Periodic World with Gaussian Process Bandits
Hengrui Cai
Zhihao Cen
Ling Leng
Rui Song
AI4TS
191
6
0
30 May 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution
Chuanhao Li
Qingyun Wu
Hongning Wang
113
6
0
14 Apr 2021
Dynamic Pricing and Learning under the Bass Model
Shipra Agrawal
Steven Yin
A. Zeevi
95
12
0
09 Mar 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
221
116
0
10 Feb 2021
Learning User Preferences in Non-Stationary Environments
Wasim Huleihel
S. Pal
O. Shayevitz
172
13
0
29 Jan 2021
Adversarial Linear Contextual Bandits with Graph-Structured Side Observations
Lingda Wang
Bingcong Li
Huozhi Zhou
G. Giannakis
Lav Varshney
Zhizhen Zhao
96
8
0
10 Dec 2020
Non-Stationary Latent Bandits
Joey Hong
Branislav Kveton
Manzil Zaheer
Yinlam Chow
Amr Ahmed
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
165
14
0
01 Dec 2020
Adversarial Dueling Bandits
Aadirupa Saha
Tomer Koren
Yishay Mansour
142
28
0
27 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
91
1
0
07 Oct 2020
Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control
Weichao Mao
Jianchao Tan
Ruihao Zhu
D. Simchi-Levi
Tamer Bacsar
133
14
0
07 Oct 2020
Learning Product Rankings Robust to Fake Users
Negin Golrezaei
Vahideh H. Manshadi
Jon Schneider
S. Sekar
93
30
0
10 Sep 2020
1
2
Next