Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1405.3316
Cited By
Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards
13 May 2014
Omar Besbes
Y. Gur
A. Zeevi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards"
20 / 20 papers shown
Title
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context
Jianyu Xu
Qiuzhuang Sun
Yang Yang
Huadong Mo
Daoyi Dong
83
0
0
24 Feb 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
60
0
0
31 Jan 2025
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
28
0
0
15 Mar 2024
Continual Learning as Computationally Constrained Reinforcement Learning
Saurabh Kumar
Henrik Marklund
Anand Srinivasa Rao
Yifan Zhu
Hong Jun Jeon
Yueyang Liu
Benjamin Van Roy
CLL
27
22
0
10 Jul 2023
Competing Bandits in Time Varying Matching Markets
Deepan Muthirayan
C. Maheshwari
Pramod P. Khargonekar
S. Shankar Sastry
33
1
0
21 Oct 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Towards Futuristic Autonomous Experimentation--A Surprise-Reacting Sequential Experiment Policy
Imtiaz Ahmed
Satish Bukkapatnam
Bhaskar Botcha
Yucheng Ding
41
5
0
01 Dec 2021
On Slowly-varying Non-stationary Bandits
Ramakrishnan Krishnamurthy
Médéric Fourmy
24
8
0
25 Oct 2021
Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits
Junpei Komiyama
Edouard Fouché
Junya Honda
33
5
0
23 Jul 2021
Addressing the Long-term Impact of ML Decisions via Policy Regret
David Lindner
Hoda Heidari
Andreas Krause
OffRL
23
7
0
02 Jun 2021
A Simple Approach for Non-stationary Linear Bandits
Peng Zhao
Lijun Zhang
Yuan Jiang
Zhi-Hua Zhou
33
81
0
09 Mar 2021
Dynamic Pricing and Learning under the Bass Model
Shipra Agrawal
Steven Yin
A. Zeevi
21
11
0
09 Mar 2021
Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks
Jianyu Xu
Bin Liu
H. Mo
D. Dong
AAML
16
22
0
20 Feb 2021
Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits
Robert C. Gray
Jichen Zhu
Santiago Ontañón
26
7
0
10 Feb 2021
A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits
Gourab Ghatak
23
17
0
06 Sep 2020
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
82
101
0
19 Sep 2019
Hedging the Drift: Learning to Optimize under Non-Stationarity
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
26
89
0
04 Mar 2019
Learning to Optimize under Non-Stationarity
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
36
132
0
06 Oct 2018
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
Lai Wei
Vaibhav Srivastava
24
37
0
23 Feb 2018
1