ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1405.3316
  4. Cited By
Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with
  Non-stationary Rewards

Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards

13 May 2014
Omar Besbes
Y. Gur
A. Zeevi
ArXivPDFHTML

Papers citing "Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards"

20 / 20 papers shown
Title
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context
Jianyu Xu
Qiuzhuang Sun
Yang Yang
Huadong Mo
Daoyi Dong
83
0
0
24 Feb 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
60
0
0
31 Jan 2025
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
28
0
0
15 Mar 2024
Continual Learning as Computationally Constrained Reinforcement Learning
Continual Learning as Computationally Constrained Reinforcement Learning
Saurabh Kumar
Henrik Marklund
Anand Srinivasa Rao
Yifan Zhu
Hong Jun Jeon
Yueyang Liu
Benjamin Van Roy
CLL
27
22
0
10 Jul 2023
Competing Bandits in Time Varying Matching Markets
Competing Bandits in Time Varying Matching Markets
Deepan Muthirayan
C. Maheshwari
Pramod P. Khargonekar
S. Shankar Sastry
33
1
0
21 Oct 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
29
11
0
12 Jul 2022
Non-Stationary Bandit Learning via Predictive Sampling
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Towards Futuristic Autonomous Experimentation--A Surprise-Reacting
  Sequential Experiment Policy
Towards Futuristic Autonomous Experimentation--A Surprise-Reacting Sequential Experiment Policy
Imtiaz Ahmed
Satish Bukkapatnam
Bhaskar Botcha
Yucheng Ding
41
5
0
01 Dec 2021
On Slowly-varying Non-stationary Bandits
On Slowly-varying Non-stationary Bandits
Ramakrishnan Krishnamurthy
Médéric Fourmy
24
8
0
25 Oct 2021
Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits
Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits
Junpei Komiyama
Edouard Fouché
Junya Honda
33
5
0
23 Jul 2021
Addressing the Long-term Impact of ML Decisions via Policy Regret
Addressing the Long-term Impact of ML Decisions via Policy Regret
David Lindner
Hoda Heidari
Andreas Krause
OffRL
23
6
0
02 Jun 2021
A Simple Approach for Non-stationary Linear Bandits
A Simple Approach for Non-stationary Linear Bandits
Peng Zhao
Lijun Zhang
Yuan Jiang
Zhi-Hua Zhou
36
81
0
09 Mar 2021
Dynamic Pricing and Learning under the Bass Model
Dynamic Pricing and Learning under the Bass Model
Shipra Agrawal
Steven Yin
A. Zeevi
21
11
0
09 Mar 2021
Bayesian adversarial multi-node bandit for optimal smart grid protection
  against cyber attacks
Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks
Jianyu Xu
Bin Liu
H. Mo
D. Dong
AAML
16
22
0
20 Feb 2021
Regression Oracles and Exploration Strategies for Short-Horizon
  Multi-Armed Bandits
Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits
Robert C. Gray
Jichen Zhu
Santiago Ontañón
26
7
0
10 Feb 2021
A Change-Detection Based Thompson Sampling Framework for Non-Stationary
  Bandits
A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits
Gourab Ghatak
23
17
0
06 Sep 2020
Weighted Linear Bandits for Non-Stationary Environments
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
82
101
0
19 Sep 2019
Hedging the Drift: Learning to Optimize under Non-Stationarity
Hedging the Drift: Learning to Optimize under Non-Stationarity
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
29
89
0
04 Mar 2019
Learning to Optimize under Non-Stationarity
Learning to Optimize under Non-Stationarity
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
36
133
0
06 Oct 2018
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
Lai Wei
Vaibhav Srivastava
24
37
0
23 Feb 2018
1