ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.02664
  4. Cited By
Restless-UCB, an Efficient and Low-complexity Algorithm for Online
  Restless Bandits

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

5 November 2020
Siwei Wang
Longbo Huang
John C. S. Lui
    OffRL
ArXivPDFHTML

Papers citing "Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits"

19 / 19 papers shown
Title
On the Low-Complexity of Fair Learning for Combinatorial Multi-Armed Bandit
On the Low-Complexity of Fair Learning for Combinatorial Multi-Armed Bandit
Xiaoyi Wu
Bo Ji
Bin Li
FaML
46
0
0
01 Jan 2025
DOPL: Direct Online Preference Learning for Restless Bandits with
  Preference Feedback
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
42
2
0
07 Oct 2024
Whittle Index Learning Algorithms for Restless Bandits with Constant
  Stepsizes
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
Vishesh Mittal
R. Meshram
Surya Prakash
27
0
0
06 Sep 2024
A Federated Online Restless Bandit Framework for Cooperative Resource
  Allocation
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
Jingwen Tong
Xinran Li
Liqun Fu
Jun Zhang
Khaled B. Letaief
44
1
0
12 Jun 2024
Tabular and Deep Learning for the Whittle Index
Tabular and Deep Learning for the Whittle Index
Francisco Robledo Relaño
Vivek Borkar
U. Ayesta
Konstantin Avrachenkov
26
2
0
04 Jun 2024
Restless Bandit Problem with Rewards Generated by a Linear Gaussian
  Dynamical System
Restless Bandit Problem with Rewards Generated by a Linear Gaussian Dynamical System
J. Gornet
Bruno Sinopoli
33
0
0
15 May 2024
Provably Efficient Reinforcement Learning for Adversarial Restless
  Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Guojun Xiong
Jian Li
25
1
0
02 May 2024
Structured Reinforcement Learning for Delay-Optimal Data Transmission in
  Dense mmWave Networks
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks
Shu-Fan Wang
Guojun Xiong
Shichen Zhang
Huacheng Zeng
Jian Li
Shivendra Panwar
21
0
0
25 Apr 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
51
6
0
16 Dec 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
30
12
0
03 Oct 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Policy Optimization for Personalized Interventions in Behavioral Health
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
9
7
0
21 Mar 2023
Approximately Stationary Bandits with Knapsacks
Approximately Stationary Bandits with Knapsacks
Giannis Fikioris
Éva Tardos
AAML
13
7
0
28 Feb 2023
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless
  Multi-Arm Bandits
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits
Paritosh Verma
Shresth Verma
Aditya Mate
Aparna Taneja
Milind Tambe
16
0
0
19 Jan 2023
Stochastic Rising Bandits
Stochastic Rising Bandits
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
17
16
0
07 Dec 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
39
16
0
30 May 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear
  Function Approximation
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
Guojun Xiong
Shu-Fan Wang
Jian Li
Rahul Singh
25
6
0
26 Feb 2022
Reinforcement Learning for Finite-Horizon Restless Multi-Armed
  Multi-Action Bandits
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Guojun Xiong
Jian Li
Rahul Singh
27
4
0
20 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep
  Multi-Agent Reinforcement Learning
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning
J. Killian
Lily Xu
Arpita Biswas
Milind Tambe
19
6
0
04 Jul 2021
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more
  Scalable than Optimism?
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?
Nicolas Gast
B. Gaujal
K. Khun
10
2
0
16 Jun 2021
1