Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.02664
Cited By
v1
v2 (latest)
Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits
5 November 2020
Siwei Wang
Longbo Huang
John C. S. Lui
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits"
20 / 20 papers shown
Model-Based Learning of Whittle indices
Joël Charles-Rebuffé
Nicolas Gast
B. Gaujal
71
0
0
25 Nov 2025
On the Low-Complexity of Fair Learning for Combinatorial Multi-Armed Bandit
IEEE Conference on Computer Communications (IEEE INFOCOM), 2025
Xiaoyi Wu
Bo Ji
Bin Li
FaML
382
1
0
01 Jan 2025
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
International Conference on Learning Representations (ICLR), 2024
Efstathia Soufleri
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
370
2
0
07 Oct 2024
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
Conference Information and Communication Technology (ICT), 2024
Vishesh Mittal
R. Meshram
Surya Prakash
176
0
0
06 Sep 2024
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
Jingwen Tong
Xinran Li
Liqun Fu
Jun Zhang
Khaled B. Letaief
301
3
0
12 Jun 2024
Tabular and Deep Learning for the Whittle Index
Francisco Robledo Relaño
Vivek Borkar
U. Ayesta
Konstantin Avrachenkov
287
5
0
04 Jun 2024
Restless Bandit Problem with Rewards Generated by a Linear Gaussian Dynamical System
J. Gornet
Bruno Sinopoli
284
0
0
15 May 2024
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Efstathia Soufleri
Jian Li
291
1
0
02 May 2024
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks
Shu-Fan Wang
Efstathia Soufleri
Shichen Zhang
Huacheng Zeng
Jian Li
Shivendra Panwar
223
0
0
25 Apr 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shu-Fan Wang
Efstathia Soufleri
Jian Li
486
9
0
16 Dec 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Neural Information Processing Systems (NeurIPS), 2023
Efstathia Soufleri
Jian Li
283
18
0
03 Oct 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Manufacturing & Service Operations Management (MSOM), 2023
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
225
10
0
21 Mar 2023
Approximately Stationary Bandits with Knapsacks
Annual Conference Computational Learning Theory (COLT), 2023
Giannis Fikioris
Éva Tardos
AAML
307
9
0
28 Feb 2023
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits
Paritosh Verma
Shresth Verma
Aditya Mate
Aparna Taneja
Milind Tambe
238
0
0
19 Jan 2023
Stochastic Rising Bandits
International Conference on Machine Learning (ICML), 2022
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
199
19
0
07 Dec 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
AAAI Conference on Artificial Intelligence (AAAI), 2022
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
231
28
0
30 May 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
IEEE/ACM Transactions on Networking (TON), 2022
Efstathia Soufleri
Shu-Fan Wang
Jian Li
Rahul Singh
386
13
0
26 Feb 2022
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Efstathia Soufleri
Jian Li
Rahul Singh
264
4
0
20 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning
J. Killian
Lily Xu
Arpita Biswas
Milind Tambe
232
6
0
04 Jul 2021
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?
Nicolas Gast
B. Gaujal
K. Khun
335
2
0
16 Jun 2021
1
Page 1 of 1