v1v2 (latest)

On learning Whittle index policy for restless bandits with scalable regret

IEEE Transactions on Control of Network Systems (IEEE TCNS), 2022

7 February 2022

N. Akbarzadeh

Aditya Mahajan

ArXiv (abs)PDF HTML

Papers citing "On learning Whittle index policy for restless bandits with scalable regret"

8 / 8 papers shown

Model-Based Learning of Whittle indices

Joël Charles-Rebuffé

Nicolas Gast

B. Gaujal

25 Nov 2025

Risk-Aware Decision Making in Restless Bandits: Theory and Algorithms for Planning and Learning

Nima Akbarzadeh

Erick Delage

Yossiri Adulyasak

428

30 Oct 2024

DOPL: Direct Online Preference Learning for Restless Bandits with Preference FeedbackInternational Conference on Learning Representations (ICLR), 2024

372

07 Oct 2024

A Federated Online Restless Bandit Framework for Cooperative Resource Allocation

Jingwen Tong

Xinran Li

Liqun Fu

Jun Zhang

Khaled B. Letaief

302

12 Jun 2024

Tabular and Deep Reinforcement Learning for Gittins Index

Harshit Dhankar

Kshitij Mishra

Tejas Bodas

439

02 May 2024

Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

223

25 Apr 2024

A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food

...

252

15 Mar 2024

Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-SpaceNeural Information Processing Systems (NeurIPS), 2023

Saghar Adler

V. Subramanian

202

05 Jun 2023