ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.03463
  4. Cited By
On learning Whittle index policy for restless bandits with scalable
  regret
v1v2 (latest)

On learning Whittle index policy for restless bandits with scalable regret

IEEE Transactions on Control of Network Systems (IEEE TCNS), 2022
7 February 2022
N. Akbarzadeh
Aditya Mahajan
ArXiv (abs)PDFHTML

Papers citing "On learning Whittle index policy for restless bandits with scalable regret"

8 / 8 papers shown
Model-Based Learning of Whittle indices
Model-Based Learning of Whittle indices
Joël Charles-Rebuffé
Nicolas Gast
B. Gaujal
74
1
0
25 Nov 2025
Risk-Aware Decision Making in Restless Bandits: Theory and Algorithms for Planning and Learning
Risk-Aware Decision Making in Restless Bandits: Theory and Algorithms for Planning and Learning
Nima Akbarzadeh
Erick Delage
Yossiri Adulyasak
428
0
0
30 Oct 2024
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
DOPL: Direct Online Preference Learning for Restless Bandits with Preference FeedbackInternational Conference on Learning Representations (ICLR), 2024
Efstathia Soufleri
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
372
2
0
07 Oct 2024
A Federated Online Restless Bandit Framework for Cooperative Resource
  Allocation
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
Jingwen Tong
Xinran Li
Liqun Fu
Jun Zhang
Khaled B. Letaief
302
3
0
12 Jun 2024
Tabular and Deep Reinforcement Learning for Gittins Index
Tabular and Deep Reinforcement Learning for Gittins Index
Harshit Dhankar
Kshitij Mishra
Tejas Bodas
439
1
0
02 May 2024
Structured Reinforcement Learning for Delay-Optimal Data Transmission in
  Dense mmWave Networks
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks
Shu-Fan Wang
Efstathia Soufleri
Shichen Zhang
Huacheng Zeng
Jian Li
Shivendra Panwar
223
0
0
25 Apr 2024
A resource-constrained stochastic scheduling algorithm for homeless
  street outreach and gleaning edible food
A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food
Conor M. Artman
Aditya Mate
Ezinne Nwankwo
A. Heching
Tsuyoshi Idé
...
Kush R. Varshney
Lauri Goldkind
Gidi Kroch
Jaclyn Sawyer
Ian Watson
252
0
0
15 Mar 2024
Bayesian Learning of Optimal Policies in Markov Decision Processes with
  Countably Infinite State-Space
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-SpaceNeural Information Processing Systems (NeurIPS), 2023
Saghar Adler
V. Subramanian
202
3
0
05 Jun 2023
1
Page 1 of 1