ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.07965
  4. Cited By
Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in
  Application to Preventive Healthcare
v1v2 (latest)

Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare

17 May 2021
Arpita Biswas
Gaurav Aggarwal
Pradeep Varakantham
Milind Tambe
ArXiv (abs)PDFHTML

Papers citing "Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare"

29 / 29 papers shown
Title
Multi-agent Markov Entanglement
Multi-agent Markov Entanglement
Shuze Chen
Tianyi Peng
56
0
0
03 Jun 2025
Lagrangian Index Policy for Restless Bandits with Average Reward
Lagrangian Index Policy for Restless Bandits with Average Reward
Konstantin Avrachenkov
Vivek Borkar
Pratik Shah
109
1
0
17 Dec 2024
DOPL: Direct Online Preference Learning for Restless Bandits with
  Preference Feedback
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
75
2
0
07 Oct 2024
The Digital Transformation in Health: How AI Can Improve the Performance
  of Health Systems
The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems
África Periánez
Ana Fernández del Río
Ivan Nazarov
Enric Jané
Moiz Hassan
Aditya Rastogi
Dexian Tang
68
11
0
24 Sep 2024
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless
  Multi-armed Bandits
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits
Gongpu Chen
Soung Chang Liew
Deniz Gunduz
31
1
0
19 Aug 2024
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning:
  Insights from SwipeRx
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRLOnRL
73
2
0
15 Aug 2024
Optimizing HIV Patient Engagement with Reinforcement Learning in
  Resource-Limited Settings
Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings
África Periánez
Kathrin Schmitz
Lazola Makhupula
Moiz Hassan
Moeti Moleko
Ana Fernández del Río
Ivan Nazarov
Aditya Rastogi
Dexian Tang
OffRL
58
0
0
14 Aug 2024
The Bandit Whisperer: Communication Learning for Restless Bandits
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
123
6
0
11 Aug 2024
EduQate: Generating Adaptive Curricula through RMABs in Education
  Settings
EduQate: Generating Adaptive Curricula through RMABs in Education Settings
Sidney Tio
Dexun Li
Pradeep Varakantham
OffRL
21
0
0
20 Jun 2024
An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart
  Target Tracking
An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking
Yuhang Hao
Zengfu Wang
Jing-Zhi Fu
Quan Pan
79
0
0
19 Feb 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
94
7
0
16 Dec 2023
Towards a Pretrained Model for Restless Bandits via Multi-arm
  Generalization
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
Yunfan Zhao
Nikhil Behari
Edward Hughes
Edwin Zhang
Dheeraj M. Nagaraj
K. Tuyls
Aparna Taneja
Milind Tambe
74
8
0
23 Oct 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
87
14
0
03 Oct 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Policy Optimization for Personalized Interventions in Behavioral Health
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
58
8
0
21 Mar 2023
Improved Policy Evaluation for Randomized Trials of Algorithmic Resource
  Allocation
Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation
Aditya Mate
Bryan Wilder
Aparna Taneja
Milind Tambe
OffRL
36
3
0
06 Feb 2023
Data-pooling Reinforcement Learning for Personalized Healthcare
  Intervention
Data-pooling Reinforcement Learning for Personalized Healthcare Intervention
Xinyun Chen
P. Shi
Shanwen Pu
OffRL
71
5
0
16 Nov 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh
I.-Hong Hou
146
6
0
18 Sep 2022
On-the-fly Adaptation of Patrolling Strategies in Changing Environments
On-the-fly Adaptation of Patrolling Strategies in Changing Environments
Tomávs Brázdil
David Klavska
Antonín Kuvcera
Vít Musil
Petr Novotný
Vojtvech vRehák
TTAAAML
21
0
0
16 Jun 2022
Efficient Resource Allocation with Fairness Constraints in Restless
  Multi-Armed Bandits
Efficient Resource Allocation with Fairness Constraints in Restless Multi-Armed Bandits
Dexun Li
Pradeep Varakantham
69
9
0
08 Jun 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
84
17
0
30 May 2022
Near-optimality for infinite-horizon restless bandits with many arms
Near-optimality for infinite-horizon restless bandits with many arms
Xinming Zhang
P. Frazier
21
16
0
29 Mar 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear
  Function Approximation
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
Guojun Xiong
Shu-Fan Wang
Jian Li
Rahul Singh
56
6
0
26 Feb 2022
Minimizing Expected Intrusion Detection Time in Adversarial Patrolling
Minimizing Expected Intrusion Detection Time in Adversarial Patrolling
David Klavska
Antonín Kuvcera
Vít Musil
Vojtvech vRehák
AAML
21
0
0
02 Feb 2022
Networked Restless Multi-Armed Bandits for Mobile Interventions
Networked Restless Multi-Armed Bandits for Mobile Interventions
H. Ou
Christoph Siebenbrunner
J. Killian
M. Brooks
David Kempe
Yevgeniy Vorobeychik
Milind Tambe
71
8
0
28 Jan 2022
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Khaled Nakhleh
Santosh Ganji
Ping-Chun Hsieh
I.-Hong Hou
S. Shakkottai
137
40
0
05 Oct 2021
Field Study in Deploying Restless Multi-Armed Bandits: Assisting
  Non-Profits in Improving Maternal and Child Health
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health
Aditya Mate
Lovish Madaan
Aparna Taneja
N. Madhiwalla
Shresth Verma
Gargi Singh
Aparna Hegde
Pradeep Varakantham
Milind Tambe
81
54
0
16 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep
  Multi-Agent Reinforcement Learning
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning
J. Killian
Lily Xu
Arpita Biswas
Milind Tambe
57
6
0
04 Jul 2021
Q-Learning Lagrange Policies for Multi-Action Restless Bandits
Q-Learning Lagrange Policies for Multi-Action Restless Bandits
J. Killian
Arpita Biswas
Sanket Shah
Milind Tambe
OffRL
52
33
0
22 Jun 2021
Efficient Algorithms for Finite Horizon and Streaming Restless
  Multi-Armed Bandit Problems
Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems
Aditya Mate
Arpita Biswas
Christoph Siebenbrunner
Susobhan Ghosh
Milind Tambe
89
9
0
08 Mar 2021
1