ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.08796
  4. Cited By
Reinforcement Learning in Healthcare: A Survey
v1v2v3v4 (latest)

Reinforcement Learning in Healthcare: A Survey

ACM Computing Surveys (ACM CSUR), 2019
22 August 2019
Chao Yu
Jiming Liu
S. Nemati
    LM&MAOffRL
ArXiv (abs)PDFHTML

Papers citing "Reinforcement Learning in Healthcare: A Survey"

50 / 262 papers shown
Exposing Vulnerabilities in RL: A Novel Stealthy Backdoor Attack through Reward Poisoning
Exposing Vulnerabilities in RL: A Novel Stealthy Backdoor Attack through Reward Poisoning
Bokang Zhang
Chaojun Lu
Jianhui Li
Junfeng Wu
AAML
185
0
0
27 Nov 2025
OpenApps: Simulating Environment Variations to Measure UI-Agent Reliability
OpenApps: Simulating Environment Variations to Measure UI-Agent Reliability
Karen Ullrich
Jingtong Su
Claudia Shi
Arjun Subramonian
Amir Bar
Ivan Evtimov
Nikolaos Tsilivis
Randall Balestriero
Julia Kempe
Mark Ibrahim
151
1
0
25 Nov 2025
Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies
Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies
Dong-Hee Shin
Deok-Joong Lee
Young-Han Son
Tae-Eui Kam
OffRL
196
2
0
15 Nov 2025
Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
Xinming Gao
Shangzhe Li
Yujin Cai
Wenwu Yu
OffRL
162
0
0
15 Nov 2025
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
Yunchang Ma
Tenglong Liu
Yixing Lan
Xin Yin
Changxin Zhang
Xinglong Zhang
Xin Xu
OffRL
289
0
0
12 Nov 2025
Bernstein-von Mises for Adaptively Collected Data
Bernstein-von Mises for Adaptively Collected Data
Kevin Du
Yash Nair
Lucas Janson
143
0
0
10 Nov 2025
Directional-Clamp PPO
Directional-Clamp PPO
Gilad Karpel
Ruida Zhou
Shoham Sabach
Mohammad Ghavamzadeh
107
0
0
04 Nov 2025
Sample-efficient and Scalable Exploration in Continuous-Time RL
Sample-efficient and Scalable Exploration in Continuous-Time RL
Klemens Iten
Lenart Treven
Bhavya Sukhija
Florian Dorfler
Andreas Krause
OffRL
181
1
0
28 Oct 2025
Neural Index Policies for Restless Multi-Action Bandits with Heterogeneous Budgets
Neural Index Policies for Restless Multi-Action Bandits with Heterogeneous Budgets
Himadri S. Pandey
Kai Wang
Gian-Gabriel P. Garcia
155
0
0
24 Oct 2025
Do You Trust the Process?: Modeling Institutional Trust for Community Adoption of Reinforcement Learning Policies
Do You Trust the Process?: Modeling Institutional Trust for Community Adoption of Reinforcement Learning Policies
Naina Balepur
Xingrui Pei
Hari Sundaram
OffRL
109
0
0
24 Oct 2025
Agentic Systems in Radiology: Design, Applications, Evaluation, and Challenges
Agentic Systems in Radiology: Design, Applications, Evaluation, and Challenges
Christian Bluethgen
Dave Van Veen
Daniel Truhn
Jakob Nikolas Kather
Michael Moor
...
Akshay S. Chaudhari
Thomas Frauenfelder
C. Langlotz
Michael Krauthammer
Farhad Nooralahzadeh
LM&MAAI4CE
321
0
0
10 Oct 2025
Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
Noor Islam S. Mohammad
92
0
0
09 Oct 2025
Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior
Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior
Sheng Wang
Ruiming Wu
Charles Herndon
Yihang Liu
Shunsuke Koga
Jeanne Shen
Zhi Huang
187
5
0
06 Oct 2025
Diffusion Policies with Offline and Inverse Reinforcement Learning for Promoting Physical Activity in Older Adults Using Wearable Sensors
Diffusion Policies with Offline and Inverse Reinforcement Learning for Promoting Physical Activity in Older Adults Using Wearable Sensors
Chang Liu
Ladda Thiamwong
Yanjie Fu
Rui Xie
OffRL
173
0
0
22 Sep 2025
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
S. Hazra
P. Dasgupta
Soumyajit Dey
148
0
0
11 Sep 2025
Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform
Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform
Zhaoxun "Lorenz" Liu
Wagner H. Souza
J. N. Han
Amin Madani
96
0
0
10 Sep 2025
LLM-Driven Policy Diffusion: Enhancing Generalization in Offline Reinforcement Learning
LLM-Driven Policy Diffusion: Enhancing Generalization in Offline Reinforcement Learning
Hanping Zhang
Yuhong Guo
OffRL
199
0
0
30 Aug 2025
Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI
Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI
Dilruk Perera
Gousia Habib
Qianyi Xu
Daniel J. Tan
Kai He
Erik Cambria
Mengling Feng
OffRLAI4TS
306
0
0
28 Aug 2025
Central Limit Theorems for Transition Probabilities of Controlled Markov Chains
Central Limit Theorems for Transition Probabilities of Controlled Markov Chains
Ziwei Su
Imon Banerjee
Diego Klabjan
OffRL
220
0
0
02 Aug 2025
MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
Gaurav Chaudhary
Wassim Uddin Mondal
Laxmidhar Behera
OffRL
453
3
0
11 Jun 2025
How to Provably Improve Return Conditioned Supervised Learning?
Zhishuai Liu
Yu Yang
Ruhan Wang
Pan Xu
Dongruo Zhou
OffRL
236
1
0
10 Jun 2025
SAFER: A Calibrated Risk-Aware Multimodal Recommendation Model for Dynamic Treatment Regimes
SAFER: A Calibrated Risk-Aware Multimodal Recommendation Model for Dynamic Treatment Regimes
Yishan Shen
Yuyang Ye
Hui Xiong
Yong Chen
OffRL
126
1
0
07 Jun 2025
Accelerated Learning with Linear Temporal Logic using Differentiable Simulation
Accelerated Learning with Linear Temporal Logic using Differentiable Simulation
Alper Kamil Bozkurt
Calin Belta
Ming C. Lin
302
1
0
01 Jun 2025
Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data
Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data
Lingkai Kong
Haichuan Wang
Tonghan Wang
Guojun Xiong
Milind Tambe
OffRL
461
7
0
29 May 2025
Learning optimal treatment strategies for intraoperative hypotension using deep reinforcement learning
Learning optimal treatment strategies for intraoperative hypotension using deep reinforcement learning
Esra Adiyeke
Tianqi Liu
Venkata Sai Dheeraj Naganaboina
Han Li
Tyler J. Loftus
...
Karandeep Singh
Ruogu Fang
Parisa Rashidi
A. Bihorac
T. Ozrazgat-Baslanti
OffRL
519
1
0
27 May 2025
Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning
Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Shijie Liu
Andrew C. Cullen
Paul Montague
S. Erfani
Benjamin I. P. Rubinstein
OffRLAAML
257
4
0
27 May 2025
medDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support
medDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support
Qianyi Xu
Gousia Habib
Dilruk Perera
Mengling Feng
Mengling Feng
OffRL
415
1
0
26 May 2025
Counterfactual Explanations for Continuous Action Reinforcement Learning
Counterfactual Explanations for Continuous Action Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Shuyang Dong
Shangtong Zhang
Lu Feng
OffRLLRM
383
0
0
19 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian Sun
Xinhu Zheng
Gang Wang
AI4CE
603
33
0
08 May 2025
Active Sampling for MRI-based Sequential Decision Making
Active Sampling for MRI-based Sequential Decision Making
Yuning Du
Jingshuai Liu
R. Dharmakumar
Sotirios A. Tsaftaris
280
0
0
07 May 2025
Bridging Econometrics and AI: VaR Estimation via Reinforcement Learning and GARCH Models
Bridging Econometrics and AI: VaR Estimation via Reinforcement Learning and GARCH Models
Fredy Pokou
Jules Sadefo Kamdem
François Benhmad
AIFin
269
1
0
23 Apr 2025
Can Machine Learning Agents Deal with Hard Choices?
Can Machine Learning Agents Deal with Hard Choices?
Kangyu Wang
640
0
0
18 Apr 2025
How to Adapt Control Barrier Functions? A Learning-Based Approach with Applications to a VTOL Quadplane
How to Adapt Control Barrier Functions? A Learning-Based Approach with Applications to a VTOL QuadplaneIEEE Conference on Decision and Control (CDC), 2025
Taekyung Kim
Randal W. Beard
Dimitra Panagou
512
0
0
03 Apr 2025
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Hongye Cao
Fan Feng
Jing Huo
Shangdong Yang
Meng Fang
Zhenxing Ge
Yang Gao
AAMLOffRL
252
2
0
26 Mar 2025
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Latent Embedding Adaptation for Human Preference Alignment in Diffusion PlannersIEEE International Conference on Robotics and Automation (ICRA), 2025
Wen Zheng Terence Ng
Jianda Chen
Yuan Xu
Tianwei Zhang
417
1
0
24 Mar 2025
Zero-Shot Action Generalization with Limited Observations
Zero-Shot Action Generalization with Limited ObservationsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Abdullah Alchihabi
Hanping Zhang
Yuhong Guo
OffRL
370
0
0
11 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
1.4K
68
0
10 Mar 2025
Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning
Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning
Yang Xu
Washim Uddin Mondal
Vaneet Aggarwal
OffRL
592
8
0
24 Feb 2025
Wasserstein Adaptive Value Estimation for Actor-Critic Reinforcement Learning
Wasserstein Adaptive Value Estimation for Actor-Critic Reinforcement LearningConference on Learning for Dynamics & Control (L4DC), 2025
Ali Baheri
Zahra Sharooei
Chirayu Salgarkar
1.1K
4
0
17 Jan 2025
Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation
Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation
Joo Seung Lee
Malini Mahendra
Anil Aswani
OffRL
349
1
0
10 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
522
8
0
03 Jan 2025
Reinforcement Learning for a Discrete-Time Linear-Quadratic Control Problem with an Application
Reinforcement Learning for a Discrete-Time Linear-Quadratic Control Problem with an Application
Lucky Li
237
0
0
08 Dec 2024
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRLOnRL
475
1
0
05 Dec 2024
Provably Efficient Action-Manipulation Attack Against Continuous
  Reinforcement Learning
Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning
Zhi Luo
Xiaoyu Yang
Pan Zhou
D. Wang
AAML
281
1
0
20 Nov 2024
Upside-Down Reinforcement Learning for More Interpretable Optimal ControlInternational Conference on Agents and Artificial Intelligence (ICAART), 2024
Juan Cardenas-Cartagena
Massimiliano Falzari
Marco Zullich
Matthia Sabatelli
OffRL
323
0
0
18 Nov 2024
An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
Alex Beeson
David Ireland
Giovanni Montana
OffRL
410
4
0
17 Nov 2024
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
OffRL
276
0
0
07 Nov 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning
  for Robustness under Diverse Data Corruptions
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data CorruptionsNeural Information Processing Systems (NeurIPS), 2024
Rui Yang
Jie Wang
Guoping Wu
Yangqiu Song
AAMLOffRL
451
9
0
01 Nov 2024
StepCountJITAI: simulation environment for RL with application to
  physical activity adaptive intervention
StepCountJITAI: simulation environment for RL with application to physical activity adaptive intervention
Karine Karine
Benjamin M. Marlin
222
2
0
01 Nov 2024
OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large
  Language Models
OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Junda Wu
Xintong Li
Ruoyu Wang
Yu Xia
Yuxin Xiong
...
Xiang Chen
Branislav Kveton
Lina Yao
Jingbo Shang
Julian McAuley
OffRLLRM
256
7
0
31 Oct 2024
123456
Next
Page 1 of 6