ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.09751
  4. Cited By
Learning When-to-Treat Policies
v1v2v3 (latest)

Learning When-to-Treat Policies

Journal of the American Statistical Association (JASA), 2019
23 May 2019
Xinkun Nie
Emma Brunskill
Stefan Wager
    CMLOffRL
ArXiv (abs)PDFHTML

Papers citing "Learning When-to-Treat Policies"

50 / 53 papers shown
Title
CaRT: Teaching LLM Agents to Know When They Know Enough
CaRT: Teaching LLM Agents to Know When They Know Enough
Grace Liu
Yuxiao Qu
J. Schneider
Aarti Singh
Aviral Kumar
LRM
128
0
0
09 Oct 2025
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
Hossein Goli
Michael Gimelfarb
Nathan Samuel de Lara
Haruki Nishimura
Masha Itkina
Florian Shkurti
OffRL
215
1
0
27 May 2025
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to doInternational Conference on Learning Representations (ICLR), 2025
Yoav Wald
M. Goldstein
Yonathan Efroni
Wouter A. C. van Amsterdam
Rajesh Ranganath
CML
323
0
0
20 Mar 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
533
0
0
17 Jan 2025
Off-dynamics Conditional Diffusion Planners
Off-dynamics Conditional Diffusion PlannersIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Wen Zheng Terence Ng
Jianda Chen
Tianwei Zhang
DiffMOffRL
288
0
0
16 Oct 2024
Fitted Q-Iteration via Max-Plus-Linear Approximation
Fitted Q-Iteration via Max-Plus-Linear ApproximationIEEE Control Systems Letters (L-CSS), 2024
Y. Liu
Mohammad Amin Sharifi Kolarijani
212
2
0
12 Sep 2024
Functional Acceleration for Policy Mirror Descent
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
303
1
0
23 Jul 2024
Artificial Intelligence-based Decision Support Systems for Precision and
  Digital Health
Artificial Intelligence-based Decision Support Systems for Precision and Digital Health
Nina Deliu
Bibhas Chakraborty
174
8
0
22 Jul 2024
Structured Difference-of-Q via Orthogonal Learning
Structured Difference-of-Q via Orthogonal Learning
Defu Cao
Angela Zhou
338
0
0
12 Jun 2024
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
Longxiang He
Li Shen
Xueqian Wang
261
12
0
28 May 2024
A Semiparametric Instrumented Difference-in-Differences Approach to
  Policy Learning
A Semiparametric Instrumented Difference-in-Differences Approach to Policy Learning
Pan Zhao
Yifan Cui
CML
239
2
0
14 Oct 2023
Machine Learning Who to Nudge: Causal vs Predictive Targeting in a Field
  Experiment on Student Financial Aid Renewal
Machine Learning Who to Nudge: Causal vs Predictive Targeting in a Field Experiment on Student Financial Aid RenewalJournal of Econometrics (JE), 2023
Susan Athey
Niall Keleher
Jann Spiess
112
20
0
12 Oct 2023
DiffCPS: Diffusion Model based Constrained Policy Search for Offline
  Reinforcement Learning
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
Longxiang He
Li Shen
Linrui Zhang
Junbo Tan
Xueqian Wang
OffRL
248
16
0
09 Oct 2023
Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in
  IBMDPs
Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs
Hector Kohler
R. Akrour
Philippe Preux
OffRL
354
1
0
23 Sep 2023
$\pi2\text{vec}$: Policy Representations with Successor Features
π2vec\pi2\text{vec}π2vec: Policy Representations with Successor FeaturesInternational Conference on Learning Representations (ICLR), 2023
Gianluca Scarpellini
Ksenia Konyushkova
Claudio Fantacci
T. Paine
Yutian Chen
Misha Denil
OffRL
190
1
0
16 Jun 2023
On the Importance of Feature Decorrelation for Unsupervised
  Representation Learning in Reinforcement Learning
On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Hojoon Lee
Ko-tik Lee
Dongyoon Hwang
Hyunho Lee
ByungKun Lee
Jaegul Choo
SSLOOD
190
11
0
09 Jun 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function
  Approximation
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationInternational Conference on Learning Representations (ICLR), 2023
Thanh Nguyen-Tang
R. Arora
OffRL
194
6
0
24 Feb 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and HealthcareAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Ge Gao
Song Ju
Markel Sanz Ausin
Min Chi
OffRL
183
8
0
18 Feb 2023
Infinite Action Contextual Bandits with Reusable Data Exhaust
Infinite Action Contextual Bandits with Reusable Data ExhaustInternational Conference on Machine Learning (ICML), 2023
Mark Rucker
Yinglun Zhu
Paul Mineiro
OffRL
270
2
0
16 Feb 2023
Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection
Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection
Daiqi Gao
Yufeng Liu
D. Zeng
OffRL
198
0
0
29 Jan 2023
Deep Spectral Q-learning with Application to Mobile Health
Deep Spectral Q-learning with Application to Mobile Health
Yuhe Gao
C. Shi
R. Song
154
0
0
03 Jan 2023
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based
  Offline Reinforcement Learning
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Marc Rigter
Bruno Lacerda
Nick Hawes
OffRL
388
10
0
30 Nov 2022
On Instance-Dependent Bounds for Offline Reinforcement Learning with
  Linear Function Approximation
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function ApproximationAAAI Conference on Artificial Intelligence (AAAI), 2022
Thanh Nguyen-Tang
Ming Yin
Sunil R. Gupta
Svetha Venkatesh
R. Arora
OffRL
168
23
0
23 Nov 2022
Counterfactual Learning with Multioutput Deep Kernels
Counterfactual Learning with Multioutput Deep Kernels
A. Caron
G. Baio
I. Manolopoulou
BDLCMLOffRL
192
2
0
20 Nov 2022
Distributionally Robust Offline Reinforcement Learning with Linear
  Function Approximation
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Xiaoteng Ma
Zhipeng Liang
Jose H. Blanchet
MingWen Liu
Li Xia
Jiheng Zhang
Qianchuan Zhao
Zhengyuan Zhou
OODOffRL
269
34
0
14 Sep 2022
Game-Theoretic Algorithms for Conditional Moment Matching
Game-Theoretic Algorithms for Conditional Moment Matching
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
105
0
0
19 Aug 2022
Offline Policy Optimization with Eligible Actions
Offline Policy Optimization with Eligible ActionsConference on Uncertainty in Artificial Intelligence (UAI), 2022
Yao Liu
Yannis Flet-Berliac
Emma Brunskill
OffRL
144
6
0
01 Jul 2022
Interpretable Deep Causal Learning for Moderation Effects
Interpretable Deep Causal Learning for Moderation Effects
A. Caron
G. Baio
I. Manolopoulou
CMLOOD
207
2
0
21 Jun 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize
  Offline Reinforcement Learning
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
189
14
0
14 Jun 2022
Learning Optimal Dynamic Treatment Regimes Using Causal Tree Methods in
  Medicine
Learning Optimal Dynamic Treatment Regimes Using Causal Tree Methods in MedicineMachine Learning in Health Care (MLHC), 2022
Theresa Blümlein
Joel Persson
Stefan Feuerriegel
CML
186
14
0
14 Apr 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement LearningAnnals of Statistics (Ann. Stat.), 2022
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
444
13
0
03 Mar 2022
Statistically Efficient Advantage Learning for Offline Reinforcement
  Learning in Infinite Horizons
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite HorizonsJournal of the American Statistical Association (JASA), 2022
C. Shi
Shuang Luo
Yuan Le
Hongtu Zhu
R. Song
OffRLOnRL
196
15
0
26 Feb 2022
A Behavior Regularized Implicit Policy for Offline Reinforcement
  Learning
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning
Shentao Yang
Zhendong Wang
Huangjie Zheng
Yihao Feng
Mingyuan Zhou
OffRL
132
10
0
19 Feb 2022
Meta-Learners for Estimation of Causal Effects: Finite Sample Cross-Fit
  Performance
Meta-Learners for Estimation of Causal Effects: Finite Sample Cross-Fit Performance
Gabriel Okasa
CML
185
11
0
30 Jan 2022
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
S. Saghafian
CML
235
21
0
08 Dec 2021
Offline Neural Contextual Bandits: Pessimism, Optimization and
  Generalization
Offline Neural Contextual Bandits: Pessimism, Optimization and GeneralizationInternational Conference on Learning Representations (ICLR), 2021
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
OffRL
190
34
0
27 Nov 2021
Offline Reinforcement Learning: Fundamental Barriers for Value Function
  Approximation
Offline Reinforcement Learning: Fundamental Barriers for Value Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2021
Dylan J. Foster
A. Krishnamurthy
D. Simchi-Levi
Yunzong Xu
OffRL
262
71
0
21 Nov 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes
  under Sequential Ignorability
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential IgnorabilityAnnals of Statistics (Ann. Stat.), 2021
Yupeng Tang
Seung-seob Lee
OffRL
317
28
0
24 Oct 2021
Stateful Offline Contextual Policy Evaluation and Learning
Stateful Offline Contextual Policy Evaluation and Learning
Nathan Kallus
Angela Zhou
OffRL
110
6
0
19 Oct 2021
Estimation of Optimal Dynamic Treatment Assignment Rules under Policy
  Constraints
Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraints
Shosei Sakaguchi
333
6
0
09 Jun 2021
GEAR: On Optimal Decision Making with Auxiliary Data
GEAR: On Optimal Decision Making with Auxiliary Data
Hengrui Cai
R. Song
Wenbin Lu
192
1
0
21 Apr 2021
Calibrated Optimal Decision Making with Multiple Data Sources and
  Limited Outcome
Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome
Hengrui Cai
Wenbin Lu
R. Song
205
2
0
21 Apr 2021
Benchmarks for Deep Off-Policy Evaluation
Benchmarks for Deep Off-Policy EvaluationInternational Conference on Learning Representations (ICLR), 2021
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELMOffRL
196
108
0
30 Mar 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale
  of Pessimism
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of PessimismIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2021
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
702
311
0
22 Mar 2021
Estimating the Long-Term Effects of Novel Treatments
Estimating the Long-Term Effects of Novel TreatmentsNeural Information Processing Systems (NeurIPS), 2021
Keith Battocchi
E. Dillon
Maggie Hei
Greg Lewis
Miruna Oprescu
Vasilis Syrgkanis
CML
211
12
0
15 Mar 2021
Dynamic covariate balancing: estimating treatment effects over time with
  potential local projections
Dynamic covariate balancing: estimating treatment effects over time with potential local projections
Davide Viviano
Jelena Bradic
265
0
0
01 Mar 2021
Continuous Action Reinforcement Learning from a Mixture of Interpretable
  Experts
Continuous Action Reinforcement Learning from a Mixture of Interpretable ExpertsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
R. Akrour
Davide Tateo
Jan Peters
179
26
0
10 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRLGP
964
2,327
0
04 May 2020
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved
  Confounding
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved ConfoundingNeural Information Processing Systems (NeurIPS), 2020
Hongseok Namkoong
Ramtin Keramati
Steve Yadlowsky
Emma Brunskill
OffRL
277
70
0
12 Mar 2020
Double/Debiased Machine Learning for Dynamic Treatment Effects via
  g-Estimation
Double/Debiased Machine Learning for Dynamic Treatment Effects via g-EstimationNeural Information Processing Systems (NeurIPS), 2020
Greg Lewis
Vasilis Syrgkanis
CML
336
45
0
17 Feb 2020
12
Next