v1v2v3 (latest)

Learning When-to-Treat Policies

Journal of the American Statistical Association (JASA), 2019

23 May 2019

Papers citing "Learning When-to-Treat Policies"

50 / 53 papers shown

Title
CaRT: Teaching LLM Agents to Know When They Know Enough Grace Liu Yuxiao Qu J. Schneider Aarti Singh Aviral Kumar LRM 128 0 0 09 Oct 2025
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation Hossein Goli Michael Gimelfarb Nathan Samuel de Lara Haruki Nishimura Masha Itkina Florian Shkurti OffRL 215 1 0 27 May 2025
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to doInternational Conference on Learning Representations (ICLR), 2025 Yoav Wald M. Goldstein Yonathan Efroni Wouter A. C. van Amsterdam Rajesh Ranganath CML 323 0 0 20 Mar 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024 Abdullah Akgul Manuel Haußmann M. Kandemir OffRL 533 0 0 17 Jan 2025
Off-dynamics Conditional Diffusion PlannersIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024 Wen Zheng Terence Ng Jianda Chen Tianwei Zhang DiffM OffRL 288 0 0 16 Oct 2024
Fitted Q-Iteration via Max-Plus-Linear ApproximationIEEE Control Systems Letters (L-CSS), 2024 Y. Liu Mohammad Amin Sharifi Kolarijani 212 2 0 12 Sep 2024
Functional Acceleration for Policy Mirror Descent Veronica Chelu Doina Precup 303 1 0 23 Jul 2024
Artificial Intelligence-based Decision Support Systems for Precision and Digital Health Nina Deliu Bibhas Chakraborty 174 8 0 22 Jul 2024
Structured Difference-of-Q via Orthogonal Learning Defu Cao Angela Zhou 338 0 0 12 Jun 2024
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization Longxiang He Li Shen Xueqian Wang 261 12 0 28 May 2024
A Semiparametric Instrumented Difference-in-Differences Approach to Policy Learning Pan Zhao Yifan Cui CML 239 2 0 14 Oct 2023
Machine Learning Who to Nudge: Causal vs Predictive Targeting in a Field Experiment on Student Financial Aid RenewalJournal of Econometrics (JE), 2023 Susan Athey Niall Keleher Jann Spiess 112 20 0 12 Oct 2023
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning Longxiang He Li Shen Linrui Zhang Junbo Tan Xueqian Wang OffRL 248 16 0 09 Oct 2023
Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs Hector Kohler R. Akrour Philippe Preux OffRL 354 1 0 23 Sep 2023
$$\pi2\text{vec}$: Policy Representations with Successor Features$ $\pi2\text{vec}$ : Policy Representations with Successor FeaturesInternational Conference on Learning Representations (ICLR), 2023 Gianluca Scarpellini Ksenia Konyushkova Claudio Fantacci T. Paine Yutian Chen Misha Denil OffRL 190 1 0 16 Jun 2023
On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023 Hojoon Lee Ko-tik Lee Dongyoon Hwang Hyunho Lee ByungKun Lee Jaegul Choo SSL OOD 190 11 0 09 Jun 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationInternational Conference on Learning Representations (ICLR), 2023 Thanh Nguyen-Tang R. Arora OffRL 194 6 0 24 Feb 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and HealthcareAdaptive Agents and Multi-Agent Systems (AAMAS), 2023 Ge Gao Song Ju Markel Sanz Ausin Min Chi OffRL 183 8 0 18 Feb 2023
Infinite Action Contextual Bandits with Reusable Data ExhaustInternational Conference on Machine Learning (ICML), 2023 Mark Rucker Yinglun Zhu Paul Mineiro OffRL 270 2 0 16 Feb 2023
Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection Daiqi Gao Yufeng Liu D. Zeng OffRL 198 0 0 29 Jan 2023
Deep Spectral Q-learning with Application to Mobile Health Yuhe Gao C. Shi R. Song 154 0 0 03 Jan 2023
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022 Marc Rigter Bruno Lacerda Nick Hawes OffRL 388 10 0 30 Nov 2022
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function ApproximationAAAI Conference on Artificial Intelligence (AAAI), 2022 Thanh Nguyen-Tang Ming Yin Sunil R. Gupta Svetha Venkatesh R. Arora OffRL 168 23 0 23 Nov 2022
Counterfactual Learning with Multioutput Deep Kernels A. Caron G. Baio I. Manolopoulou BDL CML OffRL 192 2 0 20 Nov 2022
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation Xiaoteng Ma Zhipeng Liang Jose H. Blanchet MingWen Liu Li Xia Jiheng Zhang Qianchuan Zhao Zhengyuan Zhou OOD OffRL 269 34 0 14 Sep 2022
Game-Theoretic Algorithms for Conditional Moment Matching Gokul Swamy Sanjiban Choudhury J. Andrew Bagnell Zhiwei Steven Wu 105 0 0 19 Aug 2022
Offline Policy Optimization with Eligible ActionsConference on Uncertainty in Artificial Intelligence (UAI), 2022 Yao Liu Yannis Flet-Berliac Emma Brunskill OffRL 144 6 0 01 Jul 2022
Interpretable Deep Causal Learning for Moderation Effects A. Caron G. Baio I. Manolopoulou CML OOD 207 2 0 21 Jun 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022 Shentao Yang Yihao Feng Shujian Zhang Mi Zhou OffRL 189 14 0 14 Jun 2022
Learning Optimal Dynamic Treatment Regimes Using Causal Tree Methods in MedicineMachine Learning in Health Care (MLHC), 2022 Theresa Blümlein Joel Persson Stefan Feuerriegel CML 186 14 0 14 Apr 2022
Testing Stationarity and Change Point Detection in Reinforcement LearningAnnals of Statistics (Ann. Stat.), 2022 Mengbing Li C. Shi Zhanghua Wu Piotr Fryzlewicz OffRL 444 13 0 03 Mar 2022
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite HorizonsJournal of the American Statistical Association (JASA), 2022 C. Shi Shuang Luo Yuan Le Hongtu Zhu R. Song OffRL OnRL 196 15 0 26 Feb 2022
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning Shentao Yang Zhendong Wang Huangjie Zheng Yihao Feng Mingyuan Zhou OffRL 132 10 0 19 Feb 2022
Meta-Learners for Estimation of Causal Effects: Finite Sample Cross-Fit Performance Gabriel Okasa CML 185 11 0 30 Jan 2022
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach S. Saghafian CML 235 21 0 08 Dec 2021
Offline Neural Contextual Bandits: Pessimism, Optimization and GeneralizationInternational Conference on Learning Representations (ICLR), 2021 Thanh Nguyen-Tang Sunil R. Gupta A. Nguyen Svetha Venkatesh OffRL 190 34 0 27 Nov 2021
Offline Reinforcement Learning: Fundamental Barriers for Value Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2021 Dylan J. Foster A. Krishnamurthy D. Simchi-Levi Yunzong Xu OffRL 262 71 0 21 Nov 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential IgnorabilityAnnals of Statistics (Ann. Stat.), 2021 Yupeng Tang Seung-seob Lee OffRL 317 28 0 24 Oct 2021
Stateful Offline Contextual Policy Evaluation and Learning Nathan Kallus Angela Zhou OffRL 110 6 0 19 Oct 2021
Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraints Shosei Sakaguchi 333 6 0 09 Jun 2021
GEAR: On Optimal Decision Making with Auxiliary Data Hengrui Cai R. Song Wenbin Lu 192 1 0 21 Apr 2021
Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome Hengrui Cai Wenbin Lu R. Song 205 2 0 21 Apr 2021
Benchmarks for Deep Off-Policy EvaluationInternational Conference on Learning Representations (ICLR), 2021 Justin Fu Mohammad Norouzi Ofir Nachum George Tucker Ziyun Wang ... Yutian Chen Aviral Kumar Cosmin Paduraru Sergey Levine T. Paine ELM OffRL 196 108 0 30 Mar 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of PessimismIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2021 Paria Rashidinejad Banghua Zhu Cong Ma Jiantao Jiao Stuart J. Russell OffRL 702 311 0 22 Mar 2021
Estimating the Long-Term Effects of Novel TreatmentsNeural Information Processing Systems (NeurIPS), 2021 Keith Battocchi E. Dillon Maggie Hei Greg Lewis Miruna Oprescu Vasilis Syrgkanis CML 211 12 0 15 Mar 2021
Dynamic covariate balancing: estimating treatment effects over time with potential local projections Davide Viviano Jelena Bradic 265 0 0 01 Mar 2021
Continuous Action Reinforcement Learning from a Mixture of Interpretable ExpertsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020 R. Akrour Davide Tateo Jan Peters 179 26 0 10 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems Sergey Levine Aviral Kumar George Tucker Justin Fu OffRL GP 964 2,327 0 04 May 2020
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved ConfoundingNeural Information Processing Systems (NeurIPS), 2020 Hongseok Namkoong Ramtin Keramati Steve Yadlowsky Emma Brunskill OffRL 277 70 0 12 Mar 2020
Double/Debiased Machine Learning for Dynamic Treatment Effects via g-EstimationNeural Information Processing Systems (NeurIPS), 2020 Greg Lewis Vasilis Syrgkanis CML 336 45 0 17 Feb 2020