Statistical Inference with M-Estimators on Adaptively Collected Data

Statistical Inference with M-Estimators on Adaptively Collected Data

29 April 2021

Kelly W. Zhang

Papers citing "Statistical Inference with M-Estimators on Adaptively Collected Data"

17 / 17 papers shown

Title
Efficient Adaptive Experimentation with Non-Compliance Miruna Oprescu Brian M Cho Nathan Kallus 110 0 0 23 May 2025
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning Weidong Liu Jiyuan Tu Yichen Zhang Xi Chen OffRL 45 4 0 04 Oct 2023
Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits Ruohan Zhan Vitor Hadad David A. Hirshberg Susan Athey OffRL 36 61 0 03 Jun 2021
Post-Contextual-Bandit Inference Aurélien F. Bibaut Antoine Chambaz Maria Dimakopoulou Nathan Kallus Mark van der Laan 42 40 0 01 Jun 2021
Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting Haoyu Chen Wenbin Lu R. Song OffRL 39 30 0 14 Oct 2020
Power Constrained Bandits Jiayu Yao Emma Brunskill Weiwei Pan Susan Murphy Finale Doshi-Velez 57 36 0 13 Apr 2020
Inference for Batched Bandits Kelly W. Zhang Lucas Janson Susan Murphy 64 82 0 08 Feb 2020
Confidence Intervals for Policy Evaluation in Adaptive Experiments Vitor Hadad David A. Hirshberg Ruohan Zhan Stefan Wager Susan Athey 31 143 0 07 Nov 2019
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes Nathan Kallus Masatoshi Uehara OffRL 66 185 0 22 Aug 2019
Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator Sarah Dean Horia Mania Nikolai Matni Benjamin Recht Stephen Tu 31 283 0 23 May 2018
Accurate Inference for Adaptive Linear Models Y. Deshpande Lester W. Mackey Vasilis Syrgkanis Matt Taddy OffRL 55 61 0 18 Dec 2017
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits Yu Wang Alekh Agarwal Miroslav Dudík OffRL 59 220 0 04 Dec 2016
A Reinforcement Learning System to Encourage Physical Activity in Diabetes Patients I. Hochberg G. Feraru Mark Kozdoba Shie Mannor Moshe Tennenholtz E. Yom-Tov 22 167 0 13 May 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning Philip S. Thomas Emma Brunskill OffRL 225 573 0 04 Apr 2016
Uniformity and the delta method Maximilian Kasy 33 33 0 21 Jul 2015
Thompson Sampling for Contextual Bandits with Linear Payoffs Shipra Agrawal Navin Goyal 133 993 0 15 Sep 2012
On the uniform asymptotic validity of subsampling and the bootstrap Joseph P. Romano A. Shaikh 88 103 0 12 Apr 2012