Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.14074
Cited By
Statistical Inference with M-Estimators on Adaptively Collected Data
29 April 2021
Kelly W. Zhang
Lucas Janson
Susan Murphy
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Statistical Inference with M-Estimators on Adaptively Collected Data"
17 / 17 papers shown
Title
Efficient Adaptive Experimentation with Non-Compliance
Miruna Oprescu
Brian M Cho
Nathan Kallus
110
0
0
23 May 2025
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
45
4
0
04 Oct 2023
Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits
Ruohan Zhan
Vitor Hadad
David A. Hirshberg
Susan Athey
OffRL
36
61
0
03 Jun 2021
Post-Contextual-Bandit Inference
Aurélien F. Bibaut
Antoine Chambaz
Maria Dimakopoulou
Nathan Kallus
Mark van der Laan
42
40
0
01 Jun 2021
Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting
Haoyu Chen
Wenbin Lu
R. Song
OffRL
39
30
0
14 Oct 2020
Power Constrained Bandits
Jiayu Yao
Emma Brunskill
Weiwei Pan
Susan Murphy
Finale Doshi-Velez
57
36
0
13 Apr 2020
Inference for Batched Bandits
Kelly W. Zhang
Lucas Janson
Susan Murphy
64
82
0
08 Feb 2020
Confidence Intervals for Policy Evaluation in Adaptive Experiments
Vitor Hadad
David A. Hirshberg
Ruohan Zhan
Stefan Wager
Susan Athey
31
143
0
07 Nov 2019
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
66
185
0
22 Aug 2019
Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator
Sarah Dean
Horia Mania
Nikolai Matni
Benjamin Recht
Stephen Tu
31
283
0
23 May 2018
Accurate Inference for Adaptive Linear Models
Y. Deshpande
Lester W. Mackey
Vasilis Syrgkanis
Matt Taddy
OffRL
55
61
0
18 Dec 2017
Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Yu Wang
Alekh Agarwal
Miroslav Dudík
OffRL
59
220
0
04 Dec 2016
A Reinforcement Learning System to Encourage Physical Activity in Diabetes Patients
I. Hochberg
G. Feraru
Mark Kozdoba
Shie Mannor
Moshe Tennenholtz
E. Yom-Tov
22
167
0
13 May 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
225
573
0
04 Apr 2016
Uniformity and the delta method
Maximilian Kasy
33
33
0
21 Jul 2015
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
133
993
0
15 Sep 2012
On the uniform asymptotic validity of subsampling and the bootstrap
Joseph P. Romano
A. Shaikh
88
103
0
12 Apr 2012
1