1911.02768
Cited By
v1
v2
v3
v4 (latest)
Confidence Intervals for Policy Evaluation in Adaptive Experiments
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2019
7 November 2019
Vitor Hadad
David A. Hirshberg
Ruohan Zhan
Stefan Wager
Susan Athey
Papers citing
"Confidence Intervals for Policy Evaluation in Adaptive Experiments"
31 / 81 papers shown
Title
Best Arm Identification with Contextual Information under a Small Gap
Masahiro Kato
Masaaki Imaizumi
Takuya Ishihara
T. Kitagawa
347
3
0
15 Sep 2022
Multi-disciplinary fairness considerations in machine learning for clinical trials
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Isabel Chien
Nina Deliu
Richard Turner
Adrian Weller
S. Villar
Niki Kilbertus
FaML
132
26
0
18 May 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
International Statistical Review (ISR), 2022
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
217
16
0
04 Mar 2022
Synthetically Controlled Bandits
Vivek Farias
C. Moallemi
Tianyi Peng
Andrew Zheng
202
13
0
14 Feb 2022
Optimal Best Arm Identification in Two-Armed Bandits with a Fixed Budget under a Small Gap
Masahiro Kato
Kaito Ariu
Masaaki Imaizumi
Masahiro Nomura
Chao Qin
565
3
0
12 Jan 2022
Efficient Inference Without Trading-off Regret in Bandits: An Allocation Probability Test for Thompson Sampling
Nina Deliu
Joseph Jay Williams
S. Villar
213
12
0
30 Oct 2021
Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning
Journal of the American Statistical Association (JASA), 2021
Ye Shen
Hengrui Cai
Rui Song
OffRL
333
6
0
29 Oct 2021
Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making
Alex Chohlas-Wood
Madison Coots
Henry Zhu
Emma Brunskill
Sharad Goel
FaML
251
31
0
18 Sep 2021
Debiasing Samples from Online Learning Using Bootstrap
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Yi Xiong
Ningyuan Chen
OffRL
OnRL
225
5
0
31 Jul 2021
Near-optimal inference in adaptive linear regression
K. Khamaru
Y. Deshpande
Tor Lattimore
Lester W. Mackey
Martin J. Wainwright
255
19
0
05 Jul 2021
A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms
Neural Information Processing Systems (NeurIPS), 2021
Anand Kalvit
A. Zeevi
234
38
0
03 Jun 2021
Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits
Knowledge Discovery and Data Mining (KDD), 2021
Ruohan Zhan
Vitor Hadad
David A. Hirshberg
Susan Athey
OffRL
241
71
0
03 Jun 2021
Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning
Neural Information Processing Systems (NeurIPS), 2021
Aurélien F. Bibaut
Antoine Chambaz
Maria Dimakopoulou
Nathan Kallus
Mark van der Laan
OffRL
166
17
0
03 Jun 2021
Post-Contextual-Bandit Inference
Neural Information Processing Systems (NeurIPS), 2021
Aurélien F. Bibaut
Antoine Chambaz
Maria Dimakopoulou
Nathan Kallus
Mark van der Laan
150
47
0
01 Jun 2021
Deeply-Debiased Off-Policy Interval Estimation
International Conference on Machine Learning (ICML), 2021
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
193
43
0
10 May 2021
Policy Learning with Adaptively Collected Data
Management Science (MS), 2021
Ruohan Zhan
Zhimei Ren
Susan Athey
Zhengyuan Zhou
OffRL
235
31
0
05 May 2021
Statistical Inference with M-Estimators on Adaptively Collected Data
Neural Information Processing Systems (NeurIPS), 2021
Kelly W. Zhang
Lucas Janson
Susan Murphy
OffRL
153
52
0
29 Apr 2021
Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments
Joseph Jay Williams
Jacob Nogas
Nina Deliu
Hammad Shaikh
S. Villar
A. Durand
Anna N. Rafferty
AAML
147
11
0
22 Mar 2021
Online Multi-Armed Bandits with Adaptive Inference
Neural Information Processing Systems (NeurIPS), 2021
Maria Dimakopoulou
Zhimei Ren
Zhengyuan Zhou
179
41
0
25 Feb 2021
Adaptive Doubly Robust Estimator from Non-stationary Logging Policy under a Convergence of Average Probability
Masahiro Kato
OffRL
182
0
0
17 Feb 2021
Weak Signal Asymptotics for Sequentially Randomized Experiments
Management Science (MS), 2021
Xu Kuang
Stefan Wager
451
11
0
25 Jan 2021
Policy design in experiments with unknown interference
Davide Viviano
Jess Rudder
375
10
0
16 Nov 2020
Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy
Masahiro Kato
Yusuke Kaneko
OffRL
145
4
0
23 Oct 2020
Optimal Off-Policy Evaluation from Multiple Logging Policies
Nathan Kallus
Yuta Saito
Masatoshi Uehara
OffRL
233
43
0
21 Oct 2020
The Adaptive Doubly Robust Estimator for Policy Evaluation in Adaptive Experiments and a Paradox Concerning Logging Policy
Masahiro Kato
Shota Yasui
K. McAlinn
OffRL
221
0
0
08 Oct 2020
Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales
Masahiro Kato
OffRL
113
3
0
12 Jun 2020
Power Constrained Bandits
Machine Learning in Health Care (MLHC), 2020
Jiayu Yao
Emma Brunskill
Weiwei Pan
Susan Murphy
Finale Doshi-Velez
291
42
0
13 Apr 2020
Panel Experiments and Dynamic Causal Effects: A Finite Population Perspective
Quantitative Economics (Quant. Econ.), 2020
Iavor Bojinov
Ashesh Rambachan
N. Shephard
277
55
0
22 Mar 2020
On conditional versus marginal bias in multi-armed bandits
International Conference on Machine Learning (ICML), 2020
Jaehyeok Shin
Aaditya Ramdas
Alessandro Rinaldo
188
13
0
19 Feb 2020
Inference for Batched Bandits
Neural Information Processing Systems (NeurIPS), 2020
Kelly W. Zhang
Lucas Janson
Susan Murphy
305
102
0
08 Feb 2020
Online Causal Inference for Advertising in Real-Time Bidding Auctions
Marketing Science (Providence, R.I.) (MSPRI), 2019
Caio Waisman
Harikesh S. Nair
Carlos Carrion
CML
247
13
0
22 Aug 2019