Confidence Intervals for Policy Evaluation in Adaptive Experiments

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2019
7 November 2019
Vitor Hadad
David A. Hirshberg
Ruohan Zhan
Stefan Wager
Susan Athey

Papers citing "Confidence Intervals for Policy Evaluation in Adaptive Experiments"

31 / 81 papers shown
Best Arm Identification with Contextual Information under a Small Gap
Masahiro Kato
Masaaki Imaizumi
Takuya Ishihara
T. Kitagawa
347
3
0
15 Sep 2022
Multi-disciplinary fairness considerations in machine learning for clinical trials
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Isabel Chien
Nina Deliu
Richard Turner
Adrian Weller
S. Villar
Niki Kilbertus
FaML
132
26
0
18 May 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
International Statistical Review (ISR), 2022
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
217
16
0
04 Mar 2022
Synthetically Controlled Bandits
Vivek Farias
C. Moallemi
Tianyi Peng
Andrew Zheng
202
13
0
14 Feb 2022
Optimal Best Arm Identification in Two-Armed Bandits with a Fixed Budget under a Small Gap
Masahiro Kato
Kaito Ariu
Masaaki Imaizumi
Masahiro Nomura
Chao Qin
565
3
0
12 Jan 2022
Efficient Inference Without Trading-off Regret in Bandits: An Allocation Probability Test for Thompson Sampling
Nina Deliu
Joseph Jay Williams
S. Villar
213
12
0
30 Oct 2021
Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning
Journal of the American Statistical Association (JASA), 2021
Ye Shen
Hengrui Cai
Rui Song
OffRL
333
6
0
29 Oct 2021
Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making
Alex Chohlas-Wood
Madison Coots
Henry Zhu
Emma Brunskill
Sharad Goel
FaML
251
31
0
18 Sep 2021
Debiasing Samples from Online Learning Using Bootstrap
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Yi Xiong
Ningyuan Chen
Yi Xiong
OffRL, OnRL
225
5
0
31 Jul 2021
Near-optimal inference in adaptive linear regression
K. Khamaru
Y. Deshpande
Tor Lattimore
Lester W. Mackey
Martin J. Wainwright
255
19
0
05 Jul 2021
A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms
Neural Information Processing Systems (NeurIPS), 2021
Anand Kalvit
A. Zeevi
234
38
0
03 Jun 2021
Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits
Knowledge Discovery and Data Mining (KDD), 2021
Ruohan Zhan
Vitor Hadad
David A. Hirshberg
Susan Athey
OffRL
241
71
0
03 Jun 2021
Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning
Neural Information Processing Systems (NeurIPS), 2021
Aurélien F. Bibaut
Antoine Chambaz
Maria Dimakopoulou
Nathan Kallus
Mark van der Laan
OffRL
166
17
0
03 Jun 2021
Post-Contextual-Bandit Inference
Neural Information Processing Systems (NeurIPS), 2021
Aurélien F. Bibaut
Antoine Chambaz
Maria Dimakopoulou
Nathan Kallus
Mark van der Laan
150
47
0
01 Jun 2021
Deeply-Debiased Off-Policy Interval Estimation
International Conference on Machine Learning (ICML), 2021
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
193
43
0
10 May 2021
Policy Learning with Adaptively Collected Data
Management Science (MS), 2021
Ruohan Zhan
Zhimei Ren
Susan Athey
Zhengyuan Zhou
OffRL
235
31
0
05 May 2021
Statistical Inference with M-Estimators on Adaptively Collected Data
Neural Information Processing Systems (NeurIPS), 2021
Kelly W. Zhang
Lucas Janson
Susan Murphy
OffRL
153
52
0
29 Apr 2021
Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments
Joseph Jay Williams
Jacob Nogas
Nina Deliu
Hammad Shaikh
S. Villar
A. Durand
Anna N. Rafferty
AAML
147
11
0
22 Mar 2021
Online Multi-Armed Bandits with Adaptive Inference
Neural Information Processing Systems (NeurIPS), 2021
Maria Dimakopoulou
Zhimei Ren
Zhengyuan Zhou
179
41
0
25 Feb 2021
Adaptive Doubly Robust Estimator from Non-stationary Logging Policy under a Convergence of Average Probability
Masahiro Kato
OffRL
182
0
0
17 Feb 2021
Weak Signal Asymptotics for Sequentially Randomized Experiments
Management Science (MS), 2021
Xueheng Kuang
Stefan Wager
451
11
0
25 Jan 2021
Policy design in experiments with unknown interference
Davide Viviano
Jess Rudder
375
10
0
16 Nov 2020
Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy
Masahiro Kato
Yusuke Kaneko
OffRL
145
4
0
23 Oct 2020
Optimal Off-Policy Evaluation from Multiple Logging Policies
Nathan Kallus
Yuta Saito
Masatoshi Uehara
OffRL
233
43
0
21 Oct 2020
The Adaptive Doubly Robust Estimator for Policy Evaluation in Adaptive Experiments and a Paradox Concerning Logging Policy
Masahiro Kato
Shota Yasui
K. McAlinn
OffRL
221
0
0
08 Oct 2020
Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales
Masahiro Kato
OffRL
113
3
0
12 Jun 2020
Power Constrained Bandits
Machine Learning in Health Care (MLHC), 2020
Jiayu Yao
Emma Brunskill
Weiwei Pan
Susan Murphy
Finale Doshi-Velez
291
42
0
13 Apr 2020
Panel Experiments and Dynamic Causal Effects: A Finite Population Perspective
Quantitative Economics (Quant. Econ.), 2020
Iavor Bojinov
Ashesh Rambachan
N. Shephard
277
55
0
22 Mar 2020
On conditional versus marginal bias in multi-armed bandits
International Conference on Machine Learning (ICML), 2020
Jaehyeok Shin
Aaditya Ramdas
Alessandro Rinaldo
188
13
0
19 Feb 2020
Inference for Batched Bandits
Neural Information Processing Systems (NeurIPS), 2020
Kelly W. Zhang
Lucas Janson
Susan Murphy
305
102
0
08 Feb 2020
Online Causal Inference for Advertising in Real-Time Bidding Auctions
Marketing Science (Providence, R.I.) (MSPRI), 2019
Caio Waisman
Harikesh S. Nair
Carlos Carrion
CML
247
13
0
22 Aug 2019