Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.03217
Cited By
v1
v2
v3 (latest)
Inference for Batched Bandits
Neural Information Processing Systems (NeurIPS), 2020
8 February 2020
Kelly W. Zhang
Lucas Janson
Susan Murphy
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Inference for Batched Bandits"
50 / 55 papers shown
Bernstein-von Mises for Adaptively Collected Data
Kevin Du
Yash Nair
Lucas Janson
154
1
0
10 Nov 2025
The Adaptivity Barrier in Batched Nonparametric Bandits: Sharp Characterization of the Price of Unknown Margin
Rong Jiang
Cong Ma
194
0
0
05 Nov 2025
Kernel Treatment Effects with Adaptively Collected Data
Houssam Zenati
Bariscan Bozkurt
Arthur Gretton
146
1
0
11 Oct 2025
ISMIE: A Framework to Characterize Information Seeking in Modern Information Environments
Shuoqi Sun
Danula Hettiachchi
Damiano Spina
161
0
0
09 Oct 2025
Adaptive Off-Policy Inference for M-Estimators Under Model Misspecification
James Leiner
Robin Dunn
Aaditya Ramdas
OffRL
147
2
0
17 Sep 2025
Admissibility of Completely Randomized Trials: A Large-Deviation Approach
ACM Conference on Economics and Computation (EC), 2025
Guido Imbens
Chao Qin
Stefan Wager
304
0
0
05 Jun 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
689
4
0
22 Feb 2025
A Near-optimal, Scalable and Parallelizable Framework for Stochastic Bandits Robust to Adversarial Corruptions and Beyond
Zicheng Hu
Cheng Chen
435
0
0
11 Feb 2025
Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Ojash Neopane
Aaditya Ramdas
Aarti Singh
CML
293
3
0
21 Nov 2024
Off-policy estimation with adaptively collected data: the power of online learning
Neural Information Processing Systems (NeurIPS), 2024
Jeonghwan Lee
Cong Ma
OffRL
381
1
0
19 Nov 2024
Linear Contextual Bandits with Interference
Yang Xu
Wenbin Lu
Rui Song
357
4
0
24 Sep 2024
MiWaves Reinforcement Learning Algorithm
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen Walton
Susan Murphy
217
4
0
27 Aug 2024
AExGym: Benchmarks and Environments for Adaptive Experimentation
Jimmy Wang
Ethan Che
Daniel R. Jiang
Hongseok Namkoong
327
0
0
08 Aug 2024
Oralytics Reinforcement Learning Algorithm
Anna L. Trella
Kelly W. Zhang
Stephanie M Carpenter
David Elashoff
Zara M Greer
Inbal Nahum-Shani
Dennis Ruenger
Vivek Shetty
Susan Murphy
143
1
0
19 Jun 2024
Demistifying Inference after Adaptive Experiments
Aurélien F. Bibaut
Nathan Kallus
249
3
0
02 May 2024
Active Adaptive Experimental Design for Treatment Effect Estimation with Covariate Choices
Masahiro Kato
Akihiro Oga
Wataru Komatsubara
Ryo Inokuchi
387
11
0
06 Mar 2024
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
525
4
0
27 Feb 2024
Best of Three Worlds: Adaptive Experimentation for Digital Marketing in Practice
Tanner Fiez
Houssam Nassif
Yu-Cheng Chen
Sergio Gamez
Lalit P. Jain
437
8
0
16 Feb 2024
An Experimental Design for Anytime-Valid Causal Inference on Multi-Armed Bandits
Biyonka Liang
Iavor Bojinov
325
9
0
09 Nov 2023
Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and Inference
Neural Information Processing Systems (NeurIPS), 2023
Licong Lin
Mufang Ying
Suvrojit Ghosh
K. Khamaru
Cun-Hui Zhang
337
3
0
01 Oct 2023
Optimal Conditional Inference in Adaptive Experiments
Jiafeng Chen
Isaiah Andrews
254
4
0
21 Sep 2023
Adaptive Linear Estimating Equations
Neural Information Processing Systems (NeurIPS), 2023
Mufang Ying
K. Khamaru
Cun-Hui Zhang
450
7
0
14 Jul 2023
Statistical Inference on Multi-armed Bandits with Delayed Feedback
International Conference on Machine Learning (ICML), 2023
Lei Shi
Jingshen Wang
Tianhao Wu
355
7
0
03 Jul 2023
Optimal tests following sequential experiments
Karun Adusumilli
239
4
0
30 Apr 2023
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling
Machine-mediated learning (ML), 2023
Susobhan Ghosh
Raphael Kim
Prasidh Chhabria
Raaz Dwivedi
Predrag Klasjna
Peng Liao
Kelly Zhang
Susan Murphy
OffRL
498
13
0
11 Apr 2023
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches
Ethan Che
Hongseok Namkoong
OffRL
443
3
0
21 Mar 2023
Semi-parametric inference based on adaptively collected data
Licong Lin
K. Khamaru
Martin J. Wainwright
OffRL
363
7
0
05 Mar 2023
Design-Based Inference for Multi-arm Bandits
D. Ham
Iavor Bojinov
Michael Lindon
M. Tingley
316
2
0
27 Feb 2023
A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization
Yasong Feng
Weijian Luo
Yimin Huang
Tianyu Wang
410
10
0
03 Feb 2023
Anytime-valid off-policy inference for contextual bandits
ACM / IMS Journal of Data Science (JDS), 2022
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
507
49
0
19 Oct 2022
Reward Imputation with Sketching for Contextual Batched Bandits
Neural Information Processing Systems (NeurIPS), 2022
Xiao Zhang
Ninglu Shao
Zihua Si
Jun Xu
Wen Wang
Hanjing Su
Jirong Wen
OffRL
201
4
0
13 Oct 2022
Entropy Regularization for Population Estimation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Ben Chugg
Peter Henderson
Jacob Goldin
Mark A. Lemley
253
3
0
24 Aug 2022
Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
AAAI Conference on Artificial Intelligence (AAAI), 2022
Anna L. Trella
Kelly W. Zhang
Inbal Nahum-Shani
Vivek Shetty
Finale Doshi-Velez
Susan Murphy
OnRL
193
29
0
15 Aug 2022
Some performance considerations when using multi-armed bandit algorithms in the presence of missing data
PLoS ONE (PLoS ONE), 2022
Xijin Chen
K. M. Lee
S. Villar
D. Robertson
228
2
0
08 May 2022
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection
AAAI Conference on Artificial Intelligence (AAAI), 2022
Peter Henderson
Ben Chugg
Brandon R. Anderson
Kristen M. Altenburger
Alex Turk
J. Guyton
Jacob Goldin
Mark A. Lemley
OffRL
228
10
0
25 Apr 2022
Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization
Tong Li
Jacob Nogas
Haochen Song
Anna N. Rafferty
Eric M. Schwartz
...
Harsh Kumar
Nina Deliu
S. Villar
Dehan Kong
Joseph Jay Williams
407
6
0
15 Dec 2021
Safe Data Collection for Offline and Online Policy Learning
Ruihao Zhu
Branislav Kveton
OffRL
167
5
0
08 Nov 2021
Efficient Inference Without Trading-off Regret in Bandits: An Allocation Probability Test for Thompson Sampling
Nina Deliu
Joseph Jay Williams
S. Villar
256
13
0
30 Oct 2021
Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning
Journal of the American Statistical Association (JASA), 2021
Ye Shen
Hengrui Cai
Rui Song
OffRL
397
6
0
29 Oct 2021
Lipschitz Bandits with Batched Feedback
Yasong Feng
Zengfeng Huang
Tianyu Wang
468
17
0
19 Oct 2021
Efficient Online Estimation of Causal Effects by Deciding What to Observe
Shantanu Gupta
Zachary Chase Lipton
David Benjamin Childers
CML
426
19
0
20 Aug 2021
Near-optimal inference in adaptive linear regression
K. Khamaru
Y. Deshpande
Tor Lattimore
Lester W. Mackey
Martin J. Wainwright
323
21
0
05 Jul 2021
A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms
Neural Information Processing Systems (NeurIPS), 2021
Anand Kalvit
A. Zeevi
324
42
0
03 Jun 2021
Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits
Knowledge Discovery and Data Mining (KDD), 2021
Ruohan Zhan
Vitor Hadad
David A. Hirshberg
Susan Athey
OffRL
316
73
0
03 Jun 2021
From Finite to Countable-Armed Bandits
Neural Information Processing Systems (NeurIPS), 2021
Anand Kalvit
A. Zeevi
247
14
0
22 May 2021
Deeply-Debiased Off-Policy Interval Estimation
International Conference on Machine Learning (ICML), 2021
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
261
43
0
10 May 2021
Statistical Inference with M-Estimators on Adaptively Collected Data
Neural Information Processing Systems (NeurIPS), 2021
Kelly W. Zhang
Lucas Janson
Susan Murphy
OffRL
230
56
0
29 Apr 2021
Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments
Joseph Jay Williams
Jacob Nogas
Nina Deliu
Hammad Shaikh
S. Villar
A. Durand
Anna N. Rafferty
AAML
219
12
0
22 Mar 2021
Online Multi-Armed Bandits with Adaptive Inference
Neural Information Processing Systems (NeurIPS), 2021
Maria Dimakopoulou
Zhimei Ren
Zhengyuan Zhou
241
42
0
25 Feb 2021
Adaptive Doubly Robust Estimator from Non-stationary Logging Policy under a Convergence of Average Probability
Masahiro Kato
OffRL
237
0
0
17 Feb 2021
1
2
Next
Page 1 of 2