v1v2v3 (latest)

Inference for Batched Bandits

Neural Information Processing Systems (NeurIPS), 2020

8 February 2020

Kelly W. Zhang

Lucas Janson

Susan Murphy

ArXiv (abs)PDF HTML

Papers citing "Inference for Batched Bandits"

50 / 55 papers shown

Bernstein-von Mises for Adaptively Collected Data

Kevin Du

Yash Nair

Lucas Janson

154

10 Nov 2025

The Adaptivity Barrier in Batched Nonparametric Bandits: Sharp Characterization of the Price of Unknown Margin

Rong Jiang

Cong Ma

194

05 Nov 2025

Kernel Treatment Effects with Adaptively Collected Data

Houssam Zenati

Bariscan Bozkurt

Arthur Gretton

146

11 Oct 2025

ISMIE: A Framework to Characterize Information Seeking in Modern Information Environments

Shuoqi Sun

Danula Hettiachchi

Damiano Spina

161

09 Oct 2025

Adaptive Off-Policy Inference for M-Estimators Under Model Misspecification

147

17 Sep 2025

Admissibility of Completely Randomized Trials: A Large-Deviation ApproachACM Conference on Economics and Computation (EC), 2025

Guido Imbens

Chao Qin

Stefan Wager

304

05 Jun 2025

Statistical Inference in Reinforcement Learning: A Selective Survey

Chengchun Shi

OffRL

689

22 Feb 2025

A Near-optimal, Scalable and Parallelizable Framework for Stochastic Bandits Robust to Adversarial Corruptions and Beyond

Zicheng Hu

Cheng Chen

435

11 Feb 2025

Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment EffectInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

293

21 Nov 2024

Off-policy estimation with adaptively collected data: the power of online learningNeural Information Processing Systems (NeurIPS), 2024

Jeonghwan Lee

Cong Ma

OffRL

381

19 Nov 2024

Linear Contextual Bandits with Interference

Yang Xu

Wenbin Lu

Rui Song

357

24 Sep 2024

MiWaves Reinforcement Learning Algorithm

Susobhan Ghosh

Yongyi Guo

Pei-Yao Hung

Lara N. Coughlin

Erin Bonar

Inbal Nahum-Shani

Maureen Walton

Susan Murphy

217

27 Aug 2024

AExGym: Benchmarks and Environments for Adaptive Experimentation

327

08 Aug 2024

Oralytics Reinforcement Learning Algorithm

Anna L. Trella

Kelly W. Zhang

Stephanie M Carpenter

Inbal Nahum-Shani

143

19 Jun 2024

Demistifying Inference after Adaptive Experiments

Aurélien F. Bibaut

Nathan Kallus

249

02 May 2024

Active Adaptive Experimental Design for Treatment Effect Estimation with Covariate Choices

387

06 Mar 2024

Batched Nonparametric Contextual Bandits

Rong Jiang

Cong Ma

OffRL

525

27 Feb 2024

Best of Three Worlds: Adaptive Experimentation for Digital Marketing in Practice

437

16 Feb 2024

An Experimental Design for Anytime-Valid Causal Inference on Multi-Armed Bandits

Biyonka Liang

Iavor Bojinov

325

09 Nov 2023

Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and InferenceNeural Information Processing Systems (NeurIPS), 2023

337

01 Oct 2023

Optimal Conditional Inference in Adaptive Experiments

Jiafeng Chen

Isaiah Andrews

254

21 Sep 2023

Adaptive Linear Estimating EquationsNeural Information Processing Systems (NeurIPS), 2023

Mufang Ying

K. Khamaru

Cun-Hui Zhang

450

14 Jul 2023

Statistical Inference on Multi-armed Bandits with Delayed FeedbackInternational Conference on Machine Learning (ICML), 2023

Lei Shi

Jingshen Wang

Tianhao Wu

355

03 Jul 2023

Optimal tests following sequential experiments

Karun Adusumilli

239

30 Apr 2023

Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resamplingMachine-mediated learning (ML), 2023

Kelly Zhang

Susan Murphy

OffRL

498

11 Apr 2023

Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches

Ethan Che

Hongseok Namkoong

OffRL

443

21 Mar 2023

Semi-parametric inference based on adaptively collected data

363

05 Mar 2023

Design-Based Inference for Multi-arm Bandits

316

27 Feb 2023

A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization

410

03 Feb 2023

Anytime-valid off-policy inference for contextual banditsACM / IMS Journal of Data Science (JDS), 2022

507

19 Oct 2022

Reward Imputation with Sketching for Contextual Batched BanditsNeural Information Processing Systems (NeurIPS), 2022

Jun Xu

201

13 Oct 2022

Entropy Regularization for Population EstimationAAAI Conference on Artificial Intelligence (AAAI), 2022

Ben Chugg

Peter Henderson

Jacob Goldin

Mark A. Lemley

253

24 Aug 2022

Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-CareAAAI Conference on Artificial Intelligence (AAAI), 2022

Anna L. Trella

Kelly W. Zhang

Inbal Nahum-Shani

Vivek Shetty

Finale Doshi-Velez

Susan Murphy

OnRL

193

15 Aug 2022

Some performance considerations when using multi-armed bandit algorithms in the presence of missing dataPLoS ONE (PLoS ONE), 2022

228

08 May 2022

Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit SelectionAAAI Conference on Artificial Intelligence (AAAI), 2022

Peter Henderson

Ben Chugg

Brandon R. Anderson

Kristen M. Altenburger

Jacob Goldin

228

25 Apr 2022

Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization

...

407

15 Dec 2021

Safe Data Collection for Offline and Online Policy Learning

Ruihao Zhu

Branislav Kveton

OffRL

167

08 Nov 2021

Efficient Inference Without Trading-off Regret in Bandits: An Allocation Probability Test for Thompson Sampling

Nina Deliu

Joseph Jay Williams

S. Villar

256

30 Oct 2021

Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online LearningJournal of the American Statistical Association (JASA), 2021

397

29 Oct 2021

Lipschitz Bandits with Batched Feedback

Yasong Feng

Zengfeng Huang

Tianyu Wang

468

19 Oct 2021

Efficient Online Estimation of Causal Effects by Deciding What to Observe

Shantanu Gupta

Zachary Chase Lipton

David Benjamin Childers

CML

426

20 Aug 2021

Near-optimal inference in adaptive linear regression

323

05 Jul 2021

A Closer Look at the Worst-case Behavior of Multi-armed Bandit AlgorithmsNeural Information Processing Systems (NeurIPS), 2021

Anand Kalvit

A. Zeevi

324

03 Jun 2021

Off-Policy Evaluation via Adaptive Weighting with Data from Contextual BanditsKnowledge Discovery and Data Mining (KDD), 2021

316

03 Jun 2021

From Finite to Countable-Armed BanditsNeural Information Processing Systems (NeurIPS), 2021

Anand Kalvit

A. Zeevi

247

22 May 2021

Deeply-Debiased Off-Policy Interval EstimationInternational Conference on Machine Learning (ICML), 2021

261

10 May 2021

Statistical Inference with M-Estimators on Adaptively Collected DataNeural Information Processing Systems (NeurIPS), 2021

Kelly W. Zhang

Lucas Janson

Susan Murphy

OffRL

230

29 Apr 2021

Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments

219

22 Mar 2021

Online Multi-Armed Bandits with Adaptive InferenceNeural Information Processing Systems (NeurIPS), 2021

Maria Dimakopoulou

Zhimei Ren

Zhengyuan Zhou

241

25 Feb 2021

Adaptive Doubly Robust Estimator from Non-stationary Logging Policy under a Convergence of Average Probability

Masahiro Kato

OffRL

237

17 Feb 2021