Residual Bootstrap Exploration for Bandit Algorithms

19 February 2020
ChiHua Wang
Yang Yu
Botao Hao
Guang Cheng

Papers citing "Residual Bootstrap Exploration for Bandit Algorithms"

15 / 15 papers shown
Batch Ensemble for Variance Dependent Regret in Stochastic Bandits
AAAI Conference on Artificial Intelligence (AAAI), 2024
Asaf B. Cassel
Orin Levy
Yishay Mansour
OffRL
210
3
0
13 Sep 2024
Dynamic Online Recommendation for Two-Sided Market with Bayesian Incentive Compatibility
Yuantong Li
Guang Cheng
Xiaowu Dai
240
1
0
04 Jun 2024
FLASH: Federated Learning Across Simultaneous Heterogeneities
Xiangyu Chang
Sk. Miraj Ahmed
S. Krishnamurthy
Başak Güler
A. Swami
Samet Oymak
Amit K. Roy-Chowdhury
FedML
379
4
0
13 Feb 2024
Forced Exploration in Bandit Problems
AAAI Conference on Artificial Intelligence (AAAI), 2023
Han Qi
Fei-Yu Guo
Li Zhu
330
1
0
12 Dec 2023
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling
Machine-mediated learning (ML), 2023
Susobhan Ghosh
Raphael Kim
Prasidh Chhabria
Raaz Dwivedi
Predrag Klasnja
Peng Liao
Kelly Zhang
Susan Murphy
OffRL
505
13
0
11 Apr 2023
Multiplier Bootstrap-based Exploration
International Conference on Machine Learning (ICML), 2023
Runzhe Wan
Haoyu Wei
Branislav Kveton
R. Song
283
3
0
03 Feb 2023
Federated Online Sparse Decision Making
ChiHua Wang
Wenjie Li
Guang Cheng
Guang Lin
FedML
349
4
0
27 Feb 2022
Residual Bootstrap Exploration for Stochastic Linear Bandit
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Shuang Wu
ChiHua Wang
Yuantong Li
Guang Cheng
290
9
0
23 Feb 2022
From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits
Dorian Baudry
Patrick Saux
Odalric-Ambrym Maillard
189
7
0
18 Nov 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
463
40
0
08 Aug 2021
GuideBoot: Guided Bootstrap for Deep Contextual Bandits
The Web Conference (WWW), 2021
Feiyang Pan
Haoming Li
Xiang Ao
Wei Wang
Yanrong Kang
Ao Tan
Qing He
169
0
0
18 Jul 2021
Optimum-statistical Collaboration Towards General and Efficient Black-box Optimization
Wenjie Li
ChiHua Wang
Guang Cheng
Qifan Song
531
9
0
17 Jun 2021
Sub-sampling for Efficient Non-Parametric Bandit Exploration
Neural Information Processing Systems (NeurIPS), 2020
Dorian Baudry
E. Kaufmann
Odalric-Ambrym Maillard
180
14
0
27 Oct 2020
Online Regularization towards Always-Valid High-Dimensional Dynamic Pricing
ChiHua Wang
Zhanyu Wang
W. Sun
Guang Cheng
364
11
0
05 Jul 2020
BanditPAM: Almost Linear Time $k$-Medoids Clustering via Multi-Armed Bandits
Neural Information Processing Systems (NeurIPS), 2020
Mo Tiwari
Martin Jinye Zhang
James Mayclin
Sebastian Thrun
Chris Piech
Ilan Shomorony
198
12
0
11 Jun 2020