ResearchTrend.AI
Best arm identification in multi-armed bandits with delayed feedback
arXiv: 1803.10937

29 March 2018
Aditya Grover
Todor Markov
Patrick Attia
Norman Jin
Nicholas Perkins
Bryan Cheong
M. Chen
Zi Yang
Stephen J. Harris
W. Chueh
Stefano Ermon

Papers citing "Best arm identification in multi-armed bandits with delayed feedback"

Biased Dueling Bandits with Stochastic Delayed Feedback
Bongsoo Yi
Yue Kang
Yao Li
30
1
0
26 Aug 2024
Optimal Batched Best Arm Identification
Tianyuan Jin
Yu Yang
Jing Tang
Xiaokui Xiao
Pan Xu
36
3
0
21 Oct 2023
A Survey for Solving Mixed Integer Programming via Machine Learning
Jiayi Zhang
Chang-rui Liu
Junchi Yan
Xijun Li
Hui-Ling Zhen
M. Yuan
AI4CE
32
71
0
06 Mar 2022
Partial Likelihood Thompson Sampling
Han Wu
Stefan Wager
LM&MA
22
1
0
02 Mar 2022
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
20
10
0
06 Nov 2021
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-shan Shiu
29
51
0
20 Aug 2021
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits
Wenshuo Guo
Kumar Krishna Agrawal
Aditya Grover
Vidya Muthukumar
A. Pananjady
16
8
0
28 Jun 2021
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian?
S. Santosh
S. Darak
14
0
0
05 Jun 2021
Optimal Algorithms for Range Searching over Multi-Armed Bandits
Siddharth Barman
Ramakrishnan Krishnamurthy
S. Rahul
13
0
0
04 May 2021