Stochastic bandits with arm-dependent delays

Stochastic bandits with arm-dependent delays

18 June 2020

Anne Gael Manegueu

Alexandra Carpentier

Papers citing "Stochastic bandits with arm-dependent delays"

14 / 14 papers shown

Title
Contextual Linear Bandits with Delay as Payoff Mengxiao Zhang Yingfei Wang Haipeng Luo 41 0 0 18 Feb 2025
Biased Dueling Bandits with Stochastic Delayed Feedback Bongsoo Yi Yue Kang Yao Li 38 1 0 26 Aug 2024
Faster Stochastic Optimization with Arbitrary Delays via Asynchronous Mini-Batching Amit Attia Ofir Gaash Tomer Koren 40 0 0 14 Aug 2024
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback Yunchang Yang Hangshi Zhong Tianhao Wu B. Liu Liwei Wang S. Du OffRL 27 8 0 03 Feb 2023
Evaluating COVID-19 vaccine allocation policies using Bayesian $m$ -top exploration Alexandra Cimpean T. Verstraeten L. Willem N. Hens Ann Nowé Pieter J. K. Libin 21 2 0 30 Jan 2023
Dynamical Linear Bandits Marco Mussi Alberto Maria Metelli Marcello Restelli 38 2 0 16 Nov 2022
Learning in Stackelberg Games with Non-myopic Agents Nika Haghtalab Thodoris Lykouris Sloan Nietert Alexander Wei 15 29 0 19 Aug 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization Quan-Wu Xiao Qing Ling Tianyi Chen 41 0 0 14 Jun 2022
Partial Likelihood Thompson Sampling Han Wu Stefan Wager LM&MA 30 1 0 02 Mar 2022
Thompson Sampling with Unrestricted Delays Hang Wu Stefan Wager 32 7 0 24 Feb 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback Tiancheng Jin Tal Lancewicki Haipeng Luo Yishay Mansour Aviv A. Rosenberg 71 21 0 31 Jan 2022
Nonstochastic Bandits with Composite Anonymous Feedback Nicolò Cesa-Bianchi Tommaso Cesari Roberto Colomboni Claudio Gentile Yishay Mansour 108 39 0 06 Dec 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback Tal Lancewicki Aviv A. Rosenberg Yishay Mansour 30 32 0 29 Dec 2020
Learning-NUM: Network Utility Maximization with Unknown Utility Functions and Queueing Delay Xinzhe Fu E. Modiano 11 18 0 16 Dec 2020