Online Stochastic Optimization under Correlated Bandit Feedback

v1v2v3 (latest)

Online Stochastic Optimization under Correlated Bandit Feedback

4 February 2014

ArXiv (abs)PDF HTML

Papers citing "Online Stochastic Optimization under Correlated Bandit Feedback"

11 / 11 papers shown

Title
Parameter-Free Algorithms for Performative Regret Minimization under Decision-Dependent Distributions Sungwoo Park Junyeop Kwon Byeongnoh Kim Suhyun Chae Jeeyong Lee Dabeen Lee 76 0 0 23 Feb 2024
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook Baihan Lin OffRL AI4TS 127 27 0 24 Oct 2022
Online Learning Demands in Max-min Fairness Kirthevasan Kandasamy Gur-Eyal Sela Joseph E. Gonzalez Michael I. Jordan Ion Stoica FaML 26 15 0 15 Dec 2020
Hidden Incentives for Auto-Induced Distributional Shift David M. Krueger Tegan Maharaj Jan Leike 80 52 0 19 Sep 2020
Zooming for Efficient Model-Free Reinforcement Learning in Metric Spaces Ahmed Touati Adrien Ali Taïga Marc G. Bellemare 71 19 0 09 Mar 2020
Bayesian Optimization under Heavy-tailed Payoffs Sayak Ray Chowdhury Aditya Gopalan 65 27 0 16 Sep 2019
Introduction to Multi-Armed Bandits Aleksandrs Slivkins 677 1,024 0 15 Apr 2019
On Kernelized Multi-armed Bandits Sayak Ray Chowdhury Aditya Gopalan 129 464 0 03 Apr 2017
Simple regret for infinitely many armed bandits Alexandra Carpentier Michal Valko 239 89 0 18 May 2015
Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards Omar Besbes Y. Gur A. Zeevi 88 127 0 13 May 2014
Bandits and Experts in Metric Spaces Robert D. Kleinberg Aleksandrs Slivkins E. Upfal 193 125 0 04 Dec 2013