1402.0562
Online Stochastic Optimization under Correlated Bandit Feedback
4 February 2014
M. G. Azar
A. Lazaric
Emma Brunskill
OffRL
Papers citing "Online Stochastic Optimization under Correlated Bandit Feedback"
11 / 11 papers shown
Title
Parameter-Free Algorithms for Performative Regret Minimization under Decision-Dependent Distributions
Sungwoo Park
Junyeop Kwon
Byeongnoh Kim
Suhyun Chae
Jeeyong Lee
Dabeen Lee
76
0
0
23 Feb 2024
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRL
AI4TS
127
27
0
24 Oct 2022
Online Learning Demands in Max-min Fairness
Kirthevasan Kandasamy
Gur-Eyal Sela
Joseph E. Gonzalez
Michael I. Jordan
Ion Stoica
FaML
26
15
0
15 Dec 2020
Hidden Incentives for Auto-Induced Distributional Shift
David M. Krueger
Tegan Maharaj
Jan Leike
80
52
0
19 Sep 2020
Zooming for Efficient Model-Free Reinforcement Learning in Metric Spaces
Ahmed Touati
Adrien Ali Taïga
Marc G. Bellemare
71
19
0
09 Mar 2020
Bayesian Optimization under Heavy-tailed Payoffs
Sayak Ray Chowdhury
Aditya Gopalan
65
27
0
16 Sep 2019
Introduction to Multi-Armed Bandits
Aleksandrs Slivkins
677
1,024
0
15 Apr 2019
On Kernelized Multi-armed Bandits
Sayak Ray Chowdhury
Aditya Gopalan
129
464
0
03 Apr 2017
Simple regret for infinitely many armed bandits
Alexandra Carpentier
Michal Valko
239
89
0
18 May 2015
Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards
Omar Besbes
Y. Gur
A. Zeevi
88
127
0
13 May 2014
Bandits and Experts in Metric Spaces
Robert D. Kleinberg
Aleksandrs Slivkins
E. Upfal
193
125
0
04 Dec 2013