arXiv:1702.06917
Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe
22 February 2017
Quentin Berthet
Vianney Perchet
Papers citing "Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe"
5 / 5 papers shown
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
57
3
0
07 Nov 2024
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
45
5
0
29 Jan 2023
Active Model Estimation in Markov Decision Processes
Jean Tarbouriech
S. Shekhar
Matteo Pirotta
Mohammad Ghavamzadeh
A. Lazaric
8
24
0
06 Mar 2020
Kernel-based methods for bandit convex optimization
Sébastien Bubeck
Ronen Eldan
Y. Lee
76
163
0
11 Jul 2016
Detection of Planted Solutions for Flat Satisfiability Problems
Quentin Berthet
J. Ellenberg
27
6
0
21 Feb 2015