Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe

22 February 2017

Papers citing "Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe"

5 / 5 papers shown

Title
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF Heyang Zhao Chenlu Ye Quanquan Gu Tong Zhang OffRL 57 3 0 07 Nov 2024
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits Subhojyoti Mukherjee Qiaomin Xie Josiah P. Hanna R. Nowak OffRL 45 5 0 29 Jan 2023
Active Model Estimation in Markov Decision Processes Jean Tarbouriech S. Shekhar Matteo Pirotta Mohammad Ghavamzadeh A. Lazaric 8 24 0 06 Mar 2020
Kernel-based methods for bandit convex optimization Sébastien Bubeck Ronen Eldan Y. Lee 76 163 0 11 Jul 2016
Detection of Planted Solutions for Flat Satisfiability Problems Quentin Berthet J. Ellenberg 27 6 0 21 Feb 2015