Improved Regret Analysis for Variance-Adaptive Linear Bandits and
Horizon-Free Linear Mixture MDPs

Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs

5 November 2021

Insoon Yang

Papers citing "Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs"

12 / 12 papers shown

Title
Neural Logistic Bandits Seoungbin Bae Dabeen Lee 231 0 0 04 May 2025
Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance S. Iwazaki Shion Takeno 83 1 0 10 Feb 2025
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards Chenlu Ye Yujia Jin Alekh Agarwal Tong Zhang 112 0 0 04 Feb 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits H. Bui Enrique Mallada Anqi Liu 207 0 0 08 Nov 2024
Second Order Bounds for Contextual Bandits with Function Approximation Aldo Pacchiano 64 4 0 24 Sep 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits Junghyun Lee Se-Young Yun Kwang-Sung Jun 40 4 0 19 Jul 2024
Horizon-Free Regret for Linear Markov Decision Processes Zihan Zhang Jason D. Lee Yuxin Chen Simon S. Du 33 3 0 15 Mar 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits Zhiyong Wang Jize Xie Yi Chen J. C. Lui Dongruo Zhou 28 0 0 15 Mar 2024
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments Runlong Zhou Zihan Zhang S. Du 44 10 0 31 Jan 2023
Policy Optimization for Stochastic Shortest Path Liyu Chen Haipeng Luo Aviv A. Rosenberg 21 12 0 07 Feb 2022
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach Andrew Wagenmaker Yifang Chen Max Simchowitz S. Du Kevin G. Jamieson 73 37 0 07 Dec 2021
Optimism in Reinforcement Learning with Generalized Linear Function Approximation Yining Wang Ruosong Wang S. Du A. Krishnamurthy 137 135 0 09 Dec 2019