Neural Thompson Sampling

2 October 2020

Quanquan Gu

Papers citing "Neural Thompson Sampling"

19 / 19 papers shown

Title | Authors | Tag | Counts | Date
Neural Logistic Bandits | Seoungbin Bae, Dabeen Lee | - | 127, 0, 0 | 04 May 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits | H. Bui, Enrique Mallada, Anqi Liu | - | 97, 0, 0 | 08 Nov 2024
Batched Bayesian optimization by maximizing the probability of including the optimum | Jenna C. Fromer, Runzhong Wang, Mrunali Manjrekar, Austin Tripp, José Miguel Hernández-Lobato, Connor W. Coley | - | 47, 0, 0 | 08 Oct 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback | Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, P. Jaillet, K. H. Low | - | 32, 5, 0 | 24 Jul 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions | Kai Xu, Farid Tajaddodianfar, Ben Allison | - | 21, 0, 0 | 16 Jun 2024
Graph Neural Thompson Sampling | Shuang Wu, Arash A. Amini | - | 45, 0, 0 | 15 Jun 2024
VITS: Variational Inference Thompson Sampling for contextual bandits | Pierre Clavier, Tom Huix, Alain Durmus | - | 25, 3, 0 | 19 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Nicklas Werge, Abdullah Akgul, M. Kandemir | - | 35, 0, 0 | 07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits | Yikun Ban, Yuchen Yan, A. Banerjee, Jingrui He | - | 34, 8, 0 | 05 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits | Do June Min, A. Stolcke, A. Raju, Colin Vaz, Di He, Venkatesh Ravichandran, V. Trinh | OffRL | 27, 0, 0 | 23 Mar 2023
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning | Christoph Dann, M. Mohri, Tong Zhang, Julian Zimmert | OffRL | 16, 32, 0 | 23 Aug 2022
Graph Neural Network Bandits | Parnian Kassraie, Andreas Krause, Ilija Bogunovic | - | 26, 11, 0 | 13 Jul 2022
POEM: Out-of-Distribution Detection with Posterior Sampling | Yifei Ming, Ying Fan, Yixuan Li | OODD | 27, 113, 0 | 28 Jun 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework | Ziyi Huang, H. Lam, A. Meisami, Haofeng Zhang | - | 34, 4, 0 | 31 Jan 2022
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits | Yikun Ban, Yuchen Yan, A. Banerjee, Jingrui He | OffRL | 29, 39, 0 | 07 Oct 2021
Deep Exploration for Recommendation Systems | Zheqing Zhu, Benjamin Van Roy | - | 29, 11, 0 | 26 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits | Sattar Vakili, N. Bouziani, Sepehr Jalali, A. Bernacchia, Da-shan Shiu | - | 29, 51, 0 | 20 Aug 2021
Neural Active Learning with Performance Guarantees | Pranjal Awasthi, Christoph Dann, Claudio Gentile, Ayush Sekhari, Zhilei Wang | - | 24, 22, 0 | 06 Jun 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching | Ofir Nabati, Tom Zahavy, Shie Mannor | - | 19, 18, 0 | 07 Feb 2021