2010.00827
Cited By
Neural Thompson Sampling
2 October 2020
Weitong Zhang
Dongruo Zhou
Lihong Li
Quanquan Gu
Papers citing "Neural Thompson Sampling"
19 / 19 papers shown
Title
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
127
0
0
04 May 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
97
0
0
08 Nov 2024
Batched Bayesian optimization by maximizing the probability of including the optimum
Jenna C. Fromer
Runzhong Wang
Mrunali Manjrekar
Austin Tripp
José Miguel Hernández-Lobato
Connor W. Coley
47
0
0
08 Oct 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
P. Jaillet
K. H. Low
32
5
0
24 Jul 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
21
0
0
16 Jun 2024
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
45
0
0
15 Jun 2024
VITS : Variational Inference Thompson Sampling for contextual bandits
Pierre Clavier
Tom Huix
Alain Durmus
25
3
0
19 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
35
0
0
07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
34
8
0
05 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Do June Min
A. Stolcke
A. Raju
Colin Vaz
Di He
Venkatesh Ravichandran
V. Trinh
OffRL
27
0
0
23 Mar 2023
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
Christoph Dann
M. Mohri
Tong Zhang
Julian Zimmert
OffRL
16
32
0
23 Aug 2022
Graph Neural Network Bandits
Parnian Kassraie
Andreas Krause
Ilija Bogunovic
26
11
0
13 Jul 2022
POEM: Out-of-Distribution Detection with Posterior Sampling
Yifei Ming
Ying Fan
Yixuan Li
OODD
27
113
0
28 Jun 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework
Ziyi Huang
H. Lam
A. Meisami
Haofeng Zhang
34
4
0
31 Jan 2022
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
OffRL
29
39
0
07 Oct 2021
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
29
11
0
26 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-shan Shiu
29
51
0
20 Aug 2021
Neural Active Learning with Performance Guarantees
Pranjal Awasthi
Christoph Dann
Claudio Gentile
Ayush Sekhari
Zhilei Wang
24
22
0
06 Jun 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Ofir Nabati
Tom Zahavy
Shie Mannor
19
18
0
07 Feb 2021