ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.00827
  4. Cited By
Neural Thompson Sampling

Neural Thompson Sampling

2 October 2020
Weitong Zhang
Dongruo Zhou
Lihong Li
Quanquan Gu
ArXivPDFHTML

Papers citing "Neural Thompson Sampling"

19 / 19 papers shown
Title
Neural Logistic Bandits
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
127
0
0
04 May 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
97
0
0
08 Nov 2024
Batched Bayesian optimization by maximizing the probability of including the optimum
Batched Bayesian optimization by maximizing the probability of including the optimum
Jenna C. Fromer
Runzhong Wang
Mrunali Manjrekar
Austin Tripp
José Miguel Hernández-Lobato
Connor W. Coley
47
0
0
08 Oct 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
P. Jaillet
K. H. Low
32
5
0
24 Jul 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using
  Normalized Weight Functions
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
21
0
0
16 Jun 2024
Graph Neural Thompson Sampling
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
45
0
0
15 Jun 2024
VITS : Variational Inference Thompson Sampling for contextual bandits
VITS : Variational Inference Thompson Sampling for contextual bandits
Pierre Clavier
Tom Huix
Alain Durmus
25
3
0
19 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary
  Contextual Bandits
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
35
0
0
07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
34
8
0
05 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Do June Min
A. Stolcke
A. Raju
Colin Vaz
Di He
Venkatesh Ravichandran
V. Trinh
OffRL
27
0
0
23 Mar 2023
A Provably Efficient Model-Free Posterior Sampling Method for Episodic
  Reinforcement Learning
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
Christoph Dann
M. Mohri
Tong Zhang
Julian Zimmert
OffRL
16
32
0
23 Aug 2022
Graph Neural Network Bandits
Graph Neural Network Bandits
Parnian Kassraie
Andreas Krause
Ilija Bogunovic
26
11
0
13 Jul 2022
POEM: Out-of-Distribution Detection with Posterior Sampling
POEM: Out-of-Distribution Detection with Posterior Sampling
Yifei Ming
Ying Fan
Yixuan Li
OODD
27
113
0
28 Jun 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error:
  An Enhanced Bayesian Upper Confidence Bound Framework
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework
Ziyi Huang
H. Lam
A. Meisami
Haofeng Zhang
34
4
0
31 Jan 2022
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
OffRL
29
39
0
07 Oct 2021
Deep Exploration for Recommendation Systems
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
29
11
0
26 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-shan Shiu
29
51
0
20 Aug 2021
Neural Active Learning with Performance Guarantees
Neural Active Learning with Performance Guarantees
Pranjal Awasthi
Christoph Dann
Claudio Gentile
Ayush Sekhari
Zhilei Wang
24
22
0
06 Jun 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Ofir Nabati
Tom Zahavy
Shie Mannor
19
18
0
07 Feb 2021
1