Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.13807
Cited By
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
27 November 2021
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization"
21 / 21 papers shown
Title
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction
Anni Zhou
Raheem Beyah
Rishikesan Kamaleswaran
41
0
0
20 Mar 2025
Online Clustering of Dueling Bandits
Zhiyong Wang
Jiahang Sun
Mingze Kong
Jize Xie
Qinghua Hu
J. C. Lui
Zhongxiang Dai
83
0
0
04 Feb 2025
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
45
0
0
15 Jun 2024
Learning Decision Policies with Instrumental Variables through Double Machine Learning
Daqian Shao
Ashkan Soleymani
Francesco Quinzan
Marta Z. Kwiatkowska
36
1
0
14 May 2024
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
Minshuo Chen
Song Mei
Jianqing Fan
Mengdi Wang
VLM
MedIm
DiffM
37
48
0
11 Apr 2024
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
34
9
0
20 Mar 2024
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
Thanh Nguyen-Tang
Raman Arora
OffRL
25
3
0
06 Jan 2024
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with
ε
ε
ε
-Greedy Exploration
Shuai Zhang
Hongkang Li
Meng Wang
Miao Liu
Pin-Yu Chen
Songtao Lu
Sijia Liu
K. Murugesan
Subhajit Chaudhury
32
19
0
24 Oct 2023
On the Disconnect Between Theory and Practice of Neural Networks: Limits of the NTK Perspective
Jonathan Wenger
Felix Dangel
Agustinus Kristiadi
25
0
0
29 Sep 2023
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Minshuo Chen
Mengdi Wang
DiffM
12
34
0
13 Jul 2023
Optimal Best-Arm Identification in Bandits with Access to Offline Data
Shubhada Agrawal
Sandeep Juneja
Karthikeyan Shanmugam
A. Suggala
22
4
0
15 Jun 2023
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits
Lequn Wang
A. Krishnamurthy
Aleksandrs Slivkins
OffRL
33
8
0
13 Jun 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Thanh Nguyen-Tang
R. Arora
OffRL
38
5
0
24 Feb 2023
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Thanh Nguyen-Tang
Ming Yin
Sunil R. Gupta
Svetha Venkatesh
R. Arora
OffRL
50
15
0
23 Nov 2022
Data-Driven Offline Decision-Making via Invariant Representation Learning
Qi
Yi-Hsun Su
Aviral Kumar
Sergey Levine
OffRL
27
19
0
21 Nov 2022
PAC-Bayesian Offline Contextual Bandits With Guarantees
Otmane Sakhi
Pierre Alquier
Nicolas Chopin
OffRL
19
12
0
24 Oct 2022
Text Summarization with Oracle Expectation
Yumo Xu
Mirella Lapata
VLM
14
4
0
26 Sep 2022
Federated Neural Bandits
Zhongxiang Dai
Yao Shu
Arun Verma
Flint Xiaofeng Fan
Bryan Kian Hsiang Low
P. Jaillet
FedML
21
13
0
28 May 2022
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks
Thanh Nguyen-Tang
Sunil R. Gupta
Hung The Tran
Svetha Venkatesh
OffRL
52
7
0
11 Mar 2021
On the Proof of Global Convergence of Gradient Descent for Deep ReLU Networks with Linear Widths
Quynh N. Nguyen
33
49
0
24 Jan 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020
1