ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.13807
  4. Cited By
Offline Neural Contextual Bandits: Pessimism, Optimization and
  Generalization

Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization

27 November 2021
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
    OffRL
ArXivPDFHTML

Papers citing "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization"

21 / 21 papers shown
Title
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction
Anni Zhou
Raheem Beyah
Rishikesan Kamaleswaran
41
0
0
20 Mar 2025
Online Clustering of Dueling Bandits
Online Clustering of Dueling Bandits
Zhiyong Wang
Jiahang Sun
Mingze Kong
Jize Xie
Qinghua Hu
J. C. Lui
Zhongxiang Dai
83
0
0
04 Feb 2025
Graph Neural Thompson Sampling
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
45
0
0
15 Jun 2024
Learning Decision Policies with Instrumental Variables through Double
  Machine Learning
Learning Decision Policies with Instrumental Variables through Double Machine Learning
Daqian Shao
Ashkan Soleymani
Francesco Quinzan
Marta Z. Kwiatkowska
36
1
0
14 May 2024
An Overview of Diffusion Models: Applications, Guided Generation,
  Statistical Rates and Optimization
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
Minshuo Chen
Song Mei
Jianqing Fan
Mengdi Wang
VLM
MedIm
DiffM
37
48
0
11 Apr 2024
Diffusion Model for Data-Driven Black-Box Optimization
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
34
9
0
20 Mar 2024
On Sample-Efficient Offline Reinforcement Learning: Data Diversity,
  Posterior Sampling, and Beyond
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
Thanh Nguyen-Tang
Raman Arora
OffRL
25
3
0
06 Jan 2024
On the Convergence and Sample Complexity Analysis of Deep Q-Networks
  with $ε$-Greedy Exploration
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with εεε-Greedy Exploration
Shuai Zhang
Hongkang Li
Meng Wang
Miao Liu
Pin-Yu Chen
Songtao Lu
Sijia Liu
K. Murugesan
Subhajit Chaudhury
32
19
0
24 Oct 2023
On the Disconnect Between Theory and Practice of Neural Networks: Limits
  of the NTK Perspective
On the Disconnect Between Theory and Practice of Neural Networks: Limits of the NTK Perspective
Jonathan Wenger
Felix Dangel
Agustinus Kristiadi
25
0
0
29 Sep 2023
Reward-Directed Conditional Diffusion: Provable Distribution Estimation
  and Reward Improvement
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Minshuo Chen
Mengdi Wang
DiffM
12
34
0
13 Jul 2023
Optimal Best-Arm Identification in Bandits with Access to Offline Data
Optimal Best-Arm Identification in Bandits with Access to Offline Data
Shubhada Agrawal
Sandeep Juneja
Karthikeyan Shanmugam
A. Suggala
22
4
0
15 Jun 2023
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual
  Bandits
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits
Lequn Wang
A. Krishnamurthy
Aleksandrs Slivkins
OffRL
33
8
0
13 Jun 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function
  Approximation
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Thanh Nguyen-Tang
R. Arora
OffRL
38
5
0
24 Feb 2023
On Instance-Dependent Bounds for Offline Reinforcement Learning with
  Linear Function Approximation
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Thanh Nguyen-Tang
Ming Yin
Sunil R. Gupta
Svetha Venkatesh
R. Arora
OffRL
50
15
0
23 Nov 2022
Data-Driven Offline Decision-Making via Invariant Representation
  Learning
Data-Driven Offline Decision-Making via Invariant Representation Learning
Qi
Yi-Hsun Su
Aviral Kumar
Sergey Levine
OffRL
27
19
0
21 Nov 2022
PAC-Bayesian Offline Contextual Bandits With Guarantees
PAC-Bayesian Offline Contextual Bandits With Guarantees
Otmane Sakhi
Pierre Alquier
Nicolas Chopin
OffRL
19
12
0
24 Oct 2022
Text Summarization with Oracle Expectation
Text Summarization with Oracle Expectation
Yumo Xu
Mirella Lapata
VLM
14
4
0
26 Sep 2022
Federated Neural Bandits
Federated Neural Bandits
Zhongxiang Dai
Yao Shu
Arun Verma
Flint Xiaofeng Fan
Bryan Kian Hsiang Low
P. Jaillet
FedML
21
13
0
28 May 2022
Sample Complexity of Offline Reinforcement Learning with Deep ReLU
  Networks
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks
Thanh Nguyen-Tang
Sunil R. Gupta
Hung The Tran
Svetha Venkatesh
OffRL
52
7
0
11 Mar 2021
On the Proof of Global Convergence of Gradient Descent for Deep ReLU
  Networks with Linear Widths
On the Proof of Global Convergence of Gradient Descent for Deep ReLU Networks with Linear Widths
Quynh N. Nguyen
33
49
0
24 Jan 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020
1