Offline Contextual Bandits with Overparameterized Models

27 June 2020
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
    OffRL

Papers citing "Offline Contextual Bandits with Overparameterized Models"

9 / 9 papers shown
Evaluating and Learning Robust Bandit Policies Under Uncertain Causal Mechanisms
Katherine Avery
Chinmay Pendse
David D. Jensen
CML
144
0
0
04 Aug 2025
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
Ruiquan Huang
Donghao Li
Chengshuai Shi
Cong Shen
Jing Yang
OffRL
509
0
0
01 Jul 2025
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction
Anni Zhou
Raheem Beyah
Rishikesan Kamaleswaran
337
1
0
20 Mar 2025
Asymptotically Optimal Regret for Black-Box Predict-then-Optimize
Samuel Tan
P. Frazier
193
1
0
12 Jun 2024
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
337
22
0
20 Mar 2024
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement
Neural Information Processing Systems (NeurIPS), 2023
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Minshuo Chen
Mengdi Wang
DiffM
310
52
0
13 Jul 2023
PAC-Bayesian Offline Contextual Bandits With Guarantees
International Conference on Machine Learning (ICML), 2022
Otmane Sakhi
Pierre Alquier
Nicolas Chopin
OffRL
473
23
0
24 Oct 2022
Offline Policy Optimization with Eligible Actions
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Yao Liu
Yannis Flet-Berliac
Emma Brunskill
OffRL
195
6
0
01 Jul 2022
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
International Conference on Learning Representations (ICLR), 2021
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
OffRL
259
35
0
27 Nov 2021