Neural Contextual Bandits with Deep Representation and Shallow Exploration

International Conference on Learning Representations (ICLR), 2020

3 December 2020

Quanquan Gu

Papers citing "Neural Contextual Bandits with Deep Representation and Shallow Exploration"

50 / 64 papers shown

Provable Anytime Ensemble Sampling Algorithms in Nonlinear Contextual Bandits

Jiazheng Sun

Weixin Wang

Pan Xu

200

12 Oct 2025

Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference

214

24 Sep 2025

Feel-Good Thompson Sampling for Contextual Bandits: a Markov Chain Monte Carlo Showdown

Emile Anand

Sarah Liaw

340

21 Jul 2025

Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of PlasticityKnowledge Discovery and Data Mining (KDD), 2025

Zhiyuan Su

Sunhao Dai

Xiao Zhang

333

14 Jun 2025

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration

273

02 Jun 2025

In-Domain African Languages Translation Using LLMs and Multi-armed Bandits

240

21 May 2025

Neural Logistic Bandits

Seoungbin Bae

Dabeen Lee

1.1K

04 May 2025

Active Human Feedback Collection via Neural Contextual Dueling Bandits

Bryan Kian Hsiang Low

361

16 Apr 2025

Neural Contextual Bandits Under Delayed Feedback Constraints

Mohammadali Moghimi

Sharu Theresa Jose

Shana Moothedath

404

16 Apr 2025

CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning

Yexin Li

OffRL

479

23 Mar 2025

Online Clustering of Dueling Bandits

341

04 Feb 2025

A Metric Topology of Deep Learning for Data Classification

421

20 Jan 2025

Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning

Akhila Vangara

Alex Egg

OffRL

268

30 Nov 2024

Variance-Aware Linear UCB with Deep Representation for Neural Contextual BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

H. Bui

Enrique Mallada

Anqi Liu

1.2K

08 Nov 2024

PageRank Bandits for Link PredictionNeural Information Processing Systems (NeurIPS), 2024

437

03 Nov 2024

The Digital Transformation in Health: How AI Can Improve the Performance of Health SystemsHealth systems and reform (HSR), 2024

África Periánez

Ana Fernández del Río

276

24 Sep 2024

Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx

Ana Fernández del Río

Michael Brennan Leong

275

15 Aug 2024

Adaptive Behavioral AI: Reinforcement Learning to Enhance Pharmacy Services

Ana Fernández del Río

Michael Brennan Leong

205

14 Aug 2024

Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings

Ana Fernández del Río

215

14 Aug 2024

Meta Clustering of Neural BanditsKnowledge Discovery and Data Mining (KDD), 2024

440

10 Aug 2024

A Contextual Combinatorial Bandit Approach to Negotiation

Yexin Li

Zhancun Mu

Siyuan Qi

289

30 Jun 2024

Graph Neural Thompson Sampling

Shuang Wu

Arash A. Amini

420

15 Jun 2024

Towards Domain Adaptive Neural Contextual Bandits

Ziyan Wang

Hao Wang

496

13 Jun 2024

Uncertainty of Joint Neural Contextual Bandit

Hongbo Guo

Zheqing Zhu

356

04 Jun 2024

Neural Active Learning Meets the Partial Monitoring FrameworkConference on Uncertainty in Artificial Intelligence (UAI), 2024

M. Heuillet

Ola Ahmad

Audrey Durand

266

14 May 2024

Stochastic Bandits with ReLU Neural Networks

331

12 May 2024

Efficient Online Set-valued Classification with Bandit Feedback

Zhou Wang

Xingye Qiao

OffRL

296

07 May 2024

Active Preference Learning for Ordering Items In- and Out-of-sampleNeural Information Processing Systems (NeurIPS), 2024

Herman Bergström

Emil Carlsson

Devdatt Dubhashi

Fredrik D. Johansson

301

05 May 2024

ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease TreatmentInternational Conference on Cyber-Physical Systems (ICCPS), 2024

Hao-Lun Hsu

Qitong Gao

Miroslav Pajic

316

11 Mar 2024

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

Xiaoying Zhang

Jean-François Ton

Wei Shen

Hongning Wang

Yang Liu

197

08 Mar 2024

FLASH: Federated Learning Across Simultaneous Heterogeneities

Amit K. Roy-Chowdhury

FedML

383

13 Feb 2024

Randomized Confidence Bounds for Stochastic Partial Monitoring

M. Heuillet

Ola Ahmad

Audrey Durand

426

07 Feb 2024

Tree Search-Based Evolutionary Bandits for Protein Sequence OptimizationAAAI Conference on Artificial Intelligence (AAAI), 2024

Hui Yuan

Mengdi Wang

298

08 Jan 2024

Risk-Aware Continuous Control with Neural Contextual BanditsAAAI Conference on Artificial Intelligence (AAAI), 2023

J. Ayala-Romero

A. Garcia-Saavedra

Xavier Pérez Costa

269

15 Dec 2023

Pearl: A Production-ready Reinforcement Learning Agent

Zheqing Zhu

Rodrigo de Salvo Braz

...

406

06 Dec 2023

Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling

286

11 Oct 2023

Doubly High-Dimensional Contextual Bandits: An Interpretable Model for Joint Assortment-PricingSocial Science Research Network (SSRN), 2023

282

14 Sep 2023

Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem

283

15 Aug 2023

VITS : Variational Inference Thompson Sampling for contextual banditsInternational Conference on Machine Learning (ICML), 2023

Pierre Clavier

Tom Huix

Alain Durmus

506

19 Jul 2023

Scalable Neural Contextual Bandit for Recommender SystemsInternational Conference on Information and Knowledge Management (CIKM), 2023

Zheqing Zhu

Benjamin Van Roy

OffRL

401

26 Jun 2023

Representation-Driven Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023

Ofir Nabati

Guy Tennenholtz

Shie Mannor

353

31 May 2023

Learning Personalized Page Content Ranking Using Customer Representation

214

09 May 2023

Neural Exploitation and Exploration of Contextual Bandits

267

05 May 2023

Evaluating Online Bandit Exploration In Large-Scale Recommender System

267

05 Apr 2023

Uncertainty-Aware Instance Reweighting for Off-Policy LearningNeural Information Processing Systems (NeurIPS), 2023

Yang Liu

309

11 Mar 2023

Neural-BO: A Black-box Optimization Algorithm using Deep Neural NetworksNeurocomputing (Neurocomputing), 2023

Dat Phan-Trong

Hung The Tran

Sunil R. Gupta

334

03 Mar 2023

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret GuaranteesNeural Information Processing Systems (NeurIPS), 2022

309

24 Oct 2022

Sample-Then-Optimize Batch Neural Thompson SamplingNeural Information Processing Systems (NeurIPS), 2022

Zhongxiang Dai

Yao Shu

Bryan Kian Hsiang Low

Patrick Jaillet

AAML

228

13 Oct 2022

Hierarchical Conversational Preference Elicitation with Bandit FeedbackInternational Conference on Information and Knowledge Management (CIKM), 2022

Shuai Li

346

06 Sep 2022

Neural Design for Genetic Perturbation ExperimentsInternational Conference on Learning Representations (ICLR), 2022

330

26 Jul 2022