Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.01780
Cited By
Neural Contextual Bandits with Deep Representation and Shallow Exploration
International Conference on Learning Representations (ICLR), 2020
3 December 2020
Pan Xu
Zheng Wen
Handong Zhao
Quanquan Gu
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Contextual Bandits with Deep Representation and Shallow Exploration"
50 / 64 papers shown
Provable Anytime Ensemble Sampling Algorithms in Nonlinear Contextual Bandits
Jiazheng Sun
Weixin Wang
Pan Xu
200
1
0
12 Oct 2025
Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference
Ziyi Han
Xutong Liu
Ruiting Zhou
Xiangxiang Dai
J. C. Lui
MoMe
MoE
214
0
0
24 Sep 2025
Feel-Good Thompson Sampling for Contextual Bandits: a Markov Chain Monte Carlo Showdown
Emile Anand
Sarah Liaw
340
4
0
21 Jul 2025
Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of Plasticity
Knowledge Discovery and Data Mining (KDD), 2025
Zhiyuan Su
Sunhao Dai
Xiao Zhang
333
1
0
14 Jun 2025
Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration
Youngmin Oh
J. Park
Taejin Paik
Jaemin Park
273
1
0
02 Jun 2025
In-Domain African Languages Translation Using LLMs and Multi-armed Bandits
Pratik Rakesh Singh
Kritarth Prasad
Mohammadi Zaki
Pankaj Wasnik
240
1
0
21 May 2025
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
1.1K
2
0
04 May 2025
Active Human Feedback Collection via Neural Contextual Dueling Bandits
Arun Verma
Xiaoqiang Lin
Zhongxiang Dai
Daniela Rus
Bryan Kian Hsiang Low
361
4
0
16 Apr 2025
Neural Contextual Bandits Under Delayed Feedback Constraints
Mohammadali Moghimi
Sharu Theresa Jose
Shana Moothedath
404
0
0
16 Apr 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
OffRL
479
2
0
23 Mar 2025
Online Clustering of Dueling Bandits
Zhiyong Wang
Jiahang Sun
Mingze Kong
Jize Xie
Qinghua Hu
J. C. Lui
Zhongxiang Dai
341
0
0
04 Feb 2025
A Metric Topology of Deep Learning for Data Classification
Jwo-Yuh Wu
L. Huang
Wen-Hsuan Li
Chun-Hung Liu
421
0
0
20 Jan 2025
Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning
Akhila Vangara
Alex Egg
OffRL
268
0
0
30 Nov 2024
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
H. Bui
Enrique Mallada
Anqi Liu
1.2K
4
0
08 Nov 2024
PageRank Bandits for Link Prediction
Neural Information Processing Systems (NeurIPS), 2024
Yikun Ban
Jiaru Zou
Zihao Li
Yunzhe Qi
Dongqi Fu
Jian Kang
Hanghang Tong
Jingrui He
437
17
0
03 Nov 2024
The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems
Health systems and reform (HSR), 2024
África Periánez
Ana Fernández del Río
Ivan Nazarov
Enric Jané
Moiz Hassan
Aditya Rastogi
Dexian Tang
276
22
0
24 Sep 2024
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
OnRL
275
3
0
15 Aug 2024
Adaptive Behavioral AI: Reinforcement Learning to Enhance Pharmacy Services
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
205
7
0
14 Aug 2024
Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings
África Periánez
Kathrin Schmitz
Lazola Makhupula
Moiz Hassan
Moeti Moleko
Ana Fernández del Río
Ivan Nazarov
Aditya Rastogi
Dexian Tang
OffRL
215
0
0
14 Aug 2024
Meta Clustering of Neural Bandits
Knowledge Discovery and Data Mining (KDD), 2024
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
440
13
0
10 Aug 2024
A Contextual Combinatorial Bandit Approach to Negotiation
Yexin Li
Zhancun Mu
Siyuan Qi
289
3
0
30 Jun 2024
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
420
1
0
15 Jun 2024
Towards Domain Adaptive Neural Contextual Bandits
Ziyan Wang
Hao Wang
Hao Wang
496
1
0
13 Jun 2024
Uncertainty of Joint Neural Contextual Bandit
Hongbo Guo
Zheqing Zhu
356
0
0
04 Jun 2024
Neural Active Learning Meets the Partial Monitoring Framework
Conference on Uncertainty in Artificial Intelligence (UAI), 2024
M. Heuillet
Ola Ahmad
Audrey Durand
266
1
0
14 May 2024
Stochastic Bandits with ReLU Neural Networks
Kan Xu
Hamsa Bastani
Surbhi Goel
Osbert Bastani
331
1
0
12 May 2024
Efficient Online Set-valued Classification with Bandit Feedback
Zhou Wang
Xingye Qiao
OffRL
296
1
0
07 May 2024
Active Preference Learning for Ordering Items In- and Out-of-sample
Neural Information Processing Systems (NeurIPS), 2024
Herman Bergström
Emil Carlsson
Devdatt Dubhashi
Fredrik D. Johansson
301
6
0
05 May 2024
ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment
International Conference on Cyber-Physical Systems (ICCPS), 2024
Hao-Lun Hsu
Qitong Gao
Miroslav Pajic
316
2
0
11 Mar 2024
Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Xiaoying Zhang
Jean-François Ton
Wei Shen
Hongning Wang
Yang Liu
197
24
0
08 Mar 2024
FLASH: Federated Learning Across Simultaneous Heterogeneities
Xiangyu Chang
Sk. Miraj Ahmed
S. Krishnamurthy
Başak Güler
A. Swami
Samet Oymak
Amit K. Roy-Chowdhury
FedML
383
4
0
13 Feb 2024
Randomized Confidence Bounds for Stochastic Partial Monitoring
M. Heuillet
Ola Ahmad
Audrey Durand
426
2
0
07 Feb 2024
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jiahao Qiu
Hui Yuan
Jinghong Zhang
Wentao Chen
Huazheng Wang
Mengdi Wang
298
3
0
08 Jan 2024
Risk-Aware Continuous Control with Neural Contextual Bandits
AAAI Conference on Artificial Intelligence (AAAI), 2023
J. Ayala-Romero
A. Garcia-Saavedra
Xavier Pérez Costa
269
4
0
15 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLM
OffRL
OnRL
406
13
0
06 Dec 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
286
1
0
11 Oct 2023
Doubly High-Dimensional Contextual Bandits: An Interpretable Model for Joint Assortment-Pricing
Social Science Research Network (SSRN), 2023
Junhui Cai
Ran Chen
Martin J. Wainwright
Linda H. Zhao
282
7
0
14 Sep 2023
Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem
Elena Gal
Shaun Singh
Aldo Pacchiano
Benjamin Walker
Terry Lyons
Jakob N. Foerster
FaML
283
0
0
15 Aug 2023
VITS : Variational Inference Thompson Sampling for contextual bandits
International Conference on Machine Learning (ICML), 2023
Pierre Clavier
Tom Huix
Alain Durmus
506
6
0
19 Jul 2023
Scalable Neural Contextual Bandit for Recommender Systems
International Conference on Information and Knowledge Management (CIKM), 2023
Zheqing Zhu
Benjamin Van Roy
OffRL
401
15
0
26 Jun 2023
Representation-Driven Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Ofir Nabati
Guy Tennenholtz
Shie Mannor
353
3
0
31 May 2023
Learning Personalized Page Content Ranking Using Customer Representation
Xin Shen
Yan Zhao
Sujan Perera
Yujia Liu
Jinyun Yan
Mitchell Goodman
BDL
214
9
0
09 May 2023
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
267
11
0
05 May 2023
Evaluating Online Bandit Exploration In Large-Scale Recommender System
Hongbo Guo
Ruben Naeff
Alex Nikulkov
Zheqing Zhu
OffRL
267
11
0
05 Apr 2023
Uncertainty-Aware Instance Reweighting for Off-Policy Learning
Neural Information Processing Systems (NeurIPS), 2023
Xiaoying Zhang
Junpu Chen
Hongning Wang
Hong Xie
Yang Liu
John C. S. Lui
Hang Li
OffRL
309
4
0
11 Mar 2023
Neural-BO: A Black-box Optimization Algorithm using Deep Neural Networks
Neurocomputing (Neurocomputing), 2023
Dat Phan-Trong
Hung The Tran
Sunil R. Gupta
334
11
0
03 Mar 2023
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Neural Information Processing Systems (NeurIPS), 2022
Andrea Tirinzoni
Matteo Papini
Ahmed Touati
A. Lazaric
Matteo Pirotta
309
6
0
24 Oct 2022
Sample-Then-Optimize Batch Neural Thompson Sampling
Neural Information Processing Systems (NeurIPS), 2022
Zhongxiang Dai
Yao Shu
Bryan Kian Hsiang Low
Patrick Jaillet
AAML
228
30
0
13 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
International Conference on Information and Knowledge Management (CIKM), 2022
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
346
19
0
06 Sep 2022
Neural Design for Genetic Perturbation Experiments
International Conference on Learning Representations (ICLR), 2022
Aldo Pacchiano
Drausin Wulsin
Robert A. Barton
L. Voloch
330
7
0
26 Jul 2022
1
2
Next
Page 1 of 2