Practical Contextual Bandits with Regression Oracles

3 March 2018

Papers citing "Practical Contextual Bandits with Regression Oracles"

31 / 31 papers shown

Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback Nan Lu Ethan X. Fang Junwei Lu 251 0 0 27 Apr 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 86 1 0 06 Mar 2025
On The Statistical Complexity of Offline Decision-Making Thanh Nguyen-Tang R. Arora OffRL 53 1 0 10 Jan 2025
An Online Learning Approach to Prompt-based Selection of Generative Models Xiaoyan Hu Ho-fung Leung Farzan Farnia 45 2 0 17 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning Kaiwen Wang Nathan Kallus Wen Sun OffRL 72 7 0 19 Sep 2024
Towards Domain Adaptive Neural Contextual Bandits Ziyan Wang Hao Wang Hao Wang 52 0 0 13 Jun 2024
Online Learning with Unknown Constraints Karthik Sridharan Seung Won Wilson Yoo 33 2 0 06 Mar 2024
Stochastic Graph Bandit Learning with Side-Observations Xueping Gong Jiheng Zhang 36 1 0 29 Aug 2023
Oracle Efficient Online Multicalibration and Omniprediction Sumegha Garg Christopher Jung Omer Reingold Aaron Roth 23 18 0 18 Jul 2023
Provably Efficient Reinforcement Learning via Surprise Bound Hanlin Zhu Ruosong Wang Jason D. Lee OffRL 30 5 0 22 Feb 2023
Infinite Action Contextual Bandits with Reusable Data Exhaust Mark Rucker Yinglun Zhu Paul Mineiro OffRL 23 1 0 16 Feb 2023
Multicalibration as Boosting for Regression Ira Globus-Harris Declan Harrison Michael Kearns Aaron Roth Jessica Sorrell 35 21 0 31 Jan 2023
Eluder-based Regret for Stochastic Contextual MDPs Orin Levy Asaf B. Cassel Alon Cohen Yishay Mansour 38 6 0 27 Nov 2022
Global Optimization with Parametric Function Approximation Chong Liu Yu Wang 41 7 0 16 Nov 2022
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles Yuxuan Han Jialin Zeng Yang Wang Yangzhen Xiang Jiheng Zhang 61 9 0 21 Oct 2022
Efficient Active Learning with Abstention Yinglun Zhu Robert D. Nowak 54 11 0 31 Mar 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles Aldo G. Carranza Sanath Kumar Krishnamurthy Susan Athey 24 1 0 30 Mar 2022
Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation Dylan J. Foster A. Krishnamurthy D. Simchi-Levi Yunzong Xu OffRL 23 62 0 21 Nov 2021
Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination Dylan J. Foster A. Krishnamurthy 48 43 0 05 Jul 2021
On component interactions in two-stage recommender systems Jiri Hron K. Krauth Michael I. Jordan Niki Kilbertus CML LRM 42 31 0 28 Jun 2021
An Efficient Algorithm for Deep Stochastic Contextual Bandits Tan Zhu Guannan Liang Chunjiang Zhu HaiNing Li J. Bi 45 1 0 12 Apr 2021
Leveraging Post Hoc Context for Faster Learning in Bandit Settings with Applications in Robot-Assisted Feeding E. Gordon Sumegh Roychowdhury Tapomayukh Bhattacharjee Kevin G. Jamieson S. Srinivasa 28 18 0 05 Nov 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality Kwang-Sung Jun Chicheng Zhang 31 10 0 15 Jun 2020
Federated Residual Learning Alekh Agarwal John Langford Chen-Yu Wei FedML 24 40 0 28 Mar 2020
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability D. Simchi-Levi Yunzong Xu OffRL 51 107 0 28 Mar 2020
Optimism in Reinforcement Learning with Generalized Linear Function Approximation Yining Wang Ruosong Wang S. Du A. Krishnamurthy 137 135 0 09 Dec 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces Mikael Henaff OffRL 22 31 0 01 Nov 2019
Adaptive Robot-Assisted Feeding: An Online Learning Framework for Acquiring Previously Unseen Food Items E. Gordon Xiang Meng Matt Barnes Tapomayukh Bhattacharjee S. Srinivasa OffRL OnRL 18 45 0 19 Aug 2019
Model selection for contextual bandits Dylan J. Foster A. Krishnamurthy Haipeng Luo OffRL 34 90 0 03 Jun 2019
Rarely-switching linear bandits: optimization of causal effects for the real world B. Lansdell Sofia Triantafillou Konrad Paul Kording 22 4 0 30 May 2019
Active Learning for Cost-Sensitive Classification A. Krishnamurthy Alekh Agarwal Tzu-Kuo Huang Hal Daumé John Langford 21 79 0 03 Mar 2017