Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.01088
Cited By
Practical Contextual Bandits with Regression Oracles
3 March 2018
Dylan J. Foster
Alekh Agarwal
Miroslav Dudík
Haipeng Luo
Robert Schapire
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Practical Contextual Bandits with Regression Oracles"
31 / 31 papers shown
Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
251
0
0
27 Apr 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
86
1
0
06 Mar 2025
On The Statistical Complexity of Offline Decision-Making
Thanh Nguyen-Tang
R. Arora
OffRL
53
1
0
10 Jan 2025
An Online Learning Approach to Prompt-based Selection of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
45
2
0
17 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
72
7
0
19 Sep 2024
Towards Domain Adaptive Neural Contextual Bandits
Ziyan Wang
Hao Wang
Hao Wang
52
0
0
13 Jun 2024
Online Learning with Unknown Constraints
Karthik Sridharan
Seung Won Wilson Yoo
33
2
0
06 Mar 2024
Stochastic Graph Bandit Learning with Side-Observations
Xueping Gong
Jiheng Zhang
36
1
0
29 Aug 2023
Oracle Efficient Online Multicalibration and Omniprediction
Sumegha Garg
Christopher Jung
Omer Reingold
Aaron Roth
23
18
0
18 Jul 2023
Provably Efficient Reinforcement Learning via Surprise Bound
Hanlin Zhu
Ruosong Wang
Jason D. Lee
OffRL
30
5
0
22 Feb 2023
Infinite Action Contextual Bandits with Reusable Data Exhaust
Mark Rucker
Yinglun Zhu
Paul Mineiro
OffRL
23
1
0
16 Feb 2023
Multicalibration as Boosting for Regression
Ira Globus-Harris
Declan Harrison
Michael Kearns
Aaron Roth
Jessica Sorrell
35
21
0
31 Jan 2023
Eluder-based Regret for Stochastic Contextual MDPs
Orin Levy
Asaf B. Cassel
Alon Cohen
Yishay Mansour
38
6
0
27 Nov 2022
Global Optimization with Parametric Function Approximation
Chong Liu
Yu Wang
41
7
0
16 Nov 2022
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
61
9
0
21 Oct 2022
Efficient Active Learning with Abstention
Yinglun Zhu
Robert D. Nowak
54
11
0
31 Mar 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
24
1
0
30 Mar 2022
Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation
Dylan J. Foster
A. Krishnamurthy
D. Simchi-Levi
Yunzong Xu
OffRL
23
62
0
21 Nov 2021
Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination
Dylan J. Foster
A. Krishnamurthy
48
43
0
05 Jul 2021
On component interactions in two-stage recommender systems
Jiri Hron
K. Krauth
Michael I. Jordan
Niki Kilbertus
CML
LRM
42
31
0
28 Jun 2021
An Efficient Algorithm for Deep Stochastic Contextual Bandits
Tan Zhu
Guannan Liang
Chunjiang Zhu
HaiNing Li
J. Bi
45
1
0
12 Apr 2021
Leveraging Post Hoc Context for Faster Learning in Bandit Settings with Applications in Robot-Assisted Feeding
E. Gordon
Sumegh Roychowdhury
Tapomayukh Bhattacharjee
Kevin G. Jamieson
S. Srinivasa
28
18
0
05 Nov 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
31
10
0
15 Jun 2020
Federated Residual Learning
Alekh Agarwal
John Langford
Chen-Yu Wei
FedML
24
40
0
28 Mar 2020
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability
D. Simchi-Levi
Yunzong Xu
OffRL
51
107
0
28 Mar 2020
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
137
135
0
09 Dec 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
22
31
0
01 Nov 2019
Adaptive Robot-Assisted Feeding: An Online Learning Framework for Acquiring Previously Unseen Food Items
E. Gordon
Xiang Meng
Matt Barnes
Tapomayukh Bhattacharjee
S. Srinivasa
OffRL
OnRL
18
45
0
19 Aug 2019
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
34
90
0
03 Jun 2019
Rarely-switching linear bandits: optimization of causal effects for the real world
B. Lansdell
Sofia Triantafillou
Konrad Paul Kording
22
4
0
30 May 2019
Active Learning for Cost-Sensitive Classification
A. Krishnamurthy
Alekh Agarwal
Tzu-Kuo Huang
Hal Daumé
John Langford
21
79
0
03 Mar 2017
1