ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.01088
  4. Cited By
Practical Contextual Bandits with Regression Oracles

Practical Contextual Bandits with Regression Oracles

3 March 2018
Dylan J. Foster
Alekh Agarwal
Miroslav Dudík
Haipeng Luo
Robert Schapire
ArXivPDFHTML

Papers citing "Practical Contextual Bandits with Regression Oracles"

31 / 31 papers shown
Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
248
0
0
27 Apr 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
86
1
0
06 Mar 2025
On The Statistical Complexity of Offline Decision-Making
On The Statistical Complexity of Offline Decision-Making
Thanh Nguyen-Tang
R. Arora
OffRL
53
1
0
10 Jan 2025
An Online Learning Approach to Prompt-based Selection of Generative Models
An Online Learning Approach to Prompt-based Selection of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
45
2
0
17 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
72
7
0
19 Sep 2024
Towards Domain Adaptive Neural Contextual Bandits
Towards Domain Adaptive Neural Contextual Bandits
Ziyan Wang
Hao Wang
Hao Wang
52
0
0
13 Jun 2024
Online Learning with Unknown Constraints
Online Learning with Unknown Constraints
Karthik Sridharan
Seung Won Wilson Yoo
33
2
0
06 Mar 2024
Stochastic Graph Bandit Learning with Side-Observations
Stochastic Graph Bandit Learning with Side-Observations
Xueping Gong
Jiheng Zhang
34
1
0
29 Aug 2023
Oracle Efficient Online Multicalibration and Omniprediction
Oracle Efficient Online Multicalibration and Omniprediction
Sumegha Garg
Christopher Jung
Omer Reingold
Aaron Roth
23
18
0
18 Jul 2023
Provably Efficient Reinforcement Learning via Surprise Bound
Provably Efficient Reinforcement Learning via Surprise Bound
Hanlin Zhu
Ruosong Wang
Jason D. Lee
OffRL
30
5
0
22 Feb 2023
Infinite Action Contextual Bandits with Reusable Data Exhaust
Infinite Action Contextual Bandits with Reusable Data Exhaust
Mark Rucker
Yinglun Zhu
Paul Mineiro
OffRL
23
1
0
16 Feb 2023
Multicalibration as Boosting for Regression
Multicalibration as Boosting for Regression
Ira Globus-Harris
Declan Harrison
Michael Kearns
Aaron Roth
Jessica Sorrell
32
21
0
31 Jan 2023
Eluder-based Regret for Stochastic Contextual MDPs
Eluder-based Regret for Stochastic Contextual MDPs
Orin Levy
Asaf B. Cassel
Alon Cohen
Yishay Mansour
38
5
0
27 Nov 2022
Global Optimization with Parametric Function Approximation
Global Optimization with Parametric Function Approximation
Chong Liu
Yu Wang
41
7
0
16 Nov 2022
Optimal Contextual Bandits with Knapsacks under Realizability via
  Regression Oracles
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
59
9
0
21 Oct 2022
Efficient Active Learning with Abstention
Efficient Active Learning with Abstention
Yinglun Zhu
Robert D. Nowak
54
11
0
31 Mar 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment
  Effect Oracles
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
24
1
0
30 Mar 2022
Offline Reinforcement Learning: Fundamental Barriers for Value Function
  Approximation
Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation
Dylan J. Foster
A. Krishnamurthy
D. Simchi-Levi
Yunzong Xu
OffRL
23
62
0
21 Nov 2021
Efficient First-Order Contextual Bandits: Prediction, Allocation, and
  Triangular Discrimination
Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination
Dylan J. Foster
A. Krishnamurthy
48
43
0
05 Jul 2021
On component interactions in two-stage recommender systems
On component interactions in two-stage recommender systems
Jiri Hron
K. Krauth
Michael I. Jordan
Niki Kilbertus
CML
LRM
42
31
0
28 Jun 2021
An Efficient Algorithm for Deep Stochastic Contextual Bandits
An Efficient Algorithm for Deep Stochastic Contextual Bandits
Tan Zhu
Guannan Liang
Chunjiang Zhu
HaiNing Li
J. Bi
45
1
0
12 Apr 2021
Leveraging Post Hoc Context for Faster Learning in Bandit Settings with
  Applications in Robot-Assisted Feeding
Leveraging Post Hoc Context for Faster Learning in Bandit Settings with Applications in Robot-Assisted Feeding
E. Gordon
Sumegh Roychowdhury
Tapomayukh Bhattacharjee
Kevin G. Jamieson
S. Srinivasa
26
18
0
05 Nov 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic
  Optimality
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
31
10
0
15 Jun 2020
Federated Residual Learning
Federated Residual Learning
Alekh Agarwal
John Langford
Chen-Yu Wei
FedML
24
40
0
28 Mar 2020
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for
  Contextual Bandits under Realizability
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability
D. Simchi-Levi
Yunzong Xu
OffRL
51
107
0
28 Mar 2020
Optimism in Reinforcement Learning with Generalized Linear Function
  Approximation
Optimism in Reinforcement Learning with Generalized Linear Function Approximation
Yining Wang
Ruosong Wang
S. Du
A. Krishnamurthy
137
135
0
09 Dec 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
22
31
0
01 Nov 2019
Adaptive Robot-Assisted Feeding: An Online Learning Framework for
  Acquiring Previously Unseen Food Items
Adaptive Robot-Assisted Feeding: An Online Learning Framework for Acquiring Previously Unseen Food Items
E. Gordon
Xiang Meng
Matt Barnes
Tapomayukh Bhattacharjee
S. Srinivasa
OffRL
OnRL
13
45
0
19 Aug 2019
Model selection for contextual bandits
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
34
90
0
03 Jun 2019
Rarely-switching linear bandits: optimization of causal effects for the
  real world
Rarely-switching linear bandits: optimization of causal effects for the real world
B. Lansdell
Sofia Triantafillou
Konrad Paul Kording
22
4
0
30 May 2019
Active Learning for Cost-Sensitive Classification
Active Learning for Cost-Sensitive Classification
A. Krishnamurthy
Alekh Agarwal
Tzu-Kuo Huang
Hal Daumé
John Langford
21
79
0
03 Mar 2017
1