Mostly Exploration-Free Algorithms for Contextual Bandits

28 April 2017
Hamsa Bastani
Mohsen Bayati
Khashayar Khosravi
arXiv:1704.09011

Papers citing "Mostly Exploration-Free Algorithms for Contextual Bandits"

50 / 97 papers shown
Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams
Mohammed Almutairi
109
0
0
05 Jun 2025
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
420
0
0
27 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
J. Gornet
Yilin Mo
Bruno Sinopoli
69
0
0
04 Apr 2025
Sparse Nonparametric Contextual Bandits
Hamish Flynn
Julia Olkhovskaya
Paul Rognon-Vael
118
0
0
20 Mar 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
539
1
0
06 Mar 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Zhuohua Li
Maoli Liu
Xiangxiang Dai
John C. S. Lui
75
2
0
03 Jan 2025
Exploration and Persuasion
Aleksandrs Slivkins
420
12
0
22 Oct 2024
Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features
Rowan Swiers
Subash Prabanantham
Andrew Maher
21
0
0
13 Sep 2024
Contextual Bandits for Unbounded Context Distributions
Puning Zhao
Xiaogang Xu
Zhe Liu
Huiwen Wu
Qin Zhang
Zong Ke
Tianhang Zheng
301
10
0
19 Aug 2024
Jump Starting Bandits with LLM-Generated Prior Knowledge
P. A. Alamdari
Yanshuai Cao
Kevin H. Wilson
73
2
0
27 Jun 2024
Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars
Zhaoxuan Wu
Xiaoqiang Lin
Zhongxiang Dai
Wenyang Hu
Yao Shu
See-Kiong Ng
Patrick Jaillet
Bryan Kian Hsiang Low
53
11
0
25 May 2024
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
108
1
0
27 Feb 2024
Incentivized Exploration via Filtered Posterior Sampling
Anand Kalvit
Aleksandrs Slivkins
Yonatan Gur
57
2
0
20 Feb 2024
Thompson Sampling in Partially Observable Contextual Bandits
Hongju Park
Mohamad Kazem Shirani Faradonbeh
67
3
0
15 Feb 2024
Efficient Contextual Bandits with Uninformed Feedback Graphs
Mengxiao Zhang
Yuheng Zhang
Haipeng Luo
Paul Mineiro
49
4
0
12 Feb 2024
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
Yaqi Duan
Martin J. Wainwright
OffRL
52
3
0
10 Jan 2024
Best-of-Both-Worlds Algorithms for Linear Contextual Bandits
Yuko Kuroki
Alberto Rumi
Taira Tsuchiya
Fabio Vitale
Nicolò Cesa-Bianchi
122
7
0
24 Dec 2023
New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem
Koji Ichikawa
Shinji Ito
Daisuke Hatano
Hanna Sumita
Takuro Fukunaga
Naonori Kakimura
Ken-ichi Kawarabayashi
23
0
0
19 Dec 2023
Semidiscrete optimal transport with unknown costs
Yinchu Zhu
I. Ryzhov
OT
22
1
0
01 Oct 2023
Kernel $\varepsilon$-Greedy for Multi-Armed Bandits with Covariates
Sakshi Arya
Bharath K. Sriperumbudur
137
0
0
29 Jun 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo
Mohsen Bayati
58
1
0
26 Jun 2023
Strategic Apple Tasting
Keegan Harris
Chara Podimata
Zhiwei Steven Wu
84
7
0
09 Jun 2023
Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics
Guy Tennenholtz
Martin Mladenov
Nadav Merlis
Robert L. Axtell
Craig Boutilier
53
0
0
24 May 2023
Bandit Social Learning: Exploration under Myopic Behavior
Kiarash Banihashem
Mohammadtaghi Hajiaghayi
Suho Shin
Aleksandrs Slivkins
430
4
0
15 Feb 2023
Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications
Johannes Kirschner
Tor Lattimore
Andreas Krause
93
8
0
07 Feb 2023
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback
Wonyoung Hedge Kim
G. Iyengar
A. Zeevi
34
3
0
31 Jan 2023
Incentive-Aware Recommender Systems in Two-Sided Markets
Xiaowu Dai
Wenlu Xu
Yuan Qi
Michael I. Jordan
50
6
0
23 Nov 2022
Transfer Learning for Contextual Multi-armed Bandits
Changxiao Cai
T. Tony Cai
Hongzhe Li
127
19
0
22 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
79
3
0
27 Oct 2022
Advertising Media and Target Audience Optimization via High-dimensional Bandits
Wenjia Ba
J. Harrison
Harikesh S. Nair
57
0
0
17 Sep 2022
Risk-aware linear bandits with convex loss
Patrick Saux
Odalric-Ambrym Maillard
54
2
0
15 Sep 2022
Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits
Wonyoung Hedge Kim
Kyungbok Lee
M. Paik
102
14
0
15 Sep 2022
On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Yuxuan Han
Zhicong Liang
Zhipeng Liang
Yang Wang
Yuan Yao
Jiheng Zhang
66
1
0
16 Jun 2022
Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits
Wonyoung Hedge Kim
M. Paik
Min-whan Oh
56
6
0
11 Jun 2022
Meta Representation Learning with Contextual Linear Bandits
Leonardo Cella
Karim Lounici
Massimiliano Pontil
116
5
0
30 May 2022
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection
Peter Henderson
Ben Chugg
Brandon R. Anderson
Kristen M. Altenburger
Alex Turk
J. Guyton
Jacob Goldin
Daniel E. Ho
OffRL
50
10
0
25 Apr 2022
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations
Hongju Park
Mohamad Kazem Shirani Faradonbeh
OffRL
59
2
0
10 Apr 2022
Truncated LinUCB for Stochastic Linear Bandits
Yanglei Song
Meng Zhou
260
0
0
23 Feb 2022
Multi-task Representation Learning with Stochastic Linear Bandits
Leonardo Cella
Karim Lounici
Grégoire Pacreau
Massimiliano Pontil
105
22
0
21 Feb 2022
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts
Hongju Park
Mohamad Kazem Shirani Faradonbeh
43
6
0
02 Feb 2022
Multitask Learning and Bandits via Robust Statistics
Kan Xu
Hamsa Bastani
92
6
0
28 Dec 2021
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits
Yash J. Patel
Mohamad Kazem Shirani Faradonbeh
65
15
0
23 Oct 2021
Active Learning for Contextual Search with Binary Feedbacks
Xi Chen
Quanquan C. Liu
Yining Wang
31
0
0
03 Oct 2021
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification
James A. Grant
David S. Leslie
82
3
0
29 Sep 2021
Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment
Eli Ben-Michael
D. J. Greiner
Kosuke Imai
Zhichao Jiang
OffRL
229
22
0
22 Sep 2021
Dynamic Selection in Algorithmic Decision-making
Jin Li
Ye Luo
Xiaowei Zhang
89
2
0
28 Aug 2021
Model Selection for Generic Contextual Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
76
6
0
07 Jul 2021
On component interactions in two-stage recommender systems
Jiri Hron
K. Krauth
Michael I. Jordan
Niki Kilbertus
CML
LRM
74
31
0
28 Jun 2021
Generalized Linear Bandits with Local Differential Privacy
Yuxuan Han
Zhipeng Liang
Yang Wang
Jiheng Zhang
87
32
0
07 Jun 2021
Fair Exploration via Axiomatic Bargaining
Jackie Baek
Vivek F. Farias
FaML
83
29
0
04 Jun 2021