Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1003.0146
Cited By
A Contextual-Bandit Approach to Personalized News Article Recommendation
28 February 2010
Lihong Li
Wei Chu
John Langford
Robert Schapire
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Contextual-Bandit Approach to Personalized News Article Recommendation"
47 / 47 papers shown
Title
Counterfactual Multi-player Bandits for Explainable Recommendation Diversification
Yansen Zhang
Bowei He
Xiaokun Zhang
Haolun Wu
Zexu Sun
Chen Ma
84
1
0
27 May 2025
Abacus: A Cost-Based Optimizer for Semantic Operator Systems
Matthew Russo
Sivaprasad Sudhir
Gerardo Vitagliano
Chunwei Liu
Tim Kraska
Samuel Madden
Michael Cafarella
56
0
0
20 May 2025
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
370
0
0
04 May 2025
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
111
0
0
02 May 2025
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
123
0
0
03 Apr 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
207
1
0
06 Mar 2025
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models
Yuan Sui
Yufei He
Tri Cao
Simeng Han
Yulin Chen
Bryan Hooi
LRM
AI4CE
97
5
0
27 Feb 2025
Producers Equilibria and Dynamics in Engagement-Driven Recommender Systems
Krishna Acharya
Varun Vangala
Jingyan Wang
Juba Ziani
147
3
0
21 Feb 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
128
1
0
20 Feb 2025
Contextual Linear Bandits with Delay as Payoff
Mengxiao Zhang
Yingfei Wang
Haipeng Luo
103
0
0
18 Feb 2025
Linear Bandits with Partially Observable Features
Wonyoung Hedge Kim
Sungwoo Park
G. Iyengar
A. Zeevi
Min Hwan Oh
113
1
0
10 Feb 2025
Policy Design for Two-sided Platforms with Participation Dynamics
Haruka Kiyohara
Fan Yao
Sarah Dean
109
1
0
03 Feb 2025
Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting
Ahmed Ben Yahmed
Clément Calauzènes
Vianney Perchet
74
1
0
28 Jan 2025
Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems
Marco Angioli
Marcello Barbirotta
Abdallah Cheikh
Antonio Mastrandrea
Francesco Menichelli
Mauro Olivieri
89
2
0
22 Jan 2025
A Complete Characterization of Learnability for Stochastic Noisy Bandits
Steve Hanneke
Kun Wang
80
0
0
20 Jan 2025
A Unified Regularization Approach to High-Dimensional Generalized Tensor Bandits
Jiannan Li
Yiyang Yang
Shaojie Tang
Yao Wang
103
0
0
18 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time
Shen Li
Yuyang Zhang
Zhaolin Ren
Claire Liang
Na Li
J. Shah
82
0
0
03 Jan 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
342
0
0
08 Nov 2024
An Online Learning Approach to Prompt-based Selection of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
126
3
0
17 Oct 2024
Second Order Bounds for Contextual Bandits with Function Approximation
Aldo Pacchiano
142
4
0
24 Sep 2024
Contextual Bandits for Unbounded Context Distributions
Puning Zhao
Xiaogang Xu
Zhe Liu
Huiwen Wu
Qin Zhang
Zong Ke
Tianhang Zheng
155
6
0
19 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
Patrick Jaillet
K. H. Low
84
5
0
24 Jul 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
86
5
0
19 Jul 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs
Quang H. Nguyen
Duy C. Hoang
Juliette Decugis
Saurav Manchanda
Nitesh Chawla
Khoa D. Doan
Khoa D. Doan
131
8
0
15 Jul 2024
Compositional Models for Estimating Causal Effects
Purva Pruthi
David D. Jensen
CML
111
0
0
25 Jun 2024
Online Bandit Learning with Offline Preference Data for Improved RLHF
Akhil Agnihotri
Rahul Jain
Deepak Ramachandran
Zheng Wen
OffRL
97
2
0
13 Jun 2024
Towards Domain Adaptive Neural Contextual Bandits
Ziyan Wang
Hao Wang
Hao Wang
96
0
0
13 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
131
2
0
07 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
90
0
0
29 May 2024
To Ask or Not To Ask: Human-in-the-loop Contextual Bandits with Applications in Robot-Assisted Feeding
Rohan Banerjee
Rajat Kumar Jenamani
Sidharth Vasudev
Amal Nanavati
Katherine Dimitropoulou
Sarah Dean
Tapomayukh Bhattacharjee
111
2
0
11 May 2024
Generalized Linear Bandits with Limited Adaptivity
Ayush Sawarni
Nirjhar Das
Siddharth Barman
Gaurav Sinha
79
3
0
10 Apr 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
114
0
0
24 Mar 2024
LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits
Masahiro Kato
Shinji Ito
76
0
0
05 Mar 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
78
5
0
22 Feb 2024
Replicability is Asymptotically Free in Multi-armed Bandits
Junpei Komiyama
Shinji Ito
Yuichi Yoshida
Souta Koshino
87
1
0
12 Feb 2024
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
60
1
0
05 Feb 2024
Learning Personalized Decision Support Policies
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
112
10
0
13 Apr 2023
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
Branislav Kveton
A. Rangi
OffRL
113
0
0
01 Feb 2023
Truncated LinUCB for Stochastic Linear Bandits
Yanglei Song
Meng zhou
121
0
0
23 Feb 2022
Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation
Xu He
Bo An
Yanghua Li
Haikai Chen
Qingyu Guo
Xuzhao Li
Zhirong Wang
50
12
0
21 Aug 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
90
74
0
17 Aug 2020
Kernel Methods for Cooperative Multi-Agent Contextual Bandits
Abhimanyu Dubey
Alex Pentland
61
29
0
14 Aug 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
82
200
0
10 Jun 2020
Model Selection in Contextual Stochastic Bandit Problems
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
101
93
0
03 Mar 2020
Safe Linear Thompson Sampling with Side Information
Ahmadreza Moradipari
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
78
42
0
06 Nov 2019
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
100
469
0
28 Feb 2017
Exploring compact reinforcement-learning representations with linear regression
Thomas J. Walsh
I. Szita
Carlos Diuk
Michael L. Littman
OffRL
158
114
0
09 May 2012
1