ResearchTrend.AI
A Contextual-Bandit Approach to Personalized News Article Recommendation

28 February 2010
Lihong Li
Wei Chu
John Langford
Robert Schapire
arXiv: 1003.0146

Papers citing "A Contextual-Bandit Approach to Personalized News Article Recommendation"

47 / 47 papers shown
Counterfactual Multi-player Bandits for Explainable Recommendation Diversification
Yansen Zhang
Bowei He
Xiaokun Zhang
Haolun Wu
Zexu Sun
Chen Ma
116
1
0
27 May 2025
Abacus: A Cost-Based Optimizer for Semantic Operator Systems
Matthew Russo
Sivaprasad Sudhir
Gerardo Vitagliano
Chunwei Liu
Tim Kraska
Samuel Madden
Michael Cafarella
70
0
0
20 May 2025
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
397
0
0
04 May 2025
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
131
0
0
02 May 2025
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
140
0
0
03 Apr 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
274
1
0
06 Mar 2025
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models
Yuan Sui
Yufei He
Tri Cao
Simeng Han
Yulin Chen
Bryan Hooi
LRM
AI4CE
107
5
0
27 Feb 2025
Producers Equilibria and Dynamics in Engagement-Driven Recommender Systems
Krishna Acharya
Varun Vangala
Jingyan Wang
Juba Ziani
162
3
0
21 Feb 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
145
1
0
20 Feb 2025
Contextual Linear Bandits with Delay as Payoff
Mengxiao Zhang
Yingfei Wang
Haipeng Luo
110
0
0
18 Feb 2025
Linear Bandits with Partially Observable Features
Wonyoung Hedge Kim
Sungwoo Park
G. Iyengar
A. Zeevi
Min Hwan Oh
122
1
0
10 Feb 2025
Policy Design for Two-sided Platforms with Participation Dynamics
Haruka Kiyohara
Fan Yao
Sarah Dean
118
1
0
03 Feb 2025
Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting
Ahmed Ben Yahmed
Clément Calauzènes
Vianney Perchet
87
1
0
28 Jan 2025
Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems
Marco Angioli
Marcello Barbirotta
Abdallah Cheikh
Antonio Mastrandrea
Francesco Menichelli
Mauro Olivieri
106
2
0
22 Jan 2025
A Complete Characterization of Learnability for Stochastic Noisy Bandits
Steve Hanneke
Kun Wang
87
0
0
20 Jan 2025
A Unified Regularization Approach to High-Dimensional Generalized Tensor Bandits
Jiannan Li
Yiyang Yang
Shaojie Tang
Yao Wang
117
0
0
18 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time
Shen Li
Yuyang Zhang
Zhaolin Ren
Claire Liang
Na Li
J. Shah
89
0
0
03 Jan 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
365
0
0
08 Nov 2024
An Online Learning Approach to Prompt-based Selection of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
139
3
0
17 Oct 2024
Second Order Bounds for Contextual Bandits with Function Approximation
Aldo Pacchiano
152
4
0
24 Sep 2024
Contextual Bandits for Unbounded Context Distributions
Puning Zhao
Xiaogang Xu
Zhe Liu
Huiwen Wu
Qin Zhang
Zong Ke
Tianhang Zheng
177
6
0
19 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
Patrick Jaillet
K. H. Low
103
5
0
24 Jul 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
96
5
0
19 Jul 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs
Quang H. Nguyen
Duy C. Hoang
Juliette Decugis
Saurav Manchanda
Nitesh Chawla
Khoa D. Doan
142
8
0
15 Jul 2024
Compositional Models for Estimating Causal Effects
Purva Pruthi
David D. Jensen
CML
118
0
0
25 Jun 2024
Online Bandit Learning with Offline Preference Data for Improved RLHF
Akhil Agnihotri
Rahul Jain
Deepak Ramachandran
Zheng Wen
OffRL
104
2
0
13 Jun 2024
Towards Domain Adaptive Neural Contextual Bandits
Ziyan Wang
Hao Wang
107
0
0
13 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
143
2
0
07 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
102
0
0
29 May 2024
To Ask or Not To Ask: Human-in-the-loop Contextual Bandits with Applications in Robot-Assisted Feeding
Rohan Banerjee
Rajat Kumar Jenamani
Sidharth Vasudev
Amal Nanavati
Katherine Dimitropoulou
Sarah Dean
Tapomayukh Bhattacharjee
118
2
0
11 May 2024
Generalized Linear Bandits with Limited Adaptivity
Ayush Sawarni
Nirjhar Das
Siddharth Barman
Gaurav Sinha
88
3
0
10 Apr 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
120
0
0
24 Mar 2024
LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits
Masahiro Kato
Shinji Ito
83
0
0
05 Mar 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
87
5
0
22 Feb 2024
Replicability is Asymptotically Free in Multi-armed Bandits
Junpei Komiyama
Shinji Ito
Yuichi Yoshida
Souta Koshino
91
1
0
12 Feb 2024
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
69
1
0
05 Feb 2024
Learning Personalized Decision Support Policies
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
118
10
0
13 Apr 2023
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
Branislav Kveton
A. Rangi
OffRL
128
0
0
01 Feb 2023
Truncated LinUCB for Stochastic Linear Bandits
Yanglei Song
Meng Zhou
140
0
0
23 Feb 2022
Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation
Xu He
Bo An
Yanghua Li
Haikai Chen
Qingyu Guo
Xuzhao Li
Zhirong Wang
61
12
0
21 Aug 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
108
75
0
17 Aug 2020
Kernel Methods for Cooperative Multi-Agent Contextual Bandits
Abhimanyu Dubey
Alex Pentland
72
29
0
14 Aug 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
88
200
0
10 Jun 2020
Model Selection in Contextual Stochastic Bandit Problems
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
111
94
0
03 Mar 2020
Safe Linear Thompson Sampling with Side Information
Ahmadreza Moradipari
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
88
43
0
06 Nov 2019
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
113
470
0
28 Feb 2017
Exploring compact reinforcement-learning representations with linear regression
Thomas J. Walsh
I. Szita
Carlos Diuk
Michael L. Littman
OffRL
174
114
0
09 May 2012