A Contextual-Bandit Approach to Personalized News Article Recommendation

28 February 2010

Papers citing "A Contextual-Bandit Approach to Personalized News Article Recommendation"

47 / 47 papers shown

Title
Counterfactual Multi-player Bandits for Explainable Recommendation Diversification Yansen Zhang Bowei He Xiaokun Zhang Haolun Wu Zexu Sun Chen Ma 116 1 0 27 May 2025
Abacus: A Cost-Based Optimizer for Semantic Operator Systems Matthew Russo Sivaprasad Sudhir Gerardo Vitagliano Chunwei Liu Tim Kraska Samuel Madden Michael Cafarella 70 0 0 20 May 2025
Neural Logistic Bandits Seoungbin Bae Dabeen Lee 397 0 0 04 May 2025
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects Shu Tamano Masanori Nojima OffRL 131 0 0 02 May 2025
Prompt Optimization with Logged Bandit Data Haruka Kiyohara Daniel Yiming Cao Yuta Saito Thorsten Joachims 140 0 0 03 Apr 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 274 1 0 06 Mar 2025
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models Yuan Sui Yufei He Tri Cao Simeng Han Yulin Chen Bryan Hooi LRM AI4CE 107 5 0 27 Feb 2025
Producers Equilibria and Dynamics in Engagement-Driven Recommender Systems Krishna Acharya Varun Vangala Jingyan Wang Juba Ziani 162 3 0 21 Feb 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability Carlos E. Luis A. Bottero Julia Vinogradska Felix Berkenkamp Jan Peters 145 1 0 20 Feb 2025
Contextual Linear Bandits with Delay as Payoff Mengxiao Zhang Yingfei Wang Haipeng Luo 110 0 0 18 Feb 2025
Linear Bandits with Partially Observable Features Wonyoung Hedge Kim Sungwoo Park G. Iyengar A. Zeevi Min Hwan Oh 122 1 0 10 Feb 2025
Policy Design for Two-sided Platforms with Participation Dynamics Haruka Kiyohara Fan Yao Sarah Dean 118 1 0 03 Feb 2025
Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting Ahmed Ben Yahmed Clément Calauzènes Vianney Perchet 87 1 0 28 Jan 2025
Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems Marco Angioli Marcello Barbirotta Abdallah Cheikh Antonio Mastrandrea Francesco Menichelli Mauro Olivieri 106 2 0 22 Jan 2025
A Complete Characterization of Learnability for Stochastic Noisy Bandits Steve Hanneke Kun Wang 87 0 0 20 Jan 2025
A Unified Regularization Approach to High-Dimensional Generalized Tensor Bandits Jiannan Li Yiyang Yang Shaojie Tang Yao Wang 117 0 0 18 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time Shen Li Yuyang Zhang Zhaolin Ren Claire Liang Na Li J. Shah 89 0 0 03 Jan 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits H. Bui Enrique Mallada Anqi Liu 365 0 0 08 Nov 2024
An Online Learning Approach to Prompt-based Selection of Generative Models Xiaoyan Hu Ho-fung Leung Farzan Farnia 139 3 0 17 Oct 2024
Second Order Bounds for Contextual Bandits with Function Approximation Aldo Pacchiano 152 4 0 24 Sep 2024
Contextual Bandits for Unbounded Context Distributions Puning Zhao Xiaogang Xu Zhe Liu Huiwen Wu Qin Zhang Zong Ke Tianhang Zheng 177 6 0 19 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback Arun Verma Zhongxiang Dai Xiaoqiang Lin Patrick Jaillet K. H. Low 103 5 0 24 Jul 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits Junghyun Lee Se-Young Yun Kwang-Sung Jun 96 5 0 19 Jul 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs Quang H. Nguyen Duy C. Hoang Juliette Decugis Saurav Manchanda Nitesh Chawla Khoa D. Doan 142 8 0 15 Jul 2024
Compositional Models for Estimating Causal Effects Purva Pruthi David D. Jensen CML 118 0 0 25 Jun 2024
Online Bandit Learning with Offline Preference Data for Improved RLHF Akhil Agnihotri Rahul Jain Deepak Ramachandran Zheng Wen OffRL 104 2 0 13 Jun 2024
Towards Domain Adaptive Neural Contextual Bandits Ziyan Wang Hao Wang 107 0 0 13 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning Subhojyoti Mukherjee Josiah P. Hanna Qiaomin Xie Robert Nowak 143 2 0 07 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning Mingqi Yuan Roger Creus Castanyer Bo Li Xin Jin Glen Berseth Wenjun Zeng 102 0 0 29 May 2024
To Ask or Not To Ask: Human-in-the-loop Contextual Bandits with Applications in Robot-Assisted Feeding Rohan Banerjee Rajat Kumar Jenamani Sidharth Vasudev Amal Nanavati Katherine Dimitropoulou Sarah Dean Tapomayukh Bhattacharjee 118 2 0 11 May 2024
Generalized Linear Bandits with Limited Adaptivity Ayush Sawarni Nirjhar Das Siddharth Barman Gaurav Sinha 88 3 0 10 Apr 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History Yi Xu Weiran Shen Xiao Zhang Jun Xu OffRL 120 0 0 24 Mar 2024
LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits Masahiro Kato Shinji Ito 83 0 0 05 Mar 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces Imad Aouali Victor-Emmanuel Brunel David Rohde Anna Korba OffRL 87 5 0 22 Feb 2024
Replicability is Asymptotically Free in Multi-armed Bandits Junpei Komiyama Shinji Ito Yuichi Yoshida Souta Koshino 91 1 0 12 Feb 2024
Non-Stationary Latent Auto-Regressive Bandits Anna L. Trella Walter Dempsey Asim H. Gazi Ziping Xu Finale Doshi-Velez Susan A. Murphy 69 1 0 05 Feb 2024
Learning Personalized Decision Support Policies Umang Bhatt Valerie Chen Katherine M. Collins Parameswaran Kamalaruban Emma Kallina Adrian Weller Ameet Talwalkar OffRL 118 10 0 13 Apr 2023
Selective Uncertainty Propagation in Offline RL Sanath Kumar Krishnamurthy Shrey Modi Tanmay Gangwani S. Katariya Branislav Kveton A. Rangi OffRL 128 0 0 01 Feb 2023
Truncated LinUCB for Stochastic Linear Bandits Yanglei Song Meng Zhou 140 0 0 23 Feb 2022
Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation Xu He Bo An Yanghua Li Haikai Chen Qingyu Guo Xuzhao Li Zhirong Wang 61 12 0 21 Aug 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation Yuta Saito Shunsuke Aihara Megumi Matsutani Yusuke Narita OffRL 108 75 0 17 Aug 2020
Kernel Methods for Cooperative Multi-Agent Contextual Bandits Abhimanyu Dubey Alex Pentland 72 29 0 14 Aug 2020
Self-Supervised Reinforcement Learning for Recommender Systems Xin Xin Alexandros Karatzoglou Ioannis Arapakis J. Jose SSL OffRL 88 200 0 10 Jun 2020
Model Selection in Contextual Stochastic Bandit Problems Aldo Pacchiano My Phan Yasin Abbasi-Yadkori Anup B. Rao Julian Zimmert Tor Lattimore Csaba Szepesvári 111 94 0 03 Mar 2020
Safe Linear Thompson Sampling with Side Information Ahmadreza Moradipari Sanae Amani M. Alizadeh Christos Thrampoulidis 88 43 0 06 Nov 2019
Bridging the Gap Between Value and Policy Based Reinforcement Learning Ofir Nachum Mohammad Norouzi Kelvin Xu Dale Schuurmans 113 470 0 28 Feb 2017
Exploring compact reinforcement-learning representations with linear regression Thomas J. Walsh I. Szita Carlos Diuk Michael L. Littman OffRL 174 114 0 09 May 2012