A Contextual-Bandit Approach to Personalized News Article Recommendation

28 February 2010

Papers citing "A Contextual-Bandit Approach to Personalized News Article Recommendation"

Counterfactual Multi-player Bandits for Explainable Recommendation Diversification Yansen Zhang Bowei He Xiaokun Zhang Haolun Wu Zexu Sun Chen Ma 84 1 0 27 May 2025
Abacus: A Cost-Based Optimizer for Semantic Operator Systems Matthew Russo Sivaprasad Sudhir Gerardo Vitagliano Chunwei Liu Tim Kraska Samuel Madden Michael Cafarella 56 0 0 20 May 2025
Neural Logistic Bandits Seoungbin Bae Dabeen Lee 370 0 0 04 May 2025
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects Shu Tamano Masanori Nojima OffRL 111 0 0 02 May 2025
Prompt Optimization with Logged Bandit Data Haruka Kiyohara Daniel Yiming Cao Yuta Saito Thorsten Joachims 123 0 0 03 Apr 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 207 1 0 06 Mar 2025
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models Yuan Sui Yufei He Tri Cao Simeng Han Yulin Chen Bryan Hooi LRM AI4CE 97 5 0 27 Feb 2025
Producers Equilibria and Dynamics in Engagement-Driven Recommender Systems Krishna Acharya Varun Vangala Jingyan Wang Juba Ziani 147 3 0 21 Feb 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability Carlos E. Luis A. Bottero Julia Vinogradska Felix Berkenkamp Jan Peters 128 1 0 20 Feb 2025
Contextual Linear Bandits with Delay as Payoff Mengxiao Zhang Yingfei Wang Haipeng Luo 103 0 0 18 Feb 2025
Linear Bandits with Partially Observable Features Wonyoung Hedge Kim Sungwoo Park G. Iyengar A. Zeevi Min Hwan Oh 113 1 0 10 Feb 2025
Policy Design for Two-sided Platforms with Participation Dynamics Haruka Kiyohara Fan Yao Sarah Dean 109 1 0 03 Feb 2025
Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting Ahmed Ben Yahmed Clément Calauzènes Vianney Perchet 74 1 0 28 Jan 2025
Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems Marco Angioli Marcello Barbirotta Abdallah Cheikh Antonio Mastrandrea Francesco Menichelli Mauro Olivieri 89 2 0 22 Jan 2025
A Complete Characterization of Learnability for Stochastic Noisy Bandits Steve Hanneke Kun Wang 80 0 0 20 Jan 2025
A Unified Regularization Approach to High-Dimensional Generalized Tensor Bandits Jiannan Li Yiyang Yang Shaojie Tang Yao Wang 103 0 0 18 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time Shen Li Yuyang Zhang Zhaolin Ren Claire Liang Na Li J. Shah 82 0 0 03 Jan 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits H. Bui Enrique Mallada Anqi Liu 342 0 0 08 Nov 2024
An Online Learning Approach to Prompt-based Selection of Generative Models Xiaoyan Hu Ho-fung Leung Farzan Farnia 126 3 0 17 Oct 2024
Second Order Bounds for Contextual Bandits with Function Approximation Aldo Pacchiano 142 4 0 24 Sep 2024
Contextual Bandits for Unbounded Context Distributions Puning Zhao Xiaogang Xu Zhe Liu Huiwen Wu Qin Zhang Zong Ke Tianhang Zheng 155 6 0 19 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback Arun Verma Zhongxiang Dai Xiaoqiang Lin Patrick Jaillet K. H. Low 84 5 0 24 Jul 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits Junghyun Lee Se-Young Yun Kwang-Sung Jun 86 5 0 19 Jul 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs Quang H. Nguyen Duy C. Hoang Juliette Decugis Saurav Manchanda Nitesh Chawla Khoa D. Doan 131 8 0 15 Jul 2024
Compositional Models for Estimating Causal Effects Purva Pruthi David D. Jensen CML 111 0 0 25 Jun 2024
Online Bandit Learning with Offline Preference Data for Improved RLHF Akhil Agnihotri Rahul Jain Deepak Ramachandran Zheng Wen OffRL 97 2 0 13 Jun 2024
Towards Domain Adaptive Neural Contextual Bandits Ziyan Wang Hao Wang Hao Wang 96 0 0 13 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning Subhojyoti Mukherjee Josiah P. Hanna Qiaomin Xie Robert Nowak 131 2 0 07 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning Mingqi Yuan Roger Creus Castanyer Bo Li Xin Jin Glen Berseth Wenjun Zeng 90 0 0 29 May 2024
To Ask or Not To Ask: Human-in-the-loop Contextual Bandits with Applications in Robot-Assisted Feeding Rohan Banerjee Rajat Kumar Jenamani Sidharth Vasudev Amal Nanavati Katherine Dimitropoulou Sarah Dean Tapomayukh Bhattacharjee 111 2 0 11 May 2024
Generalized Linear Bandits with Limited Adaptivity Ayush Sawarni Nirjhar Das Siddharth Barman Gaurav Sinha 79 3 0 10 Apr 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History Yi Xu Weiran Shen Xiao Zhang Jun Xu OffRL 114 0 0 24 Mar 2024
LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits Masahiro Kato Shinji Ito 76 0 0 05 Mar 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces Imad Aouali Victor-Emmanuel Brunel David Rohde Anna Korba OffRL 78 5 0 22 Feb 2024
Replicability is Asymptotically Free in Multi-armed Bandits Junpei Komiyama Shinji Ito Yuichi Yoshida Souta Koshino 87 1 0 12 Feb 2024
Non-Stationary Latent Auto-Regressive Bandits Anna L. Trella Walter Dempsey Asim H. Gazi Ziping Xu Finale Doshi-Velez Susan A. Murphy 60 1 0 05 Feb 2024
Learning Personalized Decision Support Policies Umang Bhatt Valerie Chen Katherine M. Collins Parameswaran Kamalaruban Emma Kallina Adrian Weller Ameet Talwalkar OffRL 112 10 0 13 Apr 2023
Selective Uncertainty Propagation in Offline RL Sanath Kumar Krishnamurthy Shrey Modi Tanmay Gangwani S. Katariya Branislav Kveton A. Rangi OffRL 113 0 0 01 Feb 2023
Truncated LinUCB for Stochastic Linear Bandits Yanglei Song Meng Zhou 121 0 0 23 Feb 2022
Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation Xu He Bo An Yanghua Li Haikai Chen Qingyu Guo Xuzhao Li Zhirong Wang 50 12 0 21 Aug 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation Yuta Saito Shunsuke Aihara Megumi Matsutani Yusuke Narita OffRL 90 74 0 17 Aug 2020
Kernel Methods for Cooperative Multi-Agent Contextual Bandits Abhimanyu Dubey Alex Pentland 61 29 0 14 Aug 2020
Self-Supervised Reinforcement Learning for Recommender Systems Xin Xin Alexandros Karatzoglou Ioannis Arapakis J. Jose SSL OffRL 82 200 0 10 Jun 2020
Model Selection in Contextual Stochastic Bandit Problems Aldo Pacchiano My Phan Yasin Abbasi-Yadkori Anup B. Rao Julian Zimmert Tor Lattimore Csaba Szepesvári 101 93 0 03 Mar 2020
Safe Linear Thompson Sampling with Side Information Ahmadreza Moradipari Sanae Amani M. Alizadeh Christos Thrampoulidis 78 42 0 06 Nov 2019
Bridging the Gap Between Value and Policy Based Reinforcement Learning Ofir Nachum Mohammad Norouzi Kelvin Xu Dale Schuurmans 100 469 0 28 Feb 2017
Exploring compact reinforcement-learning representations with linear regression Thomas J. Walsh I. Szita Carlos Diuk Michael L. Littman OffRL 158 114 0 09 May 2012