v1v2v3v4v5v6v7v8 (latest)

Mostly Exploration-Free Algorithms for Contextual Bandits

28 April 2017

Papers citing "Mostly Exploration-Free Algorithms for Contextual Bandits"

50 / 97 papers shown

Title
Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams Mohammed Almutairi 109 0 0 05 Jun 2025
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback Nan Lu Ethan X. Fang Junwei Lu 420 0 0 27 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System J. Gornet Yilin Mo Bruno Sinopoli 69 0 0 04 Apr 2025
Sparse Nonparametric Contextual Bandits Hamish Flynn Julia Olkhovskaya Paul Rognon-Vael 118 0 0 20 Mar 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 539 1 0 06 Mar 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts Zhuohua Li Maoli Liu Xiangxiang Dai John C. S. Lui 75 2 0 03 Jan 2025
Exploration and Persuasion Aleksandrs Slivkins 420 12 0 22 Oct 2024
Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features Rowan Swiers Subash Prabanantham Andrew Maher 21 0 0 13 Sep 2024
Contextual Bandits for Unbounded Context Distributions Puning Zhao Xiaogang Xu Zhe Liu Huiwen Wu Qin Zhang Zong Ke Tianhang Zheng 301 10 0 19 Aug 2024
Jump Starting Bandits with LLM-Generated Prior Knowledge P. A. Alamdari Yanshuai Cao Kevin H. Wilson 73 2 0 27 Jun 2024
Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars Zhaoxuan Wu Xiaoqiang Lin Zhongxiang Dai Wenyang Hu Yao Shu See-Kiong Ng Patrick Jaillet Bryan Kian Hsiang Low 53 11 0 25 May 2024
Batched Nonparametric Contextual Bandits Rong Jiang Cong Ma OffRL 108 1 0 27 Feb 2024
Incentivized Exploration via Filtered Posterior Sampling Anand Kalvit Aleksandrs Slivkins Yonatan Gur 57 2 0 20 Feb 2024
Thompson Sampling in Partially Observable Contextual Bandits Hongju Park Mohamad Kazem Shirani Faradonbeh 67 3 0 15 Feb 2024
Efficient Contextual Bandits with Uninformed Feedback Graphs Mengxiao Zhang Yuheng Zhang Haipeng Luo Paul Mineiro 49 4 0 12 Feb 2024
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces Yaqi Duan Martin J. Wainwright OffRL 52 3 0 10 Jan 2024
Best-of-Both-Worlds Algorithms for Linear Contextual Bandits Yuko Kuroki Alberto Rumi Taira Tsuchiya Fabio Vitale Nicolò Cesa-Bianchi 122 7 0 24 Dec 2023
New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem Koji Ichikawa Shinji Ito Daisuke Hatano Hanna Sumita Takuro Fukunaga Naonori Kakimura Ken-ichi Kawarabayashi 23 0 0 19 Dec 2023
Semidiscrete optimal transport with unknown costs Yinchu Zhu I. Ryzhov OT 22 1 0 01 Oct 2023
Kernel $ε$ -Greedy for Multi-Armed Bandits with Covariates Sakshi Arya Bharath K. Sriperumbudur 137 0 0 29 Jun 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Yuwei Luo Mohsen Bayati 58 1 0 26 Jun 2023
Strategic Apple Tasting Keegan Harris Chara Podimata Zhiwei Steven Wu 84 7 0 09 Jun 2023
Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics Guy Tennenholtz Martin Mladenov Nadav Merlis Robert L. Axtell Craig Boutilier 53 0 0 24 May 2023
Bandit Social Learning: Exploration under Myopic Behavior Kiarash Banihashem Mohammadtaghi Hajiaghayi Suho Shin Aleksandrs Slivkins 430 4 0 15 Feb 2023
Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications Johannes Kirschner Tor Lattimore Andreas Krause 93 8 0 07 Feb 2023
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback Wonyoung Hedge Kim G. Iyengar A. Zeevi 34 3 0 31 Jan 2023
Incentive-Aware Recommender Systems in Two-Sided Markets Xiaowu Dai Wenlu Xu Yuan Qi Michael I. Jordan 50 6 0 23 Nov 2022
Transfer Learning for Contextual Multi-armed Bandits Changxiao Cai T. Tony Cai Hongzhe Li 127 19 0 22 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret Felix Schur Parnian Kassraie Jonas Rothfuss Andreas Krause 79 3 0 27 Oct 2022
Advertising Media and Target Audience Optimization via High-dimensional Bandits Wenjia Ba J. Harrison Harikesh S. Nair 57 0 0 17 Sep 2022
Risk-aware linear bandits with convex loss Patrick Saux Odalric-Ambrym Maillard 54 2 0 15 Sep 2022
Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Wonyoung Hedge Kim Kyungbok Lee M. Paik 102 14 0 15 Sep 2022
$On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits$ On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$ -Geometry and High Dimensional Contextual Bandits Yuxuan Han Zhicong Liang Zhipeng Liang Yang Wang Yuan Yao Jiheng Zhang 66 1 0 16 Jun 2022
Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits Wonyoung Hedge Kim M. Paik Min-whan Oh 56 6 0 11 Jun 2022
Meta Representation Learning with Contextual Linear Bandits Leonardo Cella Karim Lounici Massimiliano Pontil 116 5 0 30 May 2022
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection Peter Henderson Ben Chugg Brandon R. Anderson Kristen M. Altenburger Alex Turk J. Guyton Jacob Goldin Daniel E. Ho OffRL 50 10 0 25 Apr 2022
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations Hongju Park Mohamad Kazem Shirani Faradonbeh OffRL 59 2 0 10 Apr 2022
Truncated LinUCB for Stochastic Linear Bandits Yanglei Song Meng zhou 260 0 0 23 Feb 2022
Multi-task Representation Learning with Stochastic Linear Bandits Leonardo Cella Karim Lounici Grégoire Pacreau Massimiliano Pontil 105 22 0 21 Feb 2022
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts Hongju Park Mohamad Kazem Shirani Faradonbeh 43 6 0 02 Feb 2022
Multitask Learning and Bandits via Robust Statistics Kan Xu Hamsa Bastani 92 6 0 28 Dec 2021
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits Yash J. Patel Mohamad Kazem Shirani Faradonbeh 65 15 0 23 Oct 2021
Active Learning for Contextual Search with Binary Feedbacks Xi Chen Quanquan C. Liu Yining Wang 31 0 0 03 Oct 2021
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification James A. Grant David S. Leslie 82 3 0 29 Sep 2021
Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment Eli Ben-Michael D. J. Greiner Kosuke Imai Zhichao Jiang OffRL 229 22 0 22 Sep 2021
Dynamic Selection in Algorithmic Decision-making Jin Li Ye Luo Xiaowei Zhang 89 2 0 28 Aug 2021
Model Selection for Generic Contextual Bandits Avishek Ghosh Abishek Sankararaman Kannan Ramchandran 76 6 0 07 Jul 2021
On component interactions in two-stage recommender systems Jiri Hron K. Krauth Michael I. Jordan Niki Kilbertus CML LRM 74 31 0 28 Jun 2021
Generalized Linear Bandits with Local Differential Privacy Yuxuan Han Zhipeng Liang Yang Wang Jiheng Zhang 87 32 0 07 Jun 2021
Fair Exploration via Axiomatic Bargaining Jackie Baek Vivek F. Farias FaML 83 29 0 04 Jun 2021