Kernel-based methods for bandit convex optimization

Symposium on the Theory of Computing (STOC), 2016

11 July 2016

Papers citing "Kernel-based methods for bandit convex optimization"

Non-stationary Online Learning for Curved Losses: Improved Dynamic Regret via Mixability Yu Zhang Peng Zhao Masashi Sugiyama 311 1 0 12 Jun 2025
Non-stationary Bandit Convex Optimization: A Comprehensive Study Xiaoqi Liu Dorian Baudry Julian Zimmert Patrick Rebeschini Arya Akhavan 204 2 0 03 Jun 2025
Online Episodic Convex Reinforcement Learning B. Moreno Khaled Eldowa Pierre Gaillard Margaux Brégère Nadia Oudjane OffRL 301 0 0 12 May 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 830 1 0 06 Mar 2025
A Regularized Online Newton Method for Stochastic Convex Bandits with Linear Vanishing Noise Jingxin Zhan Yuchen Xin Kaicheng Jin Zhihua Zhang 271 0 0 19 Jan 2025
Online Newton Method for Bandit Convex Optimisation Hidde Fokkema Dirk van der Hoeven Tor Lattimore Jack J. Mayo 155 8 0 10 Jun 2024
Adaptive Regret for Bandits Made Possible: Two Queries Suffice. International Conference on Learning Representations (ICLR), 2024 Zhou Lu Qiuyi Zhang Xinyi Chen Fred Zhang David P. Woodruff Elad Hazan 172 0 0 17 Jan 2024
Bandit Learning to Rank with Position-Based Click Models: Personalized and Equal Treatments Tianchen Zhou Jia-Wei Liu Yang Jiao Chaosheng Dong Yetian Chen Yan Gao Yi Sun OffRL 159 4 0 08 Nov 2023
Bayesian Design Principles for Frequentist Sequential Learning. International Conference on Machine Learning (ICML), 2023 Yunbei Xu A. Zeevi 430 16 0 01 Oct 2023
Anytime Model Selection in Linear Bandits. Neural Information Processing Systems (NeurIPS), 2023 Parnian Kassraie N. Emmenegger Andreas Krause Aldo Pacchiano 289 7 0 24 Jul 2023
Fast Submodular Function Maximization Lianke Qin Zhao Song Yitan Wang 157 11 0 15 May 2023
A Certified Radius-Guided Attack Framework to Image Segmentation Models. European Symposium on Security and Privacy (Euro S&P), 2023 Wenjie Qu Youqi Li Binghui Wang AAML 128 5 0 05 Apr 2023
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits. Annals of Statistics (Ann. Stat.), 2023 Nived Rajaraman Yanjun Han Jiantao Jiao Kannan Ramchandran 385 3 0 12 Feb 2023
A Second-Order Method for Stochastic Bandit Convex Optimisation. Annual Conference Computational Learning Theory (COLT), 2023 Tor Lattimore András Gyorgy 131 8 0 10 Feb 2023
Bandit Convex Optimisation Revisited: FTRL Achieves $\tilde{O}(t^{1/2})$ Regret David Young D. Leith Georgios Iosifidis 198 0 0 01 Feb 2023
Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient. Annual Conference Computational Learning Theory (COLT), 2023 Dylan J. Foster Noah Golowich Yanjun Han OffRL 202 29 0 19 Jan 2023
Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression. Annual Conference Computational Learning Theory (COLT), 2022 Aleksandrs Slivkins Xingyu Zhou Karthik Abinav Sankararaman Dylan J. Foster 227 28 0 14 Nov 2022
Online Convex Optimization with Unbounded Memory. Neural Information Processing Systems (NeurIPS), 2022 Raunak Kumar Sarah Dean Robert D. Kleinberg 403 9 0 18 Oct 2022
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique Elissa Mhanna Mohamad Assaad 186 4 0 11 Oct 2022
On Adaptivity in Non-stationary Stochastic Optimization With Bandit Feedback Yining Wang 130 6 0 11 Oct 2022
An Efficient Algorithm for Fair Multi-Agent Multi-Armed Bandit with Low Regret. AAAI Conference on Artificial Intelligence (AAAI), 2022 Matthew D. Jones Huy Le Nguyen Thy Nguyen FaML 236 9 0 23 Sep 2022
A Unifying Framework for Online Optimization with Long-Term Constraints. Neural Information Processing Systems (NeurIPS), 2022 Matteo Castiglioni A. Celli A. Marchesi Giulia Romano N. Gatti 130 48 0 15 Sep 2022
Learning in Stackelberg Games with Non-myopic Agents. ACM Conference on Economics and Computation (EC), 2022 Nika Haghtalab Thodoris Lykouris Sloan Nietert Alexander Wei 313 40 0 19 Aug 2022
A Near-Optimal Algorithm for Univariate Zeroth-Order Budget Convex Optimization François Bachoc Tommaso Cesari Roberto Colomboni Andrea Paudice 176 2 0 13 Aug 2022
A Note on Zeroth-Order Optimization on the Simplex Tijana Zrnic Eric Mazumdar 161 0 0 02 Aug 2022
Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning. Journal of Machine Learning Research (JMLR), 2022 Ilnura N. Usmanova Yarden As Maryam Kamgarpour Andreas Krause OffRL 192 13 0 21 Jul 2022
On the Complexity of Adversarial Decision Making. Neural Information Processing Systems (NeurIPS), 2022 Dylan J. Foster Alexander Rakhlin Ayush Sekhari Karthik Sridharan AAML 142 30 0 27 Jun 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization. IEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2022 Quan-Wu Xiao Qing Ling Tianyi Chen 160 3 0 14 Jun 2022
Building Robust Ensembles via Margin Boosting. International Conference on Machine Learning (ICML), 2022 Dinghuai Zhang Hongyang R. Zhang Aaron Courville Yoshua Bengio Pradeep Ravikumar A. Suggala AAML UQCV 134 17 0 07 Jun 2022
A gradient estimator via L1-randomization for online zero-order optimization with two point feedback. Neural Information Processing Systems (NeurIPS), 2022 A. Akhavan Evgenii Chzhen Massimiliano Pontil Alexandre B. Tsybakov 328 27 0 27 May 2022
Adaptive Bandit Convex Optimization with Heterogeneous Curvature. Annual Conference Computational Learning Theory (COLT), 2022 Haipeng Luo Mengxiao Zhang Penghui Zhao 182 5 0 12 Feb 2022
Doubly Optimal No-Regret Online Learning in Strongly Monotone Games with Bandit Feedback. Operational Research (OR), 2021 Wenjia Ba Tianyi Lin Jiawei Zhang Zhengyuan Zhou 242 17 0 06 Dec 2021
Bandit problems with fidelity rewards Gábor Lugosi Ciara Pike-Burke Pierre-André Savalle 113 0 0 25 Nov 2021
Uncoupled Bandit Learning towards Rationalizability: Benchmarks, Barriers, and Algorithms Jibang Wu Haifeng Xu Fan Yao 268 0 0 10 Nov 2021
Zeroth-order non-convex learning via hierarchical dual averaging Amélie Héliou Matthieu Martin P. Mertikopoulos Thibaud Rahier 138 11 0 13 Sep 2021
Optimal Gradient-based Algorithms for Non-concave Bandit Optimization. Neural Information Processing Systems (NeurIPS), 2021 Baihe Huang Kaixuan Huang Sham Kakade Jason D. Lee Qi Lei Runzhe Wang Jiaqi Yang 495 19 0 09 Jul 2021
Who Leads and Who Follows in Strategic Classification? Tijana Zrnic Eric Mazumdar S. Shankar Sastry Sai Li 167 63 0 23 Jun 2021
Minimax Regret for Bandit Convex Optimisation of Ridge Functions Tor Lattimore 134 3 0 01 Jun 2021
Distributed Zeroth-Order Stochastic Optimization in Time-varying Networks Wenjie Li Mohamad Assaad 154 3 0 26 May 2021
Optimal Stochastic Nonconvex Optimization with Bandit Feedback Puning Zhao Lifeng Lai 253 3 0 30 Mar 2021
No Weighted-Regret Learning in Adversarial Bandits with Delays. Journal of Machine Learning Research (JMLR), 2021 Ilai Bistritz Zhengyuan Zhou Xi Chen Nicholas Bambos Jose H. Blanchet 252 11 0 08 Mar 2021
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games. AAAI Conference on Artificial Intelligence (AAAI), 2021 Gabriele Farina Robin Schmucker Tuomas Sandholm 260 22 0 08 Mar 2021
Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games. AAAI Conference on Artificial Intelligence (AAAI), 2021 Gabriele Farina Tuomas Sandholm OffRL 157 22 0 08 Mar 2021
Optimal Regret Algorithm for Pseudo-1d Bandit Convex Optimization. International Conference on Machine Learning (ICML), 2021 Aadirupa Saha Nagarajan Natarajan Praneeth Netrapalli Prateek Jain 118 5 0 15 Feb 2021
Online Markov Decision Processes with Aggregate Bandit Feedback. Annual Conference Computational Learning Theory (COLT), 2021 Alon Cohen Haim Kaplan Tomer Koren Yishay Mansour OffRL 208 9 0 31 Jan 2021
Projection-Free Bandit Optimization with Privacy Guarantees. AAAI Conference on Artificial Intelligence (AAAI), 2020 Alina Ene Huy Le Nguyen Adrian Vladu 149 3 0 22 Dec 2020
Online non-convex optimization with imperfect feedback. Neural Information Processing Systems (NeurIPS), 2020 Amélie Héliou Matthieu Martin P. Mertikopoulos Thibaud Rahier 149 17 0 16 Oct 2020
Boosting One-Point Derivative-Free Online Optimization via Residual Feedback Yan Zhang Yi Zhou Kaiyi Ji Michael M. Zavlanos 231 12 0 14 Oct 2020
Regret minimization in stochastic non-convex learning via a proximal-gradient approach Nadav Hallak P. Mertikopoulos Volkan Cevher 146 24 0 13 Oct 2020
Mirror Descent and the Information Ratio. Annual Conference Computational Learning Theory (COLT), 2020 Tor Lattimore András Gyorgy 210 44 0 25 Sep 2020