1607.03084
Kernel-based methods for bandit convex optimization
Symposium on the Theory of Computing (STOC), 2016
11 July 2016
Sébastien Bubeck
Ronen Eldan
Y. Lee
Papers citing
"Kernel-based methods for bandit convex optimization"
50 / 89 papers shown
Title
Non-stationary Online Learning for Curved Losses: Improved Dynamic Regret via Mixability
Yu Zhang
Peng Zhao
Masashi Sugiyama
311
1
0
12 Jun 2025
Non-stationary Bandit Convex Optimization: A Comprehensive Study
Xiaoqi Liu
Dorian Baudry
Julian Zimmert
Patrick Rebeschini
Arya Akhavan
204
2
0
03 Jun 2025
Online Episodic Convex Reinforcement Learning
B. Moreno
Khaled Eldowa
Pierre Gaillard
Margaux Brégère
Nadia Oudjane
OffRL
301
0
0
12 May 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
830
1
0
06 Mar 2025
A Regularized Online Newton Method for Stochastic Convex Bandits with Linear Vanishing Noise
Jingxin Zhan
Yuchen Xin
Kaicheng Jin
Zhihua Zhang
271
0
0
19 Jan 2025
Online Newton Method for Bandit Convex Optimisation
Hidde Fokkema
Dirk van der Hoeven
Tor Lattimore
Jack J. Mayo
155
8
0
10 Jun 2024
Adaptive Regret for Bandits Made Possible: Two Queries Suffice
International Conference on Learning Representations (ICLR), 2024
Zhou Lu
Qiuyi Zhang
Xinyi Chen
Fred Zhang
David P. Woodruff
Elad Hazan
172
0
0
17 Jan 2024
Bandit Learning to Rank with Position-Based Click Models: Personalized and Equal Treatments
Tianchen Zhou
Jia-Wei Liu
Yang Jiao
Chaosheng Dong
Yetian Chen
Yan Gao
Yi Sun
OffRL
159
4
0
08 Nov 2023
Bayesian Design Principles for Frequentist Sequential Learning
International Conference on Machine Learning (ICML), 2023
Yunbei Xu
A. Zeevi
430
16
0
01 Oct 2023
Anytime Model Selection in Linear Bandits
Neural Information Processing Systems (NeurIPS), 2023
Parnian Kassraie
N. Emmenegger
Andreas Krause
Aldo Pacchiano
289
7
0
24 Jul 2023
Fast Submodular Function Maximization
Lianke Qin
Zhao Song
Yitan Wang
157
11
0
15 May 2023
A Certified Radius-Guided Attack Framework to Image Segmentation Models
European Symposium on Security and Privacy (Euro S&P), 2023
Wenjie Qu
Youqi Li
Binghui Wang
AAML
128
5
0
05 Apr 2023
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits
Annals of Statistics (Ann. Stat.), 2023
Nived Rajaraman
Yanjun Han
Jiantao Jiao
Kannan Ramchandran
385
3
0
12 Feb 2023
A Second-Order Method for Stochastic Bandit Convex Optimisation
Annual Conference Computational Learning Theory (COLT), 2023
Tor Lattimore
András Gyorgy
131
8
0
10 Feb 2023
Bandit Convex Optimisation Revisited: FTRL Achieves $\tilde{O}(t^{1/2})$ Regret
David Young
D. Leith
Georgios Iosifidis
198
0
0
01 Feb 2023
Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient
Annual Conference Computational Learning Theory (COLT), 2023
Dylan J. Foster
Noah Golowich
Yanjun Han
OffRL
202
29
0
19 Jan 2023
Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression
Annual Conference Computational Learning Theory (COLT), 2022
Aleksandrs Slivkins
Xingyu Zhou
Karthik Abinav Sankararaman
Dylan J. Foster
227
28
0
14 Nov 2022
Online Convex Optimization with Unbounded Memory
Neural Information Processing Systems (NeurIPS), 2022
Raunak Kumar
Sarah Dean
Robert D. Kleinberg
403
9
0
18 Oct 2022
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique
Elissa Mhanna
Mohamad Assaad
186
4
0
11 Oct 2022
On Adaptivity in Non-stationary Stochastic Optimization With Bandit Feedback
Yining Wang
130
6
0
11 Oct 2022
An Efficient Algorithm for Fair Multi-Agent Multi-Armed Bandit with Low Regret
AAAI Conference on Artificial Intelligence (AAAI), 2022
Matthew D. Jones
Huy Le Nguyen
Thy Nguyen
FaML
236
9
0
23 Sep 2022
A Unifying Framework for Online Optimization with Long-Term Constraints
Neural Information Processing Systems (NeurIPS), 2022
Matteo Castiglioni
A. Celli
A. Marchesi
Giulia Romano
N. Gatti
130
48
0
15 Sep 2022
Learning in Stackelberg Games with Non-myopic Agents
ACM Conference on Economics and Computation (EC), 2022
Nika Haghtalab
Thodoris Lykouris
Sloan Nietert
Alexander Wei
313
40
0
19 Aug 2022
A Near-Optimal Algorithm for Univariate Zeroth-Order Budget Convex Optimization
François Bachoc
Tommaso Cesari
Roberto Colomboni
Andrea Paudice
176
2
0
13 Aug 2022
A Note on Zeroth-Order Optimization on the Simplex
Tijana Zrnic
Eric Mazumdar
161
0
0
02 Aug 2022
Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning
Journal of Machine Learning Research (JMLR), 2022
Ilnura N. Usmanova
Yarden As
Maryam Kamgarpour
Andreas Krause
OffRL
192
13
0
21 Jul 2022
On the Complexity of Adversarial Decision Making
Neural Information Processing Systems (NeurIPS), 2022
Dylan J. Foster
Alexander Rakhlin
Ayush Sekhari
Karthik Sridharan
AAML
142
30
0
27 Jun 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization
IEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2022
Quan-Wu Xiao
Qing Ling
Tianyi Chen
160
3
0
14 Jun 2022
Building Robust Ensembles via Margin Boosting
International Conference on Machine Learning (ICML), 2022
Dinghuai Zhang
Hongyang R. Zhang
Aaron Courville
Yoshua Bengio
Pradeep Ravikumar
A. Suggala
AAML
UQCV
134
17
0
07 Jun 2022
A gradient estimator via L1-randomization for online zero-order optimization with two point feedback
Neural Information Processing Systems (NeurIPS), 2022
A. Akhavan
Evgenii Chzhen
Massimiliano Pontil
Alexandre B. Tsybakov
328
27
0
27 May 2022
Adaptive Bandit Convex Optimization with Heterogeneous Curvature
Annual Conference Computational Learning Theory (COLT), 2022
Haipeng Luo
Mengxiao Zhang
Penghui Zhao
182
5
0
12 Feb 2022
Doubly Optimal No-Regret Online Learning in Strongly Monotone Games with Bandit Feedback
Operational Research (OR), 2021
Wenjia Ba
Tianyi Lin
Jiawei Zhang
Zhengyuan Zhou
242
17
0
06 Dec 2021
Bandit problems with fidelity rewards
Gábor Lugosi
Ciara Pike-Burke
Pierre-André Savalle
113
0
0
25 Nov 2021
Uncoupled Bandit Learning towards Rationalizability: Benchmarks, Barriers, and Algorithms
Jibang Wu
Haifeng Xu
Fan Yao
268
0
0
10 Nov 2021
Zeroth-order non-convex learning via hierarchical dual averaging
Amélie Héliou
Matthieu Martin
P. Mertikopoulos
Thibaud Rahier
138
11
0
13 Sep 2021
Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Neural Information Processing Systems (NeurIPS), 2021
Baihe Huang
Kaixuan Huang
Sham Kakade
Jason D. Lee
Qi Lei
Runzhe Wang
Jiaqi Yang
495
19
0
09 Jul 2021
Who Leads and Who Follows in Strategic Classification?
Tijana Zrnic
Eric Mazumdar
S. Shankar Sastry
Sai Li
167
63
0
23 Jun 2021
Minimax Regret for Bandit Convex Optimisation of Ridge Functions
Tor Lattimore
134
3
0
01 Jun 2021
Distributed Zeroth-Order Stochastic Optimization in Time-varying Networks
Wenjie Li
Mohamad Assaad
154
3
0
26 May 2021
Optimal Stochastic Nonconvex Optimization with Bandit Feedback
Puning Zhao
Lifeng Lai
253
3
0
30 Mar 2021
No Weighted-Regret Learning in Adversarial Bandits with Delays
Journal of Machine Learning Research (JMLR), 2021
Ilai Bistritz
Zhengyuan Zhou
Xi Chen
Nicholas Bambos
Jose H. Blanchet
252
11
0
08 Mar 2021
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games
AAAI Conference on Artificial Intelligence (AAAI), 2021
Gabriele Farina
Robin Schmucker
Tuomas Sandholm
260
22
0
08 Mar 2021
Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games
AAAI Conference on Artificial Intelligence (AAAI), 2021
Gabriele Farina
Tuomas Sandholm
OffRL
157
22
0
08 Mar 2021
Optimal Regret Algorithm for Pseudo-1d Bandit Convex Optimization
International Conference on Machine Learning (ICML), 2021
Aadirupa Saha
Nagarajan Natarajan
Praneeth Netrapalli
Prateek Jain
118
5
0
15 Feb 2021
Online Markov Decision Processes with Aggregate Bandit Feedback
Annual Conference Computational Learning Theory (COLT), 2021
Alon Cohen
Haim Kaplan
Tomer Koren
Yishay Mansour
OffRL
208
9
0
31 Jan 2021
Projection-Free Bandit Optimization with Privacy Guarantees
AAAI Conference on Artificial Intelligence (AAAI), 2020
Alina Ene
Huy Le Nguyen
Adrian Vladu
149
3
0
22 Dec 2020
Online non-convex optimization with imperfect feedback
Neural Information Processing Systems (NeurIPS), 2020
Amélie Héliou
Matthieu Martin
P. Mertikopoulos
Thibaud Rahier
149
17
0
16 Oct 2020
Boosting One-Point Derivative-Free Online Optimization via Residual Feedback
Yan Zhang
Yi Zhou
Kaiyi Ji
Michael M. Zavlanos
231
12
0
14 Oct 2020
Regret minimization in stochastic non-convex learning via a proximal-gradient approach
Nadav Hallak
P. Mertikopoulos
Volkan Cevher
146
24
0
13 Oct 2020
Mirror Descent and the Information Ratio
Annual Conference Computational Learning Theory (COLT), 2020
Tor Lattimore
András Gyorgy
210
44
0
25 Sep 2020