1507.08752
Cited By
An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback
Journal of machine learning research (JMLR), 2015
31 July 2015
Ohad Shamir
Papers citing "An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback"
50 / 114 papers shown
ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models
Lejs Deen Behric
Liang Zhang
Bingcong Li
K. K. Thekumparampil
168
0
0
04 Nov 2025
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
Shaocong Ma
Heng Huang
188
3
0
22 Oct 2025
Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
Xuchen Gong
Tian Li
183
0
0
17 Oct 2025
Multi-Objective min-max Online Convex Optimization
Rahul Vaze
Sumiran Mishra
202
0
0
15 Oct 2025
Achieve Performatively Optimal Policy for Performative Reinforcement Learning
Ziyi Chen
Heng Huang
134
0
0
06 Oct 2025
High-Probability Analysis of Online and Federated Zero-Order Optimisation
Arya Akhavan
David Janz
El-Mahdi El-Mhamdi
FedML
317
0
0
25 Sep 2025
SUA: Stealthy Multimodal Large Language Model Unlearning Attack
Xianren Zhang
Hui Liu
Delvin Ce Zhang
Xianfeng Tang
Qi He
Dongwon Lee
Suhang Wang
MU
AAML
317
2
0
10 Jun 2025
A Structured Tour of Optimization with Finite Differences
Marco Rando
C. Molinari
Lorenzo Rosasco
S. Villa
473
0
0
26 May 2025
Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training
Qitao Tan
Sung-En Chang
Rui Xia
Huidong Ji
Chence Yang
...
Zhou Zou
Yijiao Wang
Yanzhi Wang
Jin Lu
Geng Yuan
534
7
0
28 Apr 2025
Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks
Yequan Zhao
Xinling Yu
Xian Xiao
Zhe Chen
Ziyue Liu
G. Kurczveil
R. Beausoleil
Shixuan Liu
Zheng Zhang
411
1
0
17 Feb 2025
Solving Infinite-Player Games with Player-to-Strategy Networks
Carlos Martin
Tuomas Sandholm
233
0
0
17 Jan 2025
Accelerated zero-order SGD under high-order smoothness and overparameterized regime
Nelineinaya Dinamika (ND), 2024
Georgii Bychkov
D. Dvinskikh
Anastasia Antsiferova
Alexander Gasnikov
Aleksandr Lobanov
318
1
0
21 Nov 2024
Online Convex Optimization with Memory and Limited Predictions
Lintao Ye
Zhengmiao Wang
Zhi-Wei Liu
Ming Chi
Xiaoling Wang
Housheng Su
347
0
0
31 Oct 2024
Improved Sample Complexity for Private Nonsmooth Nonconvex Optimization
Guy Kornowski
Daogao Liu
Kunal Talwar
328
3
0
08 Oct 2024
Risk-averse learning with delayed feedback
Siyi Wang
Zifan Wang
Karl H. Johansson
Sandra Hirche
318
0
0
25 Sep 2024
Distributed Online Bandit Nonconvex Optimization with One-Point Residual Feedback via Dynamic Regret
Youqing Hua
Shuai Liu
Yiguang Hong
Karl Henrik Johansson
Guangchen Wang
222
2
0
24 Sep 2024
Joint-perturbation simultaneous pseudo-gradient
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Carlos Martin
Tuomas Sandholm
370
2
0
17 Aug 2024
Private Zeroth-Order Nonsmooth Nonconvex Optimization
Qinzi Zhang
Hoang Tran
Ashok Cutkosky
313
7
0
27 Jun 2024
First-Order Methods for Linearly Constrained Bilevel Optimization
Guy Kornowski
Swati Padmanabhan
Kai Wang
Zhe Zhang
S. Sra
474
15
0
18 Jun 2024
AlphaZeroES: Direct score maximization outperforms planning loss minimization
Carlos Martin
Tuomas Sandholm
204
0
0
12 Jun 2024
Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization
Emre Sahinoglu
Shahin Shahrampour
369
10
0
03 Jun 2024
Mollification Effects of Policy Gradient Methods
Tao Wang
Sylvia Herbert
Sicun Gao
318
2
0
28 May 2024
A New Formulation for Zeroth-Order Optimization of Adversarial EXEmples in Malware Detection
Marco Rando
Christian Scano
Lorenzo Rosasco
Fabio Roli
AAML
351
3
0
23 May 2024
Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization
International Conference on Machine Learning (ICML), 2024
S. Reifenstein
T. Leleu
Yoshihisa Yamamoto
300
3
0
02 May 2024
Test-Time Model Adaptation with Only Forward Passes
International Conference on Machine Learning (ICML), 2024
Shuaicheng Niu
Chunyan Miao
Guohao Chen
Pengcheng Wu
Peilin Zhao
TTA
532
76
0
02 Apr 2024
Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization
International Conference on Learning Representations (ICLR), 2024
M. Pedramfar
Yididiya Y. Nadew
Christopher J. Quinn
Vaneet Aggarwal
255
4
0
15 Mar 2024
Improved Regret for Bandit Convex Optimization with Delayed Feedback
Yuanyu Wan
Chang Yao
Weilong Dai
Lijun Zhang
391
9
0
14 Feb 2024
Federated Learning Can Find Friends That Are Advantageous
N. Tupitsa
Samuel Horváth
Martin Takáč
Eduard A. Gorbunov
FedML
528
2
0
07 Feb 2024
Stochastic Two Points Method for Deep Model Zeroth-order Optimization
Yijiang Pang
Jiayu Zhou
490
2
0
02 Feb 2024
ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-order Optimization
Shuoran Jiang
Qingcai Chen
Youcheng Pan
Yang Xiang
Yukang Lin
Xiangping Wu
Chuanyi Liu
Xiaobao Song
ODL
238
24
0
23 Dec 2023
Federated Online and Bandit Convex Optimization
International Conference on Machine Learning (ICML), 2023
Kumar Kshitij Patel
Lingxiao Wang
Aadirupa Saha
Nathan Srebro
FedML
334
11
0
29 Nov 2023
Payoff-based learning with matrix multiplicative weights in quantum games
Neural Information Processing Systems (NeurIPS), 2023
Kyriakos Lotidis
P. Mertikopoulos
Nicholas Bambos
Jose Blanchet
210
2
0
04 Nov 2023
Decentralized Gradient-Free Methods for Stochastic Non-Smooth Non-Convex Optimization
AAAI Conference on Artificial Intelligence (AAAI), 2023
Zhenwei Lin
Jingfan Xia
Qi Deng
Luo Luo
236
10
0
18 Oct 2023
Multi-point Feedback of Bandit Convex Optimization with Hard Constraints
Yasunari Hikima
340
0
0
17 Oct 2023
DPZero: Private Fine-Tuning of Language Models without Backpropagation
Liang Zhang
Bingcong Li
K. K. Thekumparampil
Sewoong Oh
Niao He
527
24
0
14 Oct 2023
Tensor-Compressed Back-Propagation-Free Training for (Physics-Informed) Neural Networks
Yequan Zhao
Xinling Yu
Zhixiong Chen
Ziyue Liu
Sijia Liu
Zheng Zhang
PINN
239
14
0
18 Aug 2023
AI planning in the imagination: High-level planning on learned abstract search spaces
Carlos Martin
Tuomas Sandholm
241
0
0
16 Aug 2023
An Algorithm with Optimal Dimension-Dependence for Zero-Order Nonsmooth Nonconvex Stochastic Optimization
Journal of machine learning research (JMLR), 2023
Guy Kornowski
Ohad Shamir
414
29
0
10 Jul 2023
Gradient is All You Need? How Consensus-Based Optimization can be Interpreted as a Stochastic Relaxation of Gradient Descent
Konstantin Riedl
T. Klock
Carina Geldhauser
M. Fornasier
309
10
0
16 Jun 2023
Gradient-free optimization of highly smooth functions: improved analysis and a new algorithm
A. Akhavan
Evgenii Chzhen
Massimiliano Pontil
Alexandre B. Tsybakov
276
20
0
03 Jun 2023
Fine-Tuning Language Models with Just Forward Passes
Neural Information Processing Systems (NeurIPS), 2023
Sadhika Malladi
Tianyu Gao
Eshaan Nichani
Alexandru Damian
Jason D. Lee
Danqi Chen
Sanjeev Arora
698
370
0
27 May 2023
A Unified Approach for Maximizing Continuous DR-submodular Functions
Neural Information Processing Systems (NeurIPS), 2023
M. Pedramfar
Christopher J. Quinn
Vaneet Aggarwal
384
10
0
26 May 2023
Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits
Computational Management Science (CMS), 2023
Yuriy Dorn
Kornilov Nikita
N. Kutuzov
A. Nazin
Eduard A. Gorbunov
Alexander Gasnikov
374
5
0
11 May 2023
Performative Prediction with Bandit Feedback: Learning through Reparameterization
International Conference on Machine Learning (ICML), 2023
Yatong Chen
Wei Tang
Chien-Ju Ho
Yang Liu
538
12
0
01 May 2023
PyXAB -- A Python Library for $\mathcal{X}$-Armed Bandit and Online Blackbox Optimization Algorithms
Wenjie Li
Haoze Li
Jean Honorio
Qifan Song
GP
244
6
0
07 Mar 2023
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
IEEE Control Systems Letters (L-CSS), 2023
Xiangyuan Zhang
Tamer Basar
387
25
0
25 Feb 2023
Online Convex Optimization with Stochastic Constraints: Zero Constraint Violation and Bandit Feedback
Y. Kim
Dabeen Lee
397
5
0
26 Jan 2023
ApproxED: Approximate exploitability descent via learned best responses
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Carlos Martin
Tuomas Sandholm
495
2
0
20 Jan 2023
Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization
International Conference on Machine Learning (ICML), 2023
Le‐Yu Chen
Jing Xu
Luo Luo
313
26
0
16 Jan 2023
Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networks
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Carlos Martin
Tuomas Sandholm
243
11
0
29 Nov 2022