An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback

Journal of Machine Learning Research (JMLR), 2015

31 July 2015

Ohad Shamir


Papers citing "An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback"

50 / 114 papers shown

ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models

168

04 Nov 2025

On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization

Shaocong Ma

Heng Huang

188

22 Oct 2025

Zeroth-Order Sharpness-Aware Learning with Exponential Tilting

Xuchen Gong

Tian Li

183

17 Oct 2025

Multi-Objective $\textit{min-max}$ Online Convex Optimization

Rahul Vaze

Sumiran Mishra

202

15 Oct 2025

Achieve Performatively Optimal Policy for Performative Reinforcement Learning

Ziyi Chen

Heng Huang

134

06 Oct 2025

High-Probability Analysis of Online and Federated Zero-Order Optimisation

317

25 Sep 2025

SUA: Stealthy Multimodal Large Language Model Unlearning Attack

317

10 Jun 2025

A Structured Tour of Optimization with Finite Differences

473

26 May 2025

Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training

...

534

28 Apr 2025

Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks

411

17 Feb 2025

Solving Infinite-Player Games with Player-to-Strategy Networks

Carlos Martin

Tuomas Sandholm

233

17 Jan 2025

Accelerated zero-order SGD under high-order smoothness and overparameterized regime

Nelineinaya Dinamika (ND), 2024

Georgii Bychkov

D. Dvinskikh

Anastasia Antsiferova

Alexander Gasnikov

Aleksandr Lobanov

318

21 Nov 2024

Online Convex Optimization with Memory and Limited Predictions

347

31 Oct 2024

Improved Sample Complexity for Private Nonsmooth Nonconvex Optimization

Guy Kornowski

Daogao Liu

Kunal Talwar

328

08 Oct 2024

Risk-averse learning with delayed feedback

318

25 Sep 2024

Distributed Online Bandit Nonconvex Optimization with One-Point Residual Feedback via Dynamic Regret

Youqing Hua

Shuai Liu

Yiguang Hong

Karl Henrik Johansson

Guangchen Wang

222

24 Sep 2024

Joint-perturbation simultaneous pseudo-gradient

International Joint Conference on Artificial Intelligence (IJCAI), 2024

Carlos Martin

Tuomas Sandholm

370

17 Aug 2024

Private Zeroth-Order Nonsmooth Nonconvex Optimization

Qinzi Zhang

Hoang Tran

Ashok Cutkosky

313

27 Jun 2024

First-Order Methods for Linearly Constrained Bilevel Optimization

474

18 Jun 2024

AlphaZeroES: Direct score maximization outperforms planning loss minimization

Carlos Martin

Tuomas Sandholm

204

12 Jun 2024

Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization

Emre Sahinoglu

Shahin Shahrampour

369

03 Jun 2024

Mollification Effects of Policy Gradient Methods

Tao Wang

Sylvia Herbert

Sicun Gao

318

28 May 2024

A New Formulation for Zeroth-Order Optimization of Adversarial EXEmples in Malware Detection

Fabio Roli

351

23 May 2024

Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization

International Conference on Machine Learning (ICML), 2024

S. Reifenstein

T. Leleu

Yoshihisa Yamamoto

300

02 May 2024

Test-Time Model Adaptation with Only Forward Passes

International Conference on Machine Learning (ICML), 2024

532

02 Apr 2024

Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization

International Conference on Learning Representations (ICLR), 2024

255

15 Mar 2024

Improved Regret for Bandit Convex Optimization with Delayed Feedback

391

14 Feb 2024

Federated Learning Can Find Friends That Are Advantageous

528

07 Feb 2024

Stochastic Two Points Method for Deep Model Zeroth-order Optimization

Yijiang Pang

Jiayu Zhou

490

02 Feb 2024

ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-order Optimization

Shuoran Jiang

238

23 Dec 2023

Federated Online and Bandit Convex Optimization

International Conference on Machine Learning (ICML), 2023

334

29 Nov 2023

Payoff-based learning with matrix multiplicative weights in quantum games

Neural Information Processing Systems (NeurIPS), 2023

210

04 Nov 2023

Decentralized Gradient-Free Methods for Stochastic Non-Smooth Non-Convex Optimization

AAAI Conference on Artificial Intelligence (AAAI), 2023

236

18 Oct 2023

Multi-point Feedback of Bandit Convex Optimization with Hard Constraints

Yasunari Hikima

340

17 Oct 2023

DPZero: Private Fine-Tuning of Language Models without Backpropagation

527

14 Oct 2023

Tensor-Compressed Back-Propagation-Free Training for (Physics-Informed) Neural Networks

239

18 Aug 2023

AI planning in the imagination: High-level planning on learned abstract search spaces

Carlos Martin

Tuomas Sandholm

241

16 Aug 2023

An Algorithm with Optimal Dimension-Dependence for Zero-Order Nonsmooth Nonconvex Stochastic Optimization

Journal of Machine Learning Research (JMLR), 2023

Guy Kornowski

Ohad Shamir

414

10 Jul 2023

Gradient is All You Need? How Consensus-Based Optimization can be Interpreted as a Stochastic Relaxation of Gradient Descent

309

16 Jun 2023

Gradient-free optimization of highly smooth functions: improved analysis and a new algorithm

A. Akhavan

Evgenii Chzhen

Massimiliano Pontil

Alexandre B. Tsybakov

276

03 Jun 2023

Fine-Tuning Language Models with Just Forward Passes

Neural Information Processing Systems (NeurIPS), 2023

698

370

27 May 2023

A Unified Approach for Maximizing Continuous DR-submodular Functions

Neural Information Processing Systems (NeurIPS), 2023

M. Pedramfar

Christopher J. Quinn

Vaneet Aggarwal

384

26 May 2023

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

Computational Management Science (CMS), 2023

Alexander Gasnikov

374

11 May 2023

Performative Prediction with Bandit Feedback: Learning through Reparameterization

International Conference on Machine Learning (ICML), 2023

Yatong Chen

Wei Tang

Chien-Ju Ho

Yang Liu

538

01 May 2023

PyXAB -- A Python Library for $\mathcal{X}$-Armed Bandit and Online Blackbox Optimization Algorithms

244

07 Mar 2023

Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient

IEEE Control Systems Letters (L-CSS), 2023

Xiangyuan Zhang

Tamer Basar

387

25 Feb 2023

Online Convex Optimization with Stochastic Constraints: Zero Constraint Violation and Bandit Feedback

Y. Kim

Dabeen Lee

397

26 Jan 2023

ApproxED: Approximate exploitability descent via learned best responses

Adaptive Agents and Multi-Agent Systems (AAMAS), 2023

Carlos Martin

Tuomas Sandholm

495

20 Jan 2023

Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization

International Conference on Machine Learning (ICML), 2023

Le‐Yu Chen

Jing Xu

Luo Luo

313

16 Jan 2023

Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networks

International Joint Conference on Artificial Intelligence (IJCAI), 2022

Carlos Martin

Tuomas Sandholm

243

29 Nov 2022