ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.08752
  4. Cited By
An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with
  Two-Point Feedback

An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback

Journal of machine learning research (JMLR), 2015
31 July 2015
Ohad Shamir
ArXiv (abs)PDFHTML

Papers citing "An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback"

50 / 114 papers shown
ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models
ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models
Lejs Deen Behric
Liang Zhang
Bingcong Li
K. K. Thekumparampil
168
0
0
04 Nov 2025
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
Shaocong Ma
Heng Huang
188
3
0
22 Oct 2025
Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
Xuchen Gong
Tian Li
183
0
0
17 Oct 2025
Multi-Objective $\textit{min-max}$ Online Convex Optimization
Multi-Objective min-max\textit{min-max}min-max Online Convex Optimization
Rahul Vaze
Sumiran Mishra
202
0
0
15 Oct 2025
Achieve Performatively Optimal Policy for Performative Reinforcement Learning
Achieve Performatively Optimal Policy for Performative Reinforcement Learning
Ziyi Chen
Heng Huang
134
0
0
06 Oct 2025
High-Probability Analysis of Online and Federated Zero-Order Optimisation
High-Probability Analysis of Online and Federated Zero-Order Optimisation
Arya Akhavan
David Janz
El-Mahdi El-Mhamdi
FedML
317
0
0
25 Sep 2025
SUA: Stealthy Multimodal Large Language Model Unlearning Attack
SUA: Stealthy Multimodal Large Language Model Unlearning Attack
Xianren Zhang
Hui Liu
Delvin Ce Zhang
Xianfeng Tang
Qi He
Dongwon Lee
Suhang Wang
MUAAML
317
2
0
10 Jun 2025
A Structured Tour of Optimization with Finite Differences
A Structured Tour of Optimization with Finite Differences
Marco Rando
C. Molinari
Lorenzo Rosasco
S. Villa
473
0
0
26 May 2025
Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training
Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training
Qitao Tan
Sung-En Chang
Rui Xia
Huidong Ji
Chence Yang
...
Zhou Zou
Yijiao Wang
Yanzhi Wang
Jin Lu
Geng Yuan
534
7
0
28 Apr 2025
Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks
Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks
Yequan Zhao
Xinling Yu
Xian Xiao
Zhe Chen
Ziyue Liu
G. Kurczveil
R. Beausoleil
Shixuan Liu
Zheng Zhang
411
1
0
17 Feb 2025
Solving Infinite-Player Games with Player-to-Strategy Networks
Solving Infinite-Player Games with Player-to-Strategy Networks
Carlos Martin
Tuomas Sandholm
233
0
0
17 Jan 2025
Accelerated zero-order SGD under high-order smoothness and
  overparameterized regime
Accelerated zero-order SGD under high-order smoothness and overparameterized regimeNelineinaya Dinamika (ND), 2024
Georgii Bychkov
D. Dvinskikh
Anastasia Antsiferova
Alexander Gasnikov
Aleksandr Lobanov
318
1
0
21 Nov 2024
Online Convex Optimization with Memory and Limited Predictions
Online Convex Optimization with Memory and Limited Predictions
Lintao Ye
Zhengmiao Wang
Zhi-Wei Liu
Ming Chi
Xiaoling Wang
Housheng Su
347
0
0
31 Oct 2024
Improved Sample Complexity for Private Nonsmooth Nonconvex Optimization
Improved Sample Complexity for Private Nonsmooth Nonconvex Optimization
Guy Kornowski
Daogao Liu
Kunal Talwar
328
3
0
08 Oct 2024
Risk-averse learning with delayed feedback
Risk-averse learning with delayed feedback
Siyi Wang
Zifan Wang
Karl H. Johansson
Sandra Hirche
318
0
0
25 Sep 2024
Distributed Online Bandit Nonconvex Optimization with One-Point Residual
  Feedback via Dynamic Regret
Distributed Online Bandit Nonconvex Optimization with One-Point Residual Feedback via Dynamic Regret
Youqing Hua
Shuai Liu
Yiguang Hong
Karl Henrik Johansson
Guangchen Wang
222
2
0
24 Sep 2024
Joint-perturbation simultaneous pseudo-gradient
Joint-perturbation simultaneous pseudo-gradientInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Carlos Martin
Tuomas Sandholm
370
2
0
17 Aug 2024
Private Zeroth-Order Nonsmooth Nonconvex Optimization
Private Zeroth-Order Nonsmooth Nonconvex Optimization
Qinzi Zhang
Hoang Tran
Ashok Cutkosky
313
7
0
27 Jun 2024
First-Order Methods for Linearly Constrained Bilevel Optimization
First-Order Methods for Linearly Constrained Bilevel Optimization
Guy Kornowski
Swati Padmanabhan
Kai Wang
Zhe Zhang
S. Sra
474
15
0
18 Jun 2024
AlphaZeroES: Direct score maximization outperforms planning loss
  minimization
AlphaZeroES: Direct score maximization outperforms planning loss minimization
Carlos Martin
Tuomas Sandholm
204
0
0
12 Jun 2024
Online Optimization Perspective on First-Order and Zero-Order
  Decentralized Nonsmooth Nonconvex Stochastic Optimization
Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization
Emre Sahinoglu
Shahin Shahrampour
369
10
0
03 Jun 2024
Mollification Effects of Policy Gradient Methods
Mollification Effects of Policy Gradient Methods
Tao Wang
Sylvia Herbert
Sicun Gao
318
2
0
28 May 2024
A New Formulation for Zeroth-Order Optimization of Adversarial EXEmples in Malware Detection
A New Formulation for Zeroth-Order Optimization of Adversarial EXEmples in Malware Detection
Marco Rando
Christian Scano
Lorenzo Rosasco
Fabio Roli
AAML
351
3
0
23 May 2024
Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization
Dynamic Anisotropic Smoothing for Noisy Derivative-Free OptimizationInternational Conference on Machine Learning (ICML), 2024
S. Reifenstein
T. Leleu
Yoshihisa Yamamoto
300
3
0
02 May 2024
Test-Time Model Adaptation with Only Forward Passes
Test-Time Model Adaptation with Only Forward PassesInternational Conference on Machine Learning (ICML), 2024
Shuaicheng Niu
Chunyan Miao
Guohao Chen
Pengcheng Wu
Peilin Zhao
TTA
532
76
0
02 Apr 2024
Unified Projection-Free Algorithms for Adversarial DR-Submodular
  Optimization
Unified Projection-Free Algorithms for Adversarial DR-Submodular OptimizationInternational Conference on Learning Representations (ICLR), 2024
M. Pedramfar
Yididiya Y. Nadew
Christopher J. Quinn
Vaneet Aggarwal
255
4
0
15 Mar 2024
Improved Regret for Bandit Convex Optimization with Delayed Feedback
Improved Regret for Bandit Convex Optimization with Delayed Feedback
Yuanyu Wan
Chang Yao
Weilong Dai
Lijun Zhang
391
9
0
14 Feb 2024
Federated Learning Can Find Friends That Are Advantageous
Federated Learning Can Find Friends That Are Advantageous
N. Tupitsa
Samuel Horváth
Martin Takávc
Eduard A. Gorbunov
FedML
528
2
0
07 Feb 2024
Stochastic Two Points Method for Deep Model Zeroth-order Optimization
Stochastic Two Points Method for Deep Model Zeroth-order Optimization
Yijiang Pang
Jiayu Zhou
490
2
0
02 Feb 2024
ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and
  Uncertainty in Zeroth-order Optimization
ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-order Optimization
Shuoran Jiang
Qingcai Chen
Youcheng Pan
Yang Xiang
Yukang Lin
Xiangping Wu
Chuanyi Liu
Xiaobao Song
ODL
238
24
0
23 Dec 2023
Federated Online and Bandit Convex Optimization
Federated Online and Bandit Convex OptimizationInternational Conference on Machine Learning (ICML), 2023
Kumar Kshitij Patel
Lingxiao Wang
Aadirupa Saha
Nathan Srebro
FedML
334
11
0
29 Nov 2023
Payoff-based learning with matrix multiplicative weights in quantum
  games
Payoff-based learning with matrix multiplicative weights in quantum gamesNeural Information Processing Systems (NeurIPS), 2023
Kyriakos Lotidis
P. Mertikopoulos
Nicholas Bambos
Jose Blanchet
210
2
0
04 Nov 2023
Decentralized Gradient-Free Methods for Stochastic Non-Smooth Non-Convex
  Optimization
Decentralized Gradient-Free Methods for Stochastic Non-Smooth Non-Convex OptimizationAAAI Conference on Artificial Intelligence (AAAI), 2023
Zhenwei Lin
Jingfan Xia
Qi Deng
Luo Luo
236
10
0
18 Oct 2023
Multi-point Feedback of Bandit Convex Optimization with Hard Constraints
Multi-point Feedback of Bandit Convex Optimization with Hard Constraints
Yasunari Hikima
340
0
0
17 Oct 2023
DPZero: Private Fine-Tuning of Language Models without Backpropagation
DPZero: Private Fine-Tuning of Language Models without Backpropagation
Liang Zhang
Bingcong Li
K. K. Thekumparampil
Sewoong Oh
Niao He
527
24
0
14 Oct 2023
Tensor-Compressed Back-Propagation-Free Training for (Physics-Informed)
  Neural Networks
Tensor-Compressed Back-Propagation-Free Training for (Physics-Informed) Neural Networks
Yequan Zhao
Xinling Yu
Zhixiong Chen
Ziyue Liu
Sijia Liu
Zheng Zhang
PINN
239
14
0
18 Aug 2023
AI planning in the imagination: High-level planning on learned abstract
  search spaces
AI planning in the imagination: High-level planning on learned abstract search spaces
Carlos Martin
Tuomas Sandholm
241
0
0
16 Aug 2023
An Algorithm with Optimal Dimension-Dependence for Zero-Order Nonsmooth
  Nonconvex Stochastic Optimization
An Algorithm with Optimal Dimension-Dependence for Zero-Order Nonsmooth Nonconvex Stochastic OptimizationJournal of machine learning research (JMLR), 2023
Guy Kornowski
Ohad Shamir
414
29
0
10 Jul 2023
Gradient is All You Need? How Consensus-Based Optimization can be Interpreted as a Stochastic Relaxation of Gradient Descent
Gradient is All You Need? How Consensus-Based Optimization can be Interpreted as a Stochastic Relaxation of Gradient Descent
Konstantin Riedl
T. Klock
Carina Geldhauser
M. Fornasier
309
10
0
16 Jun 2023
Gradient-free optimization of highly smooth functions: improved analysis
  and a new algorithm
Gradient-free optimization of highly smooth functions: improved analysis and a new algorithm
A. Akhavan
Evgenii Chzhen
Massimiliano Pontil
Alexandre B. Tsybakov
276
20
0
03 Jun 2023
Fine-Tuning Language Models with Just Forward Passes
Fine-Tuning Language Models with Just Forward PassesNeural Information Processing Systems (NeurIPS), 2023
Sadhika Malladi
Tianyu Gao
Eshaan Nichani
Alexandru Damian
Jason D. Lee
Danqi Chen
Sanjeev Arora
698
370
0
27 May 2023
A Unified Approach for Maximizing Continuous DR-submodular Functions
A Unified Approach for Maximizing Continuous DR-submodular FunctionsNeural Information Processing Systems (NeurIPS), 2023
M. Pedramfar
Christopher J. Quinn
Vaneet Aggarwal
384
10
0
26 May 2023
Implicitly normalized forecaster with clipping for linear and non-linear
  heavy-tailed multi-armed bandits
Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed banditsComputational Management Science (CMS), 2023
Yuriy Dorn
Kornilov Nikita
N. Kutuzov
A. Nazin
Eduard A. Gorbunov
Alexander Gasnikov
374
5
0
11 May 2023
Performative Prediction with Bandit Feedback: Learning through
  Reparameterization
Performative Prediction with Bandit Feedback: Learning through ReparameterizationInternational Conference on Machine Learning (ICML), 2023
Yatong Chen
Wei Tang
Chien-Ju Ho
Yang Liu
538
12
0
01 May 2023
PyXAB -- A Python Library for $\mathcal{X}$-Armed Bandit and Online
  Blackbox Optimization Algorithms
PyXAB -- A Python Library for X\mathcal{X}X-Armed Bandit and Online Blackbox Optimization Algorithms
Wenjie Li
Haoze Li
Jean Honorio
Qifan Song
GP
244
6
0
07 Mar 2023
Revisiting LQR Control from the Perspective of Receding-Horizon Policy
  Gradient
Revisiting LQR Control from the Perspective of Receding-Horizon Policy GradientIEEE Control Systems Letters (L-CSS), 2023
Xiangyuan Zhang
Tamer Basar
387
25
0
25 Feb 2023
Online Convex Optimization with Stochastic Constraints: Zero Constraint Violation and Bandit Feedback
Y. Kim
Dabeen Lee
397
5
0
26 Jan 2023
ApproxED: Approximate exploitability descent via learned best responses
ApproxED: Approximate exploitability descent via learned best responsesAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Carlos Martin
Tuomas Sandholm
495
2
0
20 Jan 2023
Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic
  Optimization
Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic OptimizationInternational Conference on Machine Learning (ICML), 2023
Le‐Yu Chen
Jing Xu
Luo Luo
313
26
0
16 Jan 2023
Finding mixed-strategy equilibria of continuous-action games without
  gradients using randomized policy networks
Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networksInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Carlos Martin
Tuomas Sandholm
243
11
0
29 Nov 2022
123
Next
Page 1 of 3