ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.17333
  4. Cited By
Fine-Tuning Language Models with Just Forward Passes
v1v2v3 (latest)

Fine-Tuning Language Models with Just Forward Passes

Neural Information Processing Systems (NeurIPS), 2023
27 May 2023
Sadhika Malladi
Tianyu Gao
Eshaan Nichani
Alexandru Damian
Jason D. Lee
Danqi Chen
Sanjeev Arora
ArXiv (abs)PDFHTMLHuggingFace (3 upvotes)

Papers citing "Fine-Tuning Language Models with Just Forward Passes"

50 / 188 papers shown
ZO-ASR: Zeroth-Order Fine-Tuning of Speech Foundation Models without Back-Propagation
Yuezhang Peng
Yu‐Xin Liu
Yao Li
S. Wang
Fei Wen
Xie Chen
111
0
0
01 Dec 2025
Dialect Identification Using Resource-Efficient Fine-Tuning Approaches
Dialect Identification Using Resource-Efficient Fine-Tuning ApproachesAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2025
Zirui Lin
Haris Gulzar
Monnika Roslianna Busto
Akiko Masaki
Takeharu Eda
K. Nakadai
68
0
0
30 Nov 2025
Ghosting Your LLM: Without The Knowledge of Your Gradient and Data
Ghosting Your LLM: Without The Knowledge of Your Gradient and Data
Abeer Matar A. Almalky
Ziyan Wang
Mohaiminul Al Nahian
Li Yang
Adnan Siraj Rakin
AAML
207
0
0
27 Nov 2025
Low-Rank Curvature for Zeroth-Order Optimization in LLM Fine-Tuning
Low-Rank Curvature for Zeroth-Order Optimization in LLM Fine-Tuning
Hyunseok Seung
Jaewoo Lee
Hyunsuk Ko
73
0
0
11 Nov 2025
Towards Straggler-Resilient Split Federated Learning: An Unbalanced Update Approach
Towards Straggler-Resilient Split Federated Learning: An Unbalanced Update Approach
Dandan Liang
Jianing Zhang
Evan Chen
Zhe Li
Rui Li
Haibo Yang
FedML
186
1
0
24 Oct 2025
More Than Memory Savings: Zeroth-Order Optimization Mitigates Forgetting in Continual Learning
More Than Memory Savings: Zeroth-Order Optimization Mitigates Forgetting in Continual Learning
Wanhao Yu
Zheng Wang
Shuteng Niu
Sen Lin
Li Yang
CLL
243
0
0
23 Oct 2025
Language Ranker: A Lightweight Ranking framework for LLM Decoding
Language Ranker: A Lightweight Ranking framework for LLM Decoding
Chenheng Zhang
Tianqi Du
Jizhe Zhang
Mingqing Xiao
Yifei Wang
Yisen Wang
Zhouchen Lin
ALM
207
0
0
23 Oct 2025
Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations
Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned PerturbationsInternational Conference on Learning Representations (ICLR), 2025
Shaocong Ma
Heng Huang
153
12
0
22 Oct 2025
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
Shaocong Ma
Heng Huang
138
2
0
22 Oct 2025
Towards Fast LLM Fine-tuning through Zeroth-Order Optimization with Projected Gradient-Aligned Perturbations
Towards Fast LLM Fine-tuning through Zeroth-Order Optimization with Projected Gradient-Aligned Perturbations
Zhendong Mi
Qitao Tan
Grace Li Zhang
Zhaozhuo Xu
Geng Yuan
Shaoyi Huang
145
0
0
21 Oct 2025
Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
Xuchen Gong
Tian Li
148
0
0
17 Oct 2025
Noise-Adaptive Layerwise Learning Rates: Accelerating Geometry-Aware Optimization for Deep Neural Network Training
Noise-Adaptive Layerwise Learning Rates: Accelerating Geometry-Aware Optimization for Deep Neural Network Training
Jie Hao
Xiaochuan Gong
Jie Xu
Z. Wang
Mingrui Liu
AI4CE
152
0
0
15 Oct 2025
Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile Devices
Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile Devices
Congzheng Song
Xinyu Tang
123
0
0
03 Oct 2025
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Kairun Zhang
Xue Yang
Yanjun Zhao
Yifan Sun
Huan Zhang
179
0
0
01 Oct 2025
Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning
Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning
Yicheng Lang
Yihua Zhang
Chongyu Fan
Changsheng Wang
Jinghan Jia
Sijia Liu
MU
356
0
0
01 Oct 2025
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
Xin Qiu
Yulu Gan
Conor F. Hayes
Qiyao Liang
Elliot Meyerson
Babak Hodjat
Risto Miikkulainen
196
3
0
29 Sep 2025
CR-Net: Scaling Parameter-Efficient Training with Cross-Layer Low-Rank Structure
CR-Net: Scaling Parameter-Efficient Training with Cross-Layer Low-Rank Structure
Boao Kong
Junzhu Liang
Yuxi Liu
Renjia Deng
Kun Yuan
160
1
0
23 Sep 2025
The Multi-Query Paradox in Zeroth-Order Optimization
The Multi-Query Paradox in Zeroth-Order Optimization
Wei Lin
Qingyu Song
Hong Xu
170
0
0
19 Sep 2025
Low-rank surrogate modeling and stochastic zero-order optimization for training of neural networks with black-box layers
Low-rank surrogate modeling and stochastic zero-order optimization for training of neural networks with black-box layers
Andrei Chertkov
Artem Basharin
Mikhail Saygin
Evgeny Frolov
Stanislav Straupe
Ivan Oseledets
142
0
0
18 Sep 2025
Low-rank Orthogonalization for Large-scale Matrix Optimization with Applications to Foundation Model Training
Low-rank Orthogonalization for Large-scale Matrix Optimization with Applications to Foundation Model Training
Chuan He
Zhanwang Deng
Zhaosong Lu
BDL
168
2
0
15 Sep 2025
L1RA: Dynamic Rank Assignment in LoRA Fine-Tuning
L1RA: Dynamic Rank Assignment in LoRA Fine-Tuning
Raul Singh
Nicolo Brunello
Vincenzo Scotti
Mark James Carman
110
0
0
05 Sep 2025
Warming Up for Zeroth-Order Federated Pre-Training with Low Resource Clients
Warming Up for Zeroth-Order Federated Pre-Training with Low Resource Clients
G. Legate
Irina Rish
Eugene Belilovsky
FedML
112
0
0
03 Sep 2025
Forward-Only Continual Learning
Forward-Only Continual Learning
Jiao Chen
Jiayi He
Fangfang Chen
Zuohong Lv
Jianhua Tang
CLL
166
1
0
01 Sep 2025
GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping
GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping
Qifu Wen
Xi Zeng
Zihan Zhou
Shuaijun Liu
M. Hosseinzadeh
Ningxin Su
Reza Rawassizadeh
268
0
0
01 Sep 2025
On the Evolution of Federated Post-Training Large Language Models: A Model Accessibility View
On the Evolution of Federated Post-Training Large Language Models: A Model Accessibility View
Tao Guo
Junxiao Wang
Fushuo Huo
Laizhong Cui
Song Guo
Jie Gui
Dacheng Tao
109
0
0
22 Aug 2025
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
Qitao Tan
Xiaoying Song
Jin Lu
Guoming Li
Jun Liu
...
Jundong Li
Xiaoming Zhai
Shaoyi Huang
Wei Niu
Geng Yuan
MQ
226
0
0
21 Aug 2025
Efficient Knowledge Graph Unlearning with Zeroth-order Information
Efficient Knowledge Graph Unlearning with Zeroth-order Information
Yang Xiao
Ruimeng Ye
Bohan Liu
Xiaolong Ma
Bo Hui
MU
160
1
0
19 Aug 2025
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Tianxiao Cao
Kyohei Atarashi
H. Kashima
227
0
0
14 Aug 2025
Communication-Efficient Zero-Order and First-Order Federated Learning Methods over Wireless Networks
Communication-Efficient Zero-Order and First-Order Federated Learning Methods over Wireless Networks
Mohamad Assaad
Zeinab Nehme
Mérouane Debbah
97
0
0
11 Aug 2025
RCR-Router: Efficient Role-Aware Context Routing for Multi-Agent LLM Systems with Structured Memory
RCR-Router: Efficient Role-Aware Context Routing for Multi-Agent LLM Systems with Structured Memory
Jun Liu
Zhenglun Kong
Changdi Yang
Fan Yang
Tianqi Li
...
Wenbin Zhang
P. Zhao
Xue Lin
Dong Huang
Yanzhi Wang
208
4
0
06 Aug 2025
Test-Time Model Adaptation for Quantized Neural Networks
Test-Time Model Adaptation for Quantized Neural Networks
Zeshuai Deng
Guohao Chen
Shuaicheng Niu
Hui Luo
Shuhai Zhang
Yifan Yang
Renjie Chen
Wei Luo
Mingkui Tan
MQ
155
1
0
04 Aug 2025
DREAM: Scalable Red Teaming for Text-to-Image Generative Systems via Distribution Modeling
DREAM: Scalable Red Teaming for Text-to-Image Generative Systems via Distribution Modeling
Boheng Li
Junjie Wang
Yiming Li
Zhiyang Hu
Leyi Qi
Jianshuo Dong
Run Wang
Han Qiu
Zhan Qin
Tianwei Zhang
229
1
0
22 Jul 2025
Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies
Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies
Seokeon Choi
S. Park
Hyoungwoo Park
J. Kim
Sungrack Yun
151
1
0
14 Jul 2025
Greedy Low-Rank Gradient Compression for Distributed Learning with Convergence Guarantees
Greedy Low-Rank Gradient Compression for Distributed Learning with Convergence Guarantees
Chuyan Chen
Yutong He
Pengrui Li
Weichen Jia
Kun Yuan
625
4
0
11 Jul 2025
SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only Passes
SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only Passes
Yifan Yang
Zhen-ying Zhang
Rupak Vignesh Swaminathan
Jing Liu
Nathan Susanj
Zheng Zhang
VLM
191
1
0
26 Jun 2025
Private Training & Data Generation by Clustering Embeddings
Private Training & Data Generation by Clustering Embeddings
Felix Y. Zhou
Samson Zhou
Vahab Mirrokni
Alessandro Epasto
Vincent Cohen-Addad
191
0
0
20 Jun 2025
Memory-Efficient Differentially Private Training with Gradient Random Projection
Memory-Efficient Differentially Private Training with Gradient Random Projection
Alex Mulrooney
Devansh Gupta
James Flemings
Huanyu Zhang
Murali Annavaram
Meisam Razaviyayn
Xinwei Zhang
243
1
0
18 Jun 2025
Private Aggregation for Byzantine-Resilient Heterogeneous Federated Learning
Maximilian Egger
Rawad Bitar
279
0
0
11 Jun 2025
MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs
MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs
Zhenyan Lu
Daliang Xu
Dongqi Cai
Zexi Li
Wei Liu
Fangming Liu
Shangguang Wang
Mengwei Xu
KELM
211
1
0
05 Jun 2025
Learning long range dependencies through time reversal symmetry breaking
Guillaume Pourcel
Maxence Ernoult
351
3
0
05 Jun 2025
Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order
Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order
Egor Petrov
Grigoriy Evseev
Aleksey Antonov
Andrey Veprikov
Nikolay Bushkov
Nikolay Bushkov
Stanislav Moiseev
410
2
0
04 Jun 2025
Provable Reinforcement Learning from Human Feedback with an Unknown Link Function
Provable Reinforcement Learning from Human Feedback with an Unknown Link Function
Qining Zhang
Lei Ying
252
0
0
03 Jun 2025
Reconciling Hessian-Informed Acceleration and Scalar-Only Communication for Efficient Federated Zeroth-Order Fine-Tuning
Reconciling Hessian-Informed Acceleration and Scalar-Only Communication for Efficient Federated Zeroth-Order Fine-Tuning
Zhe Li
Bicheng Ying
Zidong Liu
Chaosheng Dong
Haibo Yang
FedML
242
0
0
03 Jun 2025
MLorc: Momentum Low-rank Compression for Memory Efficient Large Language Model Adaptation
MLorc: Momentum Low-rank Compression for Memory Efficient Large Language Model Adaptation
Wei Shen
Zhang Yaxiang
Minhui Huang
Mengfan Xu
Jiawei Zhang
Cong Shen
AI4CE
326
1
0
02 Jun 2025
Structured Gradient Guidance for Few-Shot Adaptation in Large Language Models
Structured Gradient Guidance for Few-Shot Adaptation in Large Language Models
Hongye Zheng
Yichen Wang
Ray Pan
Guiran Liu
Binrong Zhu
Hanlu Zhang
141
9
0
31 May 2025
A Structured Tour of Optimization with Finite Differences
A Structured Tour of Optimization with Finite Differences
Marco Rando
C. Molinari
Lorenzo Rosasco
S. Villa
364
0
0
26 May 2025
KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
Zhendong Mi
Qitao Tan
Xiaodong Yu
Zining Zhu
Geng Yuan
Shaoyi Huang
356
4
0
24 May 2025
Subquadratic Algorithms and Hardness for Attention with Any Temperature
Subquadratic Algorithms and Hardness for Attention with Any Temperature
Shreya Gupta
Boyang Huang
Barna Saha
Yinzhan Xu
Christopher Ye
265
2
0
20 May 2025
Fine-tuning Quantized Neural Networks with Zeroth-order Optimization
Fine-tuning Quantized Neural Networks with Zeroth-order Optimization
Sifeng Shang
Jiayi Zhou
Chenyu Lin
Minxian Li
Kaiyang Zhou
MQ
353
1
0
19 May 2025
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
Yezhen Wang
Zhouhao Yang
Brian K Chen
Fanyi Pu
Yue Liu
Tianyu Gao
Kenji Kawaguchi
236
0
0
03 May 2025
1234
Next