ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.17333
  4. Cited By
Fine-Tuning Language Models with Just Forward Passes

Fine-Tuning Language Models with Just Forward Passes

27 May 2023
Sadhika Malladi
Tianyu Gao
Eshaan Nichani
Alexandru Damian
Jason D. Lee
Danqi Chen
Sanjeev Arora
ArXivPDFHTML

Papers citing "Fine-Tuning Language Models with Just Forward Passes"

50 / 144 papers shown
Title
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
Yezhen Wang
Zhouhao Yang
Brian K Chen
Fanyi Pu
Bo-wen Li
Tianyu Gao
Kenji Kawaguchi
34
0
0
03 May 2025
Stochastic Subspace Descent Accelerated via Bi-fidelity Line Search
Stochastic Subspace Descent Accelerated via Bi-fidelity Line Search
Nuojin Cheng
Alireza Doostan
Stephen Becker
34
0
0
30 Apr 2025
Private Federated Learning using Preference-Optimized Synthetic Data
Private Federated Learning using Preference-Optimized Synthetic Data
Charlie Hou
Mei-Yu Wang
Yige Zhu
Daniel Lazar
Giulia Fanti
FedML
Presented at ResearchTrend Connect | FedML on 07 May 2025
54
0
0
23 Apr 2025
Efficient Model Editing with Task-Localized Sparse Fine-tuning
Efficient Model Editing with Task-Localized Sparse Fine-tuning
Leonardo Iurada
Marco Ciccone
Tatiana Tommasi
KELM
MoMe
40
0
0
03 Apr 2025
Efficient Personalization of Quantized Diffusion Model without Backpropagation
Efficient Personalization of Quantized Diffusion Model without Backpropagation
H. Seo
Wongi Jeong
Kyungryeol Lee
Se Young Chun
DiffM
MQ
73
0
0
19 Mar 2025
Augmented Adversarial Trigger Learning
Augmented Adversarial Trigger Learning
Zhe Wang
Yanjun Qi
46
0
0
16 Mar 2025
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory
Liangyu Wang
Jie Ren
Hang Xu
Junxiao Wang
Huanyi Xie
David E. Keyes
Di Wang
58
0
0
16 Mar 2025
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
Hao Chen
S. Hu
Wayne Luk
Timothy M. Hospedales
Hongxiang Fan
MoMe
67
0
0
16 Mar 2025
A Survey on Federated Fine-tuning of Large Language Models
A Survey on Federated Fine-tuning of Large Language Models
Yebo Wu
Chunlin Tian
Jingguang Li
He Sun
Kahou Tam
Li Li
Chengzhong Xu
FedML
81
0
0
15 Mar 2025
Visualising Policy-Reward Interplay to Inform Zeroth-Order Preference Optimisation of Large Language Models
Alessio Galatolo
Zhenbang Dai
Katie Winkle
Meriem Beloucif
47
0
0
05 Mar 2025
Towards hyperparameter-free optimization with differential privacy
Zhiqi Bu
Ruixuan Liu
24
1
0
02 Mar 2025
SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization
SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization
Shubhankar Borse
K. Bhardwaj
Mohammad Reza Karimi Dastjerdi
Hyojin Park
Shreya Kadambi
...
Prathamesh Mandke
Ankita Nayak
Harris Teague
Munawar Hayat
Fatih Porikli
DiffM
75
1
0
27 Feb 2025
LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM
LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM
Yehonathan Refael
Iftach Arbel
Ofir Lindenbaum
Tom Tirer
64
0
0
26 Feb 2025
QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
Jiajun Zhou
Yifan Yang
Kai Zhen
Z. Liu
Yequan Zhao
Ershad Banijamali
Athanasios Mouchtaris
Ngai Wong
Zheng Zhang
MQ
41
0
0
17 Feb 2025
Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks
Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks
Yequan Zhao
Xinling Yu
Xian Xiao
Z. Chen
Z. Liu
G. Kurczveil
R. Beausoleil
S. Liu
Z. Zhang
45
0
0
17 Feb 2025
An Efficient Row-Based Sparse Fine-Tuning
An Efficient Row-Based Sparse Fine-Tuning
Cen-Jhih Li
Aditya Bhaskara
44
0
0
17 Feb 2025
Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices
Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices
Mohamed Aboelenien Ahmed
Kilian Pfeiffer
R. Khalili
Heba Khdr
J. Henkel
FedML
80
0
0
17 Feb 2025
A Survey of Personalized Large Language Models: Progress and Future Directions
A Survey of Personalized Large Language Models: Progress and Future Directions
Jiahong Liu
Zexuan Qiu
Zhongyang Li
Quanyu Dai
Jieming Zhu
Minda Hu
Menglin Yang
Irwin King
LM&MA
46
2
0
17 Feb 2025
Model Diffusion for Certifiable Few-shot Transfer Learning
Model Diffusion for Certifiable Few-shot Transfer Learning
Fady Rezk
Royson Lee
H. Gouk
Timothy M. Hospedales
Minyoung Kim
45
0
0
10 Feb 2025
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Antoine Simoulin
Namyong Park
Xiaoyi Liu
Grey Yang
110
0
0
31 Jan 2025
Decentralized Low-Rank Fine-Tuning of Large Language Models
Sajjad Ghiasvand
Mahnoosh Alizadeh
Ramtin Pedarsani
ALM
64
0
0
26 Jan 2025
An Enhanced Zeroth-Order Stochastic Frank-Wolfe Framework for Constrained Finite-Sum Optimization
An Enhanced Zeroth-Order Stochastic Frank-Wolfe Framework for Constrained Finite-Sum Optimization
Haishan Ye
Yinghui Huang
Hao Di
Xiangyu Chang
38
0
0
13 Jan 2025
Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operators
Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operators
Zekun Shi
Zheyuan Hu
Min-Bin Lin
Kenji Kawaguchi
113
4
0
27 Nov 2024
Poor Man's Training on MCUs: A Memory-Efficient Quantized
  Back-Propagation-Free Approach
Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach
Yequan Zhao
Hai Li
Ian Young
Zheng-Wei Zhang
MQ
24
2
0
07 Nov 2024
Stepping Forward on the Last Mile
Stepping Forward on the Last Mile
Chen Feng
Shaojie Zhuo
Xiaopeng Zhang
R. Ramakrishnan
Zhaocong Yuan
Andrew Zou Li
25
0
0
06 Nov 2024
Normalization Layer Per-Example Gradients are Sufficient to Predict
  Gradient Noise Scale in Transformers
Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers
Gavia Gray
Aman Tiwari
Shane Bergsma
Joel Hestness
25
1
0
01 Nov 2024
On the Crucial Role of Initialization for Matrix Factorization
On the Crucial Role of Initialization for Matrix Factorization
Bingcong Li
Liang Zhang
Aryan Mokhtari
Niao He
26
1
0
24 Oct 2024
CKSP: Cross-species Knowledge Sharing and Preserving for Universal
  Animal Activity Recognition
CKSP: Cross-species Knowledge Sharing and Preserving for Universal Animal Activity Recognition
Axiu Mao
Meilu Zhu
Zhaojin Guo
Zheng He
Tomas Norton
Kai Liu
21
0
0
22 Oct 2024
Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning
Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning
H. Fernando
Han Shen
Parikshit Ram
Yi Zhou
Horst Samulowitz
Nathalie Baracaldo
Tianyi Chen
CLL
50
2
0
20 Oct 2024
Implicit Regularization of Sharpness-Aware Minimization for
  Scale-Invariant Problems
Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems
Bingcong Li
Liang Zhang
Niao He
36
3
0
18 Oct 2024
A Theoretical Survey on Foundation Models
A Theoretical Survey on Foundation Models
Shi Fu
Yuzhu Chen
Yingjie Wang
Dacheng Tao
21
0
0
15 Oct 2024
Federated Data-Efficient Instruction Tuning for Large Language Models
Federated Data-Efficient Instruction Tuning for Large Language Models
Zhen Qin
Zhaomin Wu
Bingsheng He
Shuiguang Deng
FedML
35
2
0
14 Oct 2024
Divide, Reweight, and Conquer: A Logit Arithmetic Approach for
  In-Context Learning
Divide, Reweight, and Conquer: A Logit Arithmetic Approach for In-Context Learning
Chengsong Huang
Langlin Huang
Jiaxin Huang
MoMe
27
1
0
14 Oct 2024
Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for
  Fine-Tuning Large Language Models
Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models
Fei Wang
Li Shen
Liang Ding
Chao Xue
Ye Liu
Changxing Ding
28
0
0
13 Oct 2024
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces
Ziming Yu
Pan Zhou
Sike Wang
Jia Li
Hua Huang
18
0
0
11 Oct 2024
Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and
  Performance of SGD for Fine-Tuning Language Models
Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models
Zeman Li
Xinwei Zhang
Peilin Zhong
Yuan Deng
Meisam Razaviyayn
Vahab Mirrokni
13
2
0
09 Oct 2024
Chemistry-Inspired Diffusion with Non-Differentiable Guidance
Chemistry-Inspired Diffusion with Non-Differentiable Guidance
Yuchen Shen
Chenhao Zhang
Sijie Fu
Chenghui Zhou
Newell Washburn
Barnabás Póczos
45
0
0
09 Oct 2024
FLOPS: Forward Learning with OPtimal Sampling
FLOPS: Forward Learning with OPtimal Sampling
Tao Ren
Zishi Zhang
Jinyang Jiang
Guanghao Li
Zeliang Zhang
Mingqian Feng
Yijie Peng
30
1
0
08 Oct 2024
LoRTA: Low Rank Tensor Adaptation of Large Language Models
LoRTA: Low Rank Tensor Adaptation of Large Language Models
Ignacio Hounie
Charilaos I. Kanatsoulis
Arnuv Tandon
Alejandro Ribeiro
29
0
0
05 Oct 2024
On Unsupervised Prompt Learning for Classification with Black-box
  Language Models
On Unsupervised Prompt Learning for Classification with Black-box Language Models
Zhen-Yu Zhang
Jiandong Zhang
Huaxiu Yao
Gang Niu
Masashi Sugiyama
21
2
0
04 Oct 2024
Variance-Reduced Gradient Estimator for Nonconvex Zeroth-Order
  Distributed Optimization
Variance-Reduced Gradient Estimator for Nonconvex Zeroth-Order Distributed Optimization
Huaiyi Mu
Yujie Tang
Zhongkui Li
13
0
0
29 Sep 2024
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
Qining Zhang
Lei Ying
OffRL
35
1
0
25 Sep 2024
A Unified Causal Framework for Auditing Recommender Systems for Ethical
  Concerns
A Unified Causal Framework for Auditing Recommender Systems for Ethical Concerns
Vibhhu Sharma
Shantanu Gupta
Nil-Jana Akpinar
Zachary C. Lipton
Liu Leqi
CML
MLAU
15
0
0
20 Sep 2024
Self-Contrastive Forward-Forward Algorithm
Self-Contrastive Forward-Forward Algorithm
Xing Chen
Dongshu Liu
Jérémie Laydevant
Julie Grollier
28
2
0
17 Sep 2024
Ferret: Federated Full-Parameter Tuning at Scale for Large Language
  Models
Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models
Yao Shu
Wenyang Hu
S. Ng
Bryan Kian Hsiang Low
Fei Richard Yu
FedML
32
0
0
10 Sep 2024
Scalable Multitask Learning Using Gradient-based Estimation of Task
  Affinity
Scalable Multitask Learning Using Gradient-based Estimation of Task Affinity
Dongyue Li
Aneesh Sharma
Hongyang R. Zhang
64
1
0
09 Sep 2024
Towards training digitally-tied analog blocks via hybrid gradient
  computation
Towards training digitally-tied analog blocks via hybrid gradient computation
Timothy Nest
M. Ernoult
39
1
0
05 Sep 2024
Towards General Industrial Intelligence: A Survey on IIoT-Enhanced
  Continual Large Models
Towards General Industrial Intelligence: A Survey on IIoT-Enhanced Continual Large Models
Jiao Chen
Jiayi He
Fangfang Chen
Zuohong Lv
Jianhua Tang
Weihua Li
Zuozhu Liu
Howard H. Yang
Guangjie Han
AI4CE
34
1
0
02 Sep 2024
Towards Efficient Large Language Models for Scientific Text: A Review
Towards Efficient Large Language Models for Scientific Text: A Review
H. To
Ming Liu
Guangyan Huang
35
1
0
20 Aug 2024
Fine-Tuning and Deploying Large Language Models Over Edges: Issues and
  Approaches
Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches
Yanjie Dong
Xiaoyi Fan
Fangxin Wang
Chengming Li
Victor C. M. Leung
Xiping Hu
26
4
0
20 Aug 2024
123
Next