2403.18506
Cited By
Faster Convergence for Transformer Fine-tuning with Line Search Methods
27 March 2024
Philip Kenneweg
Leonardo Galli
Tristan Kenneweg
Barbara Hammer
ODL
Papers citing
"Faster Convergence for Transformer Fine-tuning with Line Search Methods"
3 / 3 papers shown
Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
Frederik Kunstner
Jacques Chen
J. Lavington
Mark W. Schmidt
40
66
0
27 Apr 2023
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,943
0
20 Apr 2018
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN
3DV
247
36,237
0
25 Aug 2016