2307.11948
Cited By
The instabilities of large learning rate training: a loss landscape view
22 July 2023
Lawrence Wang
Stephen J. Roberts
Papers citing
"The instabilities of large learning rate training: a loss landscape view"
2 / 2 papers shown
Title
Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes
Nikita Kiselev
Andrey Grabovoy
84
1
0
18 Sep 2024
Are Large Language Models Really Robust to Word-Level Perturbations?
Haoyu Wang
Guozheng Ma
Cong Yu
Ning Gui
Linrui Zhang
...
Sen Zhang
Li Shen
Xueqian Wang
Peilin Zhao
Dacheng Tao
KELM
109
24
0
20 Sep 2023