2406.10650
Cited By
The Implicit Bias of Adam on Separable Data
15 June 2024
Chenyang Zhang
Difan Zou
Yuan Cao
AI4CE
Papers citing "The Implicit Bias of Adam on Separable Data"
5 / 5 papers shown
Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks
Chenyang Zhang
Peifeng Gao
Difan Zou
Yuan Cao
OOD
MLT
59
0
0
11 Apr 2025
On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions
Yusu Hong
Junhong Lin
38
10
0
06 Feb 2024
Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
Frederik Kunstner
Jacques Chen
J. Lavington
Mark W. Schmidt
38
66
0
27 Apr 2023
Does Momentum Change the Implicit Regularization on Separable Data?
Bohan Wang
Qi Meng
Huishuai Zhang
Ruoyu Sun
Wei-Neng Chen
Zhirui Ma
Tie-Yan Liu
39
15
0
08 Oct 2021
On Margin Maximization in Linear and ReLU Networks
Gal Vardi
Ohad Shamir
Nathan Srebro
34
27
0
06 Oct 2021