Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.01110
Cited By
Batch Normalization Preconditioning for Neural Network Training
2 August 2021
Susanna Lange
Kyle E. Helfrich
Qiang Ye
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Batch Normalization Preconditioning for Neural Network Training"
4 / 4 papers shown
Title
Preconditioning for Accelerated Gradient Descent Optimization and Regularization
Qiang Ye
AI4CE
21
0
0
30 Sep 2024
fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions
Alireza Afzal Aghaei
27
47
0
11 Jun 2024
Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction
Kaifeng Lyu
Zhiyuan Li
Sanjeev Arora
FAtt
35
69
0
14 Jun 2022
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,888
0
15 Sep 2016
1