Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.01021
Cited By
Large batch size training of neural networks with adversarial training and second-order information
2 October 2018
Z. Yao
A. Gholami
Daiyaan Arfeen
Richard Liaw
Joseph E. Gonzalez
Kurt Keutzer
Michael W. Mahoney
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Large batch size training of neural networks with adversarial training and second-order information"
12 / 12 papers shown
Title
CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction
Yuanyuan Gao
Hao Li
Jiaqi Chen
Zhengyu Zou
Zhihang Zhong
Dingwen Zhang
Xiao-Fu Sun
Junwei Han
3DGS
58
0
0
29 Mar 2025
OmniLearn: A Framework for Distributed Deep Learning over Heterogeneous Clusters
S. Tyagi
Prateek Sharma
68
0
0
21 Mar 2025
P
2
^2
2
-ViT: Power-of-Two Post-Training Quantization and Acceleration for Fully Quantized Vision Transformer
Huihong Shi
Xin Cheng
Wendong Mao
Zhongfeng Wang
MQ
50
3
0
30 May 2024
Accelerating Distributed ML Training via Selective Synchronization
S. Tyagi
Martin Swany
FedML
41
3
0
16 Jul 2023
When Do Flat Minima Optimizers Work?
Jean Kaddour
Linqing Liu
Ricardo M. A. Silva
Matt J. Kusner
ODL
28
58
0
01 Feb 2022
Concurrent Adversarial Learning for Large-Batch Training
Yong Liu
Xiangning Chen
Minhao Cheng
Cho-Jui Hsieh
Yang You
ODL
36
13
0
01 Jun 2021
Multi-Agent Semi-Siamese Training for Long-tail and Shallow Face Learning
Hailin Shi
Dan Zeng
Yichun Tai
Hang Du
Yibo Hu
Zicheng Zhang
Tao Mei
CVBM
39
4
0
10 May 2021
On the Utility of Gradient Compression in Distributed Training Systems
Saurabh Agarwal
Hongyi Wang
Shivaram Venkataraman
Dimitris Papailiopoulos
38
46
0
28 Feb 2021
Deep Learning for Surface Wave Identification in Distributed Acoustic Sensing Data
Vincent Dumont
V. R. Tribaldos
J. Ajo-Franklin
Kesheng Wu
13
20
0
15 Oct 2020
HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Zhen Dong
Z. Yao
Yaohui Cai
Daiyaan Arfeen
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
45
274
0
10 Nov 2019
Parameter Re-Initialization through Cyclical Batch Size Schedules
Norman Mu
Z. Yao
A. Gholami
Kurt Keutzer
Michael W. Mahoney
ODL
30
8
0
04 Dec 2018
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
310
2,896
0
15 Sep 2016
1