Large batch size training of neural networks with adversarial training and second-order information

2 October 2018

Joseph E. Gonzalez

Michael W. Mahoney

Papers citing "Large batch size training of neural networks with adversarial training and second-order information"

| Title | Authors | Tag | Counts | Date |
|---|---|---|---|---|
| CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction | Yuanyuan Gao, Hao Li, Jiaqi Chen, Zhengyu Zou, Zhihang Zhong, Dingwen Zhang, Xiao-Fu Sun, Junwei Han | 3DGS | 58 0 0 | 29 Mar 2025 |
| OmniLearn: A Framework for Distributed Deep Learning over Heterogeneous Clusters | S. Tyagi, Prateek Sharma | | 68 0 0 | 21 Mar 2025 |
| P$^2$-ViT: Power-of-Two Post-Training Quantization and Acceleration for Fully Quantized Vision Transformer | Huihong Shi, Xin Cheng, Wendong Mao, Zhongfeng Wang | MQ | 50 3 0 | 30 May 2024 |
| Accelerating Distributed ML Training via Selective Synchronization | S. Tyagi, Martin Swany | FedML | 41 3 0 | 16 Jul 2023 |
| When Do Flat Minima Optimizers Work? | Jean Kaddour, Linqing Liu, Ricardo M. A. Silva, Matt J. Kusner | ODL | 28 58 0 | 01 Feb 2022 |
| Concurrent Adversarial Learning for Large-Batch Training | Yong Liu, Xiangning Chen, Minhao Cheng, Cho-Jui Hsieh, Yang You | ODL | 36 13 0 | 01 Jun 2021 |
| Multi-Agent Semi-Siamese Training for Long-tail and Shallow Face Learning | Hailin Shi, Dan Zeng, Yichun Tai, Hang Du, Yibo Hu, Zicheng Zhang, Tao Mei | CVBM | 39 4 0 | 10 May 2021 |
| On the Utility of Gradient Compression in Distributed Training Systems | Saurabh Agarwal, Hongyi Wang, Shivaram Venkataraman, Dimitris Papailiopoulos | | 38 46 0 | 28 Feb 2021 |
| Deep Learning for Surface Wave Identification in Distributed Acoustic Sensing Data | Vincent Dumont, V. R. Tribaldos, J. Ajo-Franklin, Kesheng Wu | | 13 20 0 | 15 Oct 2020 |
| HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks | Zhen Dong, Z. Yao, Yaohui Cai, Daiyaan Arfeen, A. Gholami, Michael W. Mahoney, Kurt Keutzer | MQ | 45 274 0 | 10 Nov 2019 |
| Parameter Re-Initialization through Cyclical Batch Size Schedules | Norman Mu, Z. Yao, A. Gholami, Kurt Keutzer, Michael W. Mahoney | ODL | 30 8 0 | 04 Dec 2018 |
| On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima | N. Keskar, Dheevatsa Mudigere, J. Nocedal, M. Smelyanskiy, P. T. P. Tang | ODL | 310 2,896 0 | 15 Sep 2016 |