How Does Batch Normalization Help Optimization?

29 May 2018

Papers citing "How Does Batch Normalization Help Optimization?"

50 / 141 papers shown

NetSight: Graph Attention Based Traffic Forecasting in Computer Networks Jinming Xing Guoheng Sun Hui Sun Linchao Pan Shakir Mahmood Xuanhao Luo Muhammad Shahzad 28 0 0 11 May 2025
SPD Learning for Covariance-Based Neuroimaging Analysis: Perspectives, Methods, and Challenges Ce Ju Reinmar J. Kobler Antoine Collas M. Kawanabe Cuntai Guan Bertrand Thirion 41 0 0 26 Apr 2025
Decentralized Federated Domain Generalization with Style Sharing: A Formal Modeling and Convergence Analysis Shahryar Zehtabi Dong-Jun Han Seyyedali Hosseinalipour Christopher G. Brinton FedML AI4CE 45 0 0 08 Apr 2025
A Real-time Multimodal Transformer Neural Network-powered Wildfire Forecasting System Qijun Chen Shaofan Li 41 0 0 07 Mar 2025
Beyond R-barycenters: an effective averaging method on Stiefel and Grassmann manifolds Florent Bouchard Nils Laurent Salem Said N. L. Bihan 29 1 0 20 Jan 2025
Quantum Cognition-Inspired EEG-based Recommendation via Graph Neural Networks Jinkun Han Wei Li Y. Li Zhipeng Cai 37 2 0 05 Jan 2025
Data-Efficient Discovery of Hyperelastic TPMS Metamaterials with Extreme Energy Dissipation Maxine Perroni-Scharf Zachary Ferguson Thomas Butrille Carlos Portela Mina Konaković Luković 27 0 0 29 May 2024
Hidden Synergy: $L_1$ Weight Normalization and 1-Path-Norm Regularization Aditya Biswas 36 0 0 29 Apr 2024
Linearly Constrained Weights: Reducing Activation Shift for Faster Training of Neural Networks Takuro Kutsuna LLMSV 19 1 0 08 Mar 2024
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation Yilin Lyu Liyuan Wang Xingxing Zhang Zicheng Sun Hang Su Jun Zhu Liping Jing 34 8 0 13 Oct 2023
Weakly Supervised Multi-Task Representation Learning for Human Activity Analysis Using Wearables Taoran Sheng M. Huber SSL HAI 16 20 0 06 Aug 2023
AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization Chuyan Zhang Yuncheng Yang Hao Zheng Yun Gu 24 0 0 28 Jul 2023
Reinterpreting survival analysis in the universal approximator age Sören Dittmer M. Roberts J. Preller AIX-COVNET Collaboration James H. F. Rudd J. Aston Carola-Bibiane Schönlieb 27 0 0 25 Jul 2023
Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis Jiajie Fan L. Vuaille Hongya Wang Thomas Bäck AI4CE 17 5 0 19 Jul 2023
The Implicit Bias of Batch Normalization in Linear Models and Two-layer Linear Convolutional Neural Networks Yuan Cao Difan Zou Yuan-Fang Li Quanquan Gu MLT 29 5 0 20 Jun 2023
Group channel pruning and spatial attention distilling for object detection Yun Chu Pu Li Yong Bai Zhuhua Hu Yongqing Chen Jiafeng Lu VLM 24 13 0 02 Jun 2023
On the Weight Dynamics of Deep Normalized Networks Christian H. X. Ali Mehmeti-Göpel Michael Wand 25 1 0 01 Jun 2023
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs Javier Campos Zhen Dong Javier Mauricio Duarte A. Gholami Michael W. Mahoney Jovan Mitrevski Nhan Tran MQ 24 3 0 13 Apr 2023
Inductive biases in deep learning models for weather prediction Jannik Thümmel Matthias Karlbauer S. Otte C. Zarfl Georg Martius ... Thomas Scholten Ulrich Friedrich V. Wulfmeyer B. Goswami Martin Volker Butz AI4CE 38 5 0 06 Apr 2023
NU-AIR -- A Neuromorphic Urban Aerial Dataset for Detection and Localization of Pedestrians and Vehicles Craig Iaboni Thomas Kelly Pramod Abichandani 16 2 0 18 Feb 2023
Novel Building Detection and Location Intelligence Collection in Aerial Satellite Imagery Sandeep Singh Christian Wiles A. Bilal 18 0 0 06 Feb 2023
Modality-Agnostic Variational Compression of Implicit Neural Representations Jonathan Richard Schwarz Jihoon Tack Yee Whye Teh Jaeho Lee Jinwoo Shin 24 25 0 23 Jan 2023
Low PAPR MIMO-OFDM Design Based on Convolutional Autoencoder Yara Huleihel H. Permuter 22 6 0 11 Jan 2023
Disentangled Explanations of Neural Network Predictions by Finding Relevant Subspaces Pattarawat Chormai J. Herrmann Klaus-Robert Muller G. Montavon FAtt 43 17 0 30 Dec 2022
Stable Learning via Sparse Variable Independence Han Yu Peng Cui Yue He Zheyan Shen Yong Lin Renzhe Xu Xingxuan Zhang OOD 28 13 0 02 Dec 2022
Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics Ancheng Lin Jun Yu Li Yusheng Xiang Wei Bian Mukesh Prasad 3DPC ViT 43 2 0 19 Nov 2022
We need to talk about random seeds Steven Bethard 31 8 0 24 Oct 2022
Dynamical Isometry for Residual Networks Advait Gadhikar R. Burkholz ODL AI4CE 32 2 0 05 Oct 2022
RankMe: Assessing the downstream performance of pretrained self-supervised representations by their rank Q. Garrido Randall Balestriero Laurent Najman Yann LeCun SSL 46 72 0 05 Oct 2022
Batch Normalization Explained Randall Balestriero Richard G. Baraniuk AAML 28 16 0 29 Sep 2022
On the Pros and Cons of Momentum Encoder in Self-Supervised Visual Representation Learning T. Pham Chaoning Zhang Axi Niu Kang Zhang Chang-Dong Yoo 36 11 0 11 Aug 2022
EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification Xiang Yu Zhe Geng Xiaohua Huang Qinglu Wang Daiyin Zhu 30 5 0 03 Aug 2022
Continuous locomotion mode recognition and gait phase estimation based on a shank-mounted IMU with artificial neural networks F. Weigand Andreas Höhl Julian Zeiss U. Konigorski M. Grimmer 9 3 0 01 Aug 2022
Generative Domain Adaptation for Face Anti-Spoofing Qianyu Zhou Ke-Yue Zhang Taiping Yao Ran Yi Kekai Sheng Shouhong Ding Lizhuang Ma CVBM 32 48 0 20 Jul 2022
Lipschitz Continuity Retained Binary Neural Network Yuzhang Shang Dan Xu Bin Duan Ziliang Zong Liqiang Nie Yan Yan 11 19 0 13 Jul 2022
PointNorm: Dual Normalization is All You Need for Point Cloud Analysis Shen Zheng Jinqian Pan Chang-Tien Lu Gaurav Gupta 3DPC 27 7 0 13 Jul 2022
Understanding and Improving Group Normalization Agus Gunawan Xu Yin Kang Zhang 13 3 0 05 Jul 2022
Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction Kaifeng Lyu Zhiyuan Li Sanjeev Arora FAtt 37 69 0 14 Jun 2022
SmartGD: A GAN-Based Graph Drawing Framework for Diverse Aesthetic Goals Xiaoqi Wang Kevin Yen Yifan Hu Hang Shen 24 4 0 13 Jun 2022
How to Find Actionable Static Analysis Warnings: A Case Study with FindBugs Rahul Yedida Hong Jin Kang Huy Tu Xueqi Yang David Lo Tim Menzies 27 12 0 21 May 2022
Masterful: A Training Platform for Computer Vision Models S. Wookey Yaoshiang Ho Thomas D. Rikert Juan David Gil Lopez Juan Manuel Munoz Beancur ... Ray Tawil Aaron Sabin Jack Lynch Travis Harper Nikhil Gajendrakumar VLM 18 1 0 21 May 2022
FairNorm: Fair and Fast Graph Neural Network Training Öykü Deniz Köse Yanning Shen AI4CE 11 4 0 20 May 2022
Impact of L1 Batch Normalization on Analog Noise Resistant Property of Deep Learning Models Omobayode Fagbohungbe Lijun Qian 27 0 0 07 May 2022
On Fragile Features and Batch Normalization in Adversarial Training Nils Philipp Walter David Stutz Bernt Schiele AAML 13 5 0 26 Apr 2022
Online Convolutional Re-parameterization Mu Hu Junyi Feng Jiashen Hua Baisheng Lai Jianqiang Huang Xiaojin Gong Xiansheng Hua 13 26 0 02 Apr 2022
Testing Feedforward Neural Networks Training Programs Houssem Ben Braiek Foutse Khomh AAML 11 14 0 01 Apr 2022
Continual Normalization: Rethinking Batch Normalization for Online Continual Learning Quang-Cuong Pham Chenghao Liu S. Hoi BDL OnRL 28 57 0 30 Mar 2022
Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing Jun Qi Chao-Han Huck Yang Pin-Yu Chen Javier Tejedor 25 16 0 11 Mar 2022
Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter Pruning Seunghyun Lee B. Song 19 8 0 05 Mar 2022
Variational Autoencoders Without the Variation Gregory A. Daly J. Fieldsend G. Tabor 17 2 0 01 Mar 2022