ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.11604
  4. Cited By
How Does Batch Normalization Help Optimization?

How Does Batch Normalization Help Optimization?

29 May 2018
Shibani Santurkar
Dimitris Tsipras
Andrew Ilyas
A. Madry
    ODL
ArXivPDFHTML

Papers citing "How Does Batch Normalization Help Optimization?"

50 / 141 papers shown
Title
NetSight: Graph Attention Based Traffic Forecasting in Computer Networks
NetSight: Graph Attention Based Traffic Forecasting in Computer Networks
Jinming Xing
Guoheng Sun
Hui Sun
Linchao Pan
Shakir Mahmood
Xuanhao Luo
Muhammad Shahzad
28
0
0
11 May 2025
SPD Learning for Covariance-Based Neuroimaging Analysis: Perspectives, Methods, and Challenges
SPD Learning for Covariance-Based Neuroimaging Analysis: Perspectives, Methods, and Challenges
Ce Ju
Reinmar J. Kobler
Antoine Collas
M. Kawanabe
Cuntai Guan
Bertrand Thirion
41
0
0
26 Apr 2025
Decentralized Federated Domain Generalization with Style Sharing: A Formal Modeling and Convergence Analysis
Decentralized Federated Domain Generalization with Style Sharing: A Formal Modeling and Convergence Analysis
Shahryar Zehtabi
Dong-Jun Han
Seyyedali Hosseinalipour
Christopher G. Brinton
FedML
AI4CE
45
0
0
08 Apr 2025
A Real-time Multimodal Transformer Neural Network-powered Wildfire Forecasting System
Qijun Chen
Shaofan Li
41
0
0
07 Mar 2025
Beyond R-barycenters: an effective averaging method on Stiefel and Grassmann manifolds
Beyond R-barycenters: an effective averaging method on Stiefel and Grassmann manifolds
Florent Bouchard
Nils Laurent
Salem Said
N. L. Bihan
29
1
0
20 Jan 2025
Quantum Cognition-Inspired EEG-based Recommendation via Graph Neural Networks
Jinkun Han
Wei Li
Y. Li
Zhipeng Cai
37
2
0
05 Jan 2025
Data-Efficient Discovery of Hyperelastic TPMS Metamaterials with Extreme
  Energy Dissipation
Data-Efficient Discovery of Hyperelastic TPMS Metamaterials with Extreme Energy Dissipation
Maxine Perroni-Scharf
Zachary Ferguson
Thomas Butrille
Carlos Portela
Mina Konaković Luković
27
0
0
29 May 2024
Hidden Synergy: $L_1$ Weight Normalization and 1-Path-Norm
  Regularization
Hidden Synergy: L1L_1L1​ Weight Normalization and 1-Path-Norm Regularization
Aditya Biswas
36
0
0
29 Apr 2024
Linearly Constrained Weights: Reducing Activation Shift for Faster
  Training of Neural Networks
Linearly Constrained Weights: Reducing Activation Shift for Faster Training of Neural Networks
Takuro Kutsuna
LLMSV
19
1
0
08 Mar 2024
Overcoming Recency Bias of Normalization Statistics in Continual
  Learning: Balance and Adaptation
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation
Yilin Lyu
Liyuan Wang
Xingxing Zhang
Zicheng Sun
Hang Su
Jun Zhu
Liping Jing
34
8
0
13 Oct 2023
Weakly Supervised Multi-Task Representation Learning for Human Activity
  Analysis Using Wearables
Weakly Supervised Multi-Task Representation Learning for Human Activity Analysis Using Wearables
Taoran Sheng
M. Huber
SSL
HAI
16
20
0
06 Aug 2023
AC-Norm: Effective Tuning for Medical Image Analysis via Affine
  Collaborative Normalization
AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization
Chuyan Zhang
Yuncheng Yang
Hao Zheng
Yun Gu
24
0
0
28 Jul 2023
Reinterpreting survival analysis in the universal approximator age
Reinterpreting survival analysis in the universal approximator age
Sören Dittmer
M. Roberts
J. Preller
AIX-COVNET Collaboration
James H. F. Rudd
J. Aston
Carola-Bibiane Schönlieb
27
0
0
25 Jul 2023
Adversarial Latent Autoencoder with Self-Attention for Structural Image
  Synthesis
Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis
Jiajie Fan
L. Vuaille
Hongya Wang
Thomas Bäck
AI4CE
17
5
0
19 Jul 2023
The Implicit Bias of Batch Normalization in Linear Models and Two-layer
  Linear Convolutional Neural Networks
The Implicit Bias of Batch Normalization in Linear Models and Two-layer Linear Convolutional Neural Networks
Yuan Cao
Difan Zou
Yuan-Fang Li
Quanquan Gu
MLT
29
5
0
20 Jun 2023
Group channel pruning and spatial attention distilling for object
  detection
Group channel pruning and spatial attention distilling for object detection
Yun Chu
Pu Li
Yong Bai
Zhuhua Hu
Yongqing Chen
Jiafeng Lu
VLM
24
13
0
02 Jun 2023
On the Weight Dynamics of Deep Normalized Networks
On the Weight Dynamics of Deep Normalized Networks
Christian H. X. Ali Mehmeti-Göpel
Michael Wand
25
1
0
01 Jun 2023
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs
  and ASICs
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
Javier Campos
Zhen Dong
Javier Mauricio Duarte
A. Gholami
Michael W. Mahoney
Jovan Mitrevski
Nhan Tran
MQ
24
3
0
13 Apr 2023
Inductive biases in deep learning models for weather prediction
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
38
5
0
06 Apr 2023
NU-AIR -- A Neuromorphic Urban Aerial Dataset for Detection and
  Localization of Pedestrians and Vehicles
NU-AIR -- A Neuromorphic Urban Aerial Dataset for Detection and Localization of Pedestrians and Vehicles
Craig Iaboni
Thomas Kelly
Pramod Abichandani
16
2
0
18 Feb 2023
Novel Building Detection and Location Intelligence Collection in Aerial
  Satellite Imagery
Novel Building Detection and Location Intelligence Collection in Aerial Satellite Imagery
Sandeep Singh
Christian Wiles
A. Bilal
18
0
0
06 Feb 2023
Modality-Agnostic Variational Compression of Implicit Neural
  Representations
Modality-Agnostic Variational Compression of Implicit Neural Representations
Jonathan Richard Schwarz
Jihoon Tack
Yee Whye Teh
Jaeho Lee
Jinwoo Shin
24
25
0
23 Jan 2023
Low PAPR MIMO-OFDM Design Based on Convolutional Autoencoder
Low PAPR MIMO-OFDM Design Based on Convolutional Autoencoder
Yara Huleihel
H. Permuter
22
6
0
11 Jan 2023
Disentangled Explanations of Neural Network Predictions by Finding
  Relevant Subspaces
Disentangled Explanations of Neural Network Predictions by Finding Relevant Subspaces
Pattarawat Chormai
J. Herrmann
Klaus-Robert Muller
G. Montavon
FAtt
43
17
0
30 Dec 2022
Stable Learning via Sparse Variable Independence
Stable Learning via Sparse Variable Independence
Han Yu
Peng Cui
Yue He
Zheyan Shen
Yong Lin
Renzhe Xu
Xingxuan Zhang
OOD
28
13
0
02 Dec 2022
Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics
Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics
Ancheng Lin
Jun Yu Li
Yusheng Xiang
Wei Bian
Mukesh Prasad
3DPC
ViT
43
2
0
19 Nov 2022
We need to talk about random seeds
We need to talk about random seeds
Steven Bethard
31
8
0
24 Oct 2022
Dynamical Isometry for Residual Networks
Dynamical Isometry for Residual Networks
Advait Gadhikar
R. Burkholz
ODL
AI4CE
32
2
0
05 Oct 2022
RankMe: Assessing the downstream performance of pretrained
  self-supervised representations by their rank
RankMe: Assessing the downstream performance of pretrained self-supervised representations by their rank
Q. Garrido
Randall Balestriero
Laurent Najman
Yann LeCun
SSL
46
72
0
05 Oct 2022
Batch Normalization Explained
Batch Normalization Explained
Randall Balestriero
Richard G. Baraniuk
AAML
28
16
0
29 Sep 2022
On the Pros and Cons of Momentum Encoder in Self-Supervised Visual
  Representation Learning
On the Pros and Cons of Momentum Encoder in Self-Supervised Visual Representation Learning
T. Pham
Chaoning Zhang
Axi Niu
Kang Zhang
Chang-Dong Yoo
36
11
0
11 Aug 2022
EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for
  SAR Target Classification
EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification
Xiang Yu
Zhe Geng
Xiaohua Huang
Qinglu Wang
Daiyin Zhu
30
5
0
03 Aug 2022
Continuous locomotion mode recognition and gait phase estimation based
  on a shank-mounted IMU with artificial neural networks
Continuous locomotion mode recognition and gait phase estimation based on a shank-mounted IMU with artificial neural networks
F. Weigand
Andreas Höhl
Julian Zeiss
U. Konigorski
M. Grimmer
9
3
0
01 Aug 2022
Generative Domain Adaptation for Face Anti-Spoofing
Generative Domain Adaptation for Face Anti-Spoofing
Qianyu Zhou
Ke-Yue Zhang
Taiping Yao
Ran Yi
Kekai Sheng
Shouhong Ding
Lizhuang Ma
CVBM
32
48
0
20 Jul 2022
Lipschitz Continuity Retained Binary Neural Network
Lipschitz Continuity Retained Binary Neural Network
Yuzhang Shang
Dan Xu
Bin Duan
Ziliang Zong
Liqiang Nie
Yan Yan
11
19
0
13 Jul 2022
PointNorm: Dual Normalization is All You Need for Point Cloud Analysis
PointNorm: Dual Normalization is All You Need for Point Cloud Analysis
Shen Zheng
Jinqian Pan
Chang-Tien Lu
Gaurav Gupta
3DPC
27
7
0
13 Jul 2022
Understanding and Improving Group Normalization
Understanding and Improving Group Normalization
Agus Gunawan
Xu Yin
Kang Zhang
13
3
0
05 Jul 2022
Understanding the Generalization Benefit of Normalization Layers:
  Sharpness Reduction
Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction
Kaifeng Lyu
Zhiyuan Li
Sanjeev Arora
FAtt
37
69
0
14 Jun 2022
SmartGD: A GAN-Based Graph Drawing Framework for Diverse Aesthetic Goals
SmartGD: A GAN-Based Graph Drawing Framework for Diverse Aesthetic Goals
Xiaoqi Wang
Kevin Yen
Yifan Hu
Hang Shen
24
4
0
13 Jun 2022
How to Find Actionable Static Analysis Warnings: A Case Study with
  FindBugs
How to Find Actionable Static Analysis Warnings: A Case Study with FindBugs
Rahul Yedida
Hong Jin Kang
Huy Tu
Xueqi Yang
David Lo
Tim Menzies
27
12
0
21 May 2022
Masterful: A Training Platform for Computer Vision Models
Masterful: A Training Platform for Computer Vision Models
S. Wookey
Yaoshiang Ho
Thomas D. Rikert
Juan David Gil Lopez
Juan Manuel Munoz Beancur
...
Ray Tawil
Aaron Sabin
Jack Lynch
Travis Harper
Nikhil Gajendrakumar
VLM
18
1
0
21 May 2022
FairNorm: Fair and Fast Graph Neural Network Training
FairNorm: Fair and Fast Graph Neural Network Training
Öykü Deniz Köse
Yanning Shen
AI4CE
11
4
0
20 May 2022
Impact of L1 Batch Normalization on Analog Noise Resistant Property of
  Deep Learning Models
Impact of L1 Batch Normalization on Analog Noise Resistant Property of Deep Learning Models
Omobayode Fagbohungbe
Lijun Qian
27
0
0
07 May 2022
On Fragile Features and Batch Normalization in Adversarial Training
On Fragile Features and Batch Normalization in Adversarial Training
Nils Philipp Walter
David Stutz
Bernt Schiele
AAML
13
5
0
26 Apr 2022
Online Convolutional Re-parameterization
Online Convolutional Re-parameterization
Mu Hu
Junyi Feng
Jiashen Hua
Baisheng Lai
Jianqiang Huang
Xiaojin Gong
Xiansheng Hua
13
26
0
02 Apr 2022
Testing Feedforward Neural Networks Training Programs
Testing Feedforward Neural Networks Training Programs
Houssem Ben Braiek
Foutse Khomh
AAML
11
14
0
01 Apr 2022
Continual Normalization: Rethinking Batch Normalization for Online
  Continual Learning
Continual Normalization: Rethinking Batch Normalization for Online Continual Learning
Quang-Cuong Pham
Chenghao Liu
S. Hoi
BDL
OnRL
28
57
0
30 Mar 2022
Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on
  Riemannian Gradient Descent With Illustrations of Speech Processing
Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing
Jun Qi
Chao-Han Huck Yang
Pin-Yu Chen
Javier Tejedor
25
16
0
11 Mar 2022
Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter
  Pruning
Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter Pruning
Seunghyun Lee
B. Song
19
8
0
05 Mar 2022
Variational Autoencoders Without the Variation
Variational Autoencoders Without the Variation
Gregory A. Daly
J. Fieldsend
G. Tabor
17
2
0
01 Mar 2022
123
Next