How Does Batch Normalization Help Optimization?

29 May 2018
Shibani Santurkar
Dimitris Tsipras
Andrew Ilyas
A. Madry
    ODL

Papers citing "How Does Batch Normalization Help Optimization?"

50 / 141 papers shown
Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting
Shi-Wee Deng
Yuhang Li
Shanghang Zhang
Shi Gu
114
243
0
24 Feb 2022
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain
Chuan-Xian Ren
Yong-Jin Liu
Xiwen Zhang
Ke-Kun Huang
AAML
OOD
18
91
0
22 Feb 2022
Diagnosing Batch Normalization in Class Incremental Learning
Minghao Zhou
Quanziang Wang
Jun Shu
Qian Zhao
Deyu Meng
CLL
35
6
0
16 Feb 2022
How Do Vision Transformers Work?
Namuk Park
Songkuk Kim
ViT
30
465
0
14 Feb 2022
DeepStability: A Study of Unstable Numerical Methods and Their Solutions in Deep Learning
Eliska Kloberdanz
Kyle G. Kloberdanz
Wei Le
14
15
0
07 Feb 2022
Architecture Matters in Continual Learning
Seyed Iman Mirzadeh
Arslan Chaudhry
Dong Yin
Timothy Nguyen
Razvan Pascanu
Dilan Görür
Mehrdad Farajtabar
OOD
KELM
114
58
0
01 Feb 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
30
100
0
11 Jan 2022
GNN-Geo: A Graph Neural Network-based Fine-grained IP geolocation Framework
Shichang Ding
Xiangyang Luo
Jinwei Wang
Xiaoming Fu
19
14
0
18 Dec 2021
Super-resolution reconstruction of cytoskeleton image based on A-net deep learning network
Qian Chen
Hao Bai
Bingchen Che
Tianyun Zhao
Ce Zhang
Kaige Wang
Jintao Bai
Wei Zhao
25
3
0
17 Dec 2021
Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention
Zitian Zhang
Chuhua Xian
3DV
MDE
27
0
0
15 Dec 2021
Curriculum Learning for Vision-and-Language Navigation
Jiwen Zhang
Zhongyu Wei
Jianqing Fan
J. Peng
LM&Ro
26
20
0
14 Nov 2021
Blending Anti-Aliasing into Vision Transformer
Shengju Qian
Hao Shao
Yi Zhu
Mu Li
Jiaya Jia
21
20
0
28 Oct 2021
Revisiting Batch Norm Initialization
Jim Davis
Logan Frank
16
4
0
26 Oct 2021
Training Deep Neural Networks with Joint Quantization and Pruning of Weights and Activations
Xinyu Zhang
Ian Colbert
Ken Kreutz-Delgado
Srinjoy Das
MQ
26
11
0
15 Oct 2021
The Unreasonable Effectiveness of the Final Batch Normalization Layer
Veysel Kocaman
O. M. Shir
T. Baeck
13
1
0
18 Sep 2021
Unsupervised Domain Adaptation for Retinal Vessel Segmentation with Adversarial Learning and Transfer Normalization
Wei Feng
Lie Ju
Lin Wang
Kaimin Song
Xin Wang
Xin Zhao
Qingyi Tao
Z. Ge
OOD
MedIm
15
4
0
04 Aug 2021
Batch Normalization Preconditioning for Neural Network Training
Susanna Lange
Kyle E. Helfrich
Qiang Ye
22
9
0
02 Aug 2021
SimROD: A Simple Adaptation Method for Robust Object Detection
Rindranirina Ramamonjison
Amin Banitalebi-Dehkordi
Xinyu Kang
Xiaolong Bai
Yong Zhang
ObjD
TTA
24
53
0
28 Jul 2021
DeltaCharger: Charging Robot with Inverted Delta Mechanism and CNN-driven High Fidelity Tactile Perception for Precise 3D Positioning
Iaroslav Okunevich
Daria Trinitatova
Pavel Kopanev
Dzmitry Tsetserukou
17
11
0
22 Jul 2021
Unsupervised Model Drift Estimation with Batch Normalization Statistics for Dataset Shift Detection and Model Selection
Won-Jo Lee
Seokhyun Byun
Jooeun Kim
Minje Park
Kirill Chechil
AI4TS
19
2
0
01 Jul 2021
The Values Encoded in Machine Learning Research
Abeba Birhane
Pratyusha Kalluri
Dallas Card
William Agnew
Ravit Dotan
Michelle Bao
25
273
0
29 Jun 2021
GemNet: Universal Directional Graph Neural Networks for Molecules
Johannes Klicpera
Florian Becker
Stephan Günnemann
AI4CE
19
433
0
02 Jun 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
16
52
0
11 May 2021
"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization
Tianlong Chen
Zhenyu (Allen) Zhang
Xu Ouyang
Zechun Liu
Zhiqiang Shen
Zhangyang Wang
MQ
33
36
0
16 Apr 2021
Deep Recursive Embedding for High-Dimensional Data
Zixia Zhou
Yuanyuan Wang
B. Lelieveldt
Qian Tao
24
7
0
12 Apr 2021
Disentangled Contrastive Learning for Learning Robust Textual Representations
Xiang Chen
Xin Xie
Zhen Bi
Hongbin Ye
Shumin Deng
Ningyu Zhang
Huajun Chen
33
5
0
11 Apr 2021
Relating Adversarially Robust Generalization to Flat Minima
David Stutz
Matthias Hein
Bernt Schiele
OOD
24
65
0
09 Apr 2021
Delving into Variance Transmission and Normalization: Shift of Average Gradient Makes the Network Collapse
YuXiang Liu
Jidong Ge
Chuanyi Li
Jie Gui
13
2
0
22 Mar 2021
High-Performance Large-Scale Image Recognition Without Normalization
Andrew Brock
Soham De
Samuel L. Smith
Karen Simonyan
VLM
223
512
0
11 Feb 2021
Locally Adaptive Label Smoothing for Predictive Churn
Dara Bahri
Heinrich Jiang
NoLa
29
8
0
09 Feb 2021
Consensus Control for Decentralized Deep Learning
Lingjing Kong
Tao R. Lin
Anastasia Koloskova
Martin Jaggi
Sebastian U. Stich
19
75
0
09 Feb 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
124
673
0
24 Jan 2021
Advances in Electron Microscopy with Deep Learning
Jeffrey M. Ede
29
2
0
04 Jan 2021
Understanding and Increasing Efficiency of Frank-Wolfe Adversarial Training
Theodoros Tsiligkaridis
Jay Roberts
AAML
11
11
0
22 Dec 2020
Meta-Generating Deep Attentive Metric for Few-shot Classification
Lei Zhang
Fei Zhou
Wei Wei
Yanning Zhang
VLM
34
28
0
03 Dec 2020
Design Space for Graph Neural Networks
Jiaxuan You
Rex Ying
J. Leskovec
GNN
AI4CE
17
315
0
17 Nov 2020
DTGAN: Dual Attention Generative Adversarial Networks for Text-to-Image Generation
Zhenxing Zhang
Lambert Schomaker
GAN
31
34
0
05 Nov 2020
Revisiting Batch Normalization for Training Low-latency Deep Spiking Neural Networks from Scratch
Youngeun Kim
Priyadarshini Panda
20
170
0
05 Oct 2020
Weight and Gradient Centralization in Deep Neural Networks
Wolfgang Fuhl
Enkelejda Kasneci
ODL
8
18
0
02 Oct 2020
Conditional Image Generation with One-Vs-All Classifier
Xiangrui Xu
Yaqin Li
Cao Yuan
VLM
GAN
25
12
0
18 Sep 2020
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
26
79
0
17 Sep 2020
GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training
Tianle Cai
Shengjie Luo
Keyulu Xu
Di He
Tie-Yan Liu
Liwei Wang
GNN
16
158
0
07 Sep 2020
Improving robustness against common corruptions by covariate shift adaptation
Steffen Schneider
E. Rusak
L. Eck
Oliver Bringmann
Wieland Brendel
Matthias Bethge
VLM
31
457
0
30 Jun 2020
Enhancement of a CNN-Based Denoiser Based on Spatial and Spectral Analysis
Rui Zhao
K. Lam
D. Lun
17
7
0
28 Jun 2020
DO-Conv: Depthwise Over-parameterized Convolutional Layer
Jinming Cao
Yangyan Li
Mingchao Sun
Ying Chen
Dani Lischinski
Daniel Cohen-Or
Baoquan Chen
Changhe Tu
OOD
31
165
0
22 Jun 2020
New Interpretations of Normalization Methods in Deep Learning
Jiacheng Sun
Xiangyong Cao
Hanwen Liang
Weiran Huang
Zewei Chen
Zhenguo Li
11
34
0
16 Jun 2020
Training Deep Spiking Neural Networks
Eimantas Ledinauskas
J. Ruseckas
Alfonsas Jursenas
Giedrius Burachas
6
54
0
08 Jun 2020
Normalized Convolutional Neural Network
Dongsuk Kim
Geonhee Lee
Myungjae Lee
S. Kang
Dongmin Kim
18
1
0
11 May 2020
TIMELY: Pushing Data Movements and Interfaces in PIM Accelerators Towards Local and in Time Domain
Weitao Li
Pengfei Xu
Yang Katie Zhao
Haitong Li
Yuan Xie
Yingyan Lin
9
68
0
03 May 2020
A Batch Normalized Inference Network Keeps the KL Vanishing Away
Qile Zhu
Jianlin Su
Wei Bi
Xiaojiang Liu
Xiyao Ma
Xiaolin Li
D. Wu
BDL
DRL
29
61
0
27 Apr 2020