ResearchTrend.AI
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
arXiv:1502.03167 (v3, latest)

11 February 2015
Sergey Ioffe
Christian Szegedy
    OOD

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 13,253 papers shown
Advances in Very Deep Convolutional Neural Networks for LVCSR
Tom Sercu
Vaibhava Goel
211
44
0
06 Apr 2016
Training Constrained Deconvolutional Networks for Road Scene Semantic Segmentation
G. Ros
Simon Stent
P. Alcantarilla
Tomoki Watanabe
138
56
0
06 Apr 2016
Deep Cross Residual Learning for Multitask Visual Recognition
Brendan Jou
Shih-Fu Chang
ObjD
239
95
0
05 Apr 2016
Revisiting Distributed Synchronous SGD
Jianmin Chen
Xinghao Pan
R. Monga
Samy Bengio
Rafal Jozefowicz
344
836
0
04 Apr 2016
A Fully Convolutional Neural Network for Cardiac Segmentation in Short-Axis MRI
Phi Vu Tran
165
334
0
02 Apr 2016
A Semisupervised Approach for Language Identification based on Ladder Networks
Ehud Ben-Reuven
Jacob Goldberger
58
5
0
01 Apr 2016
Learning Multiscale Features Directly From Waveforms
Zhenyao Zhu
Jesse Engel
Awni Y. Hannun
184
65
0
31 Mar 2016
Deep Networks with Stochastic Depth
Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
577
2,558
0
30 Mar 2016
Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles
M. Noroozi
Paolo Favaro
SSL
833
3,175
0
30 Mar 2016
Recurrent Batch Normalization
Tim Cooijmans
Nicolas Ballas
César Laurent
Çağlar Gülçehre
Aaron Courville
ODL
633
414
0
30 Mar 2016
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
151
128
0
30 Mar 2016
Shuffle and Learn: Unsupervised Learning using Temporal Order Verification
Ishan Misra
C. L. Zitnick
M. Hebert
SSL
163
68
0
28 Mar 2016
Colorful Image Colorization
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
628
3,670
0
28 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation
Hoo-Chang Shin
Kirk Roberts
Le Lu
Dina Demner-Fushman
Jianhua Yao
Ronald M. Summers
138
385
0
28 Mar 2016
Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing
S. K. Esser
P. Merolla
John V. Arthur
A. Cassidy
R. Appuswamy
...
Pallab Datta
A. Amir
B. Taba
M. Flickner
D. Modha
3DH
251
744
0
28 Mar 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
688
11,097
0
27 Mar 2016
Resnet in Resnet: Generalizing Residual Architectures
S. Targ
Diogo Almeida
Kevin Lyman
SSeg
205
1,000
0
25 Mar 2016
Stacked Hourglass Networks for Human Pose Estimation
Alejandro Newell
Kaiyu Yang
Jia Deng
3DH
1.3K
5,359
0
22 Mar 2016
Convolution in Convolution for Network in Network
Yanwei Pang
Manli Sun
Xiaoheng Jiang
Xuelong Li
220
180
0
22 Mar 2016
Learning Representations for Automatic Colorization
Gustav Larsson
Michael Maire
Gregory Shakhnarovich
VLM
SSL
433
1,046
0
22 Mar 2016
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE
3DV
383
1,433
0
21 Mar 2016
A Fast Unified Model for Parsing and Sentence Understanding
Samuel R. Bowman
Jon Gauthier
Abhinav Rastogi
Raghav Gupta
Christopher D. Manning
Christopher Potts
336
319
0
19 Mar 2016
Efficient Multi-Scale 3D CNN with Fully Connected CRF for Accurate Brain Lesion Segmentation
Konstantinos Kamnitsas
C. Ledig
Virginia Newcombe
Joanna P. Simpson
A. D. Kane
David Menon
Daniel Rueckert
Ben Glocker
MedIm
3DV
426
3,204
0
18 Mar 2016
Transferring Learned Microcalcification Group Detection from 2D Mammography to 3D Digital Breast Tomosynthesis Using a Hierarchical Model and Scope-based Normalization Features
Yin Yin
S. V. Fotin
Hrishikesh Haldankar
J. Hoffmeister
S. Periaswamy
51
4
0
18 Mar 2016
Generative Image Modeling using Style and Structure Adversarial Networks
Xiaolong Wang
Abhinav Gupta
GAN
271
633
0
17 Mar 2016
Accelerating Deep Neural Network Training with Inconsistent Stochastic Gradient Descent
Linnan Wang
Yi Yang
Martin Renqiang Min
S. Chakradhar
259
95
0
17 Mar 2016
Neural Aggregation Network for Video Face Recognition
Jiaolong Yang
Peiran Ren
Dongqing Zhang
Dong Chen
Fang Wen
Hongdong Li
G. Hua
CVBM
3DH
327
389
0
17 Mar 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
670
4,599
0
16 Mar 2016
Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units
Wenling Shang
Kihyuk Sohn
Diogo Almeida
Honglak Lee
236
519
0
16 Mar 2016
Suppressing the Unusual: towards Robust CNNs using Symmetric Activation Functions
Qiyang Zhao
Lewis D. Griffin
AAML
163
31
0
16 Mar 2016
Identity Mappings in Deep Residual Networks
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
1.3K
10,907
0
16 Mar 2016
Combining the Best of Convolutional Layers and Recurrent Layers: A Hybrid Network for Semantic Segmentation
Zhicheng Yan
Hao Zhang
Yangqing Jia
Thomas Breuel
Yizhou Yu
SSeg
171
42
0
15 Mar 2016
Revisiting Batch Normalization For Practical Domain Adaptation
Yanghao Li
Naiyan Wang
Jianping Shi
Jiaying Liu
Xiaodi Hou
OOD
335
650
0
15 Mar 2016
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martín Abadi
Ashish Agarwal
P. Barham
E. Brevdo
Zhifeng Chen
...
Pete Warden
Martin Wattenberg
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
493
11,556
0
14 Mar 2016
Learning Typographic Style
S. Baluja
126
15
0
13 Mar 2016
Texture Networks: Feed-forward Synthesis of Textures and Stylized Images
Dmitry Ulyanov
V. Lebedev
Andrea Vedaldi
Victor Lempitsky
3DH
307
988
0
10 Mar 2016
Low-rank passthrough neural networks
Antonio Valerio Miceli Barone
296
14
0
10 Mar 2016
DROW: Real-Time Deep Learning based Wheelchair Detection in 2D Range Data
Lucas Beyer
Alexander Hermans
Bastian Leibe
183
46
0
08 Mar 2016
Variational Autoencoders for Semi-supervised Text Classification
Weidi Xu
Haoze Sun
C. Deng
Y. Tan
DRL
176
7
0
08 Mar 2016
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection
Sergey Levine
P. Pastor
A. Krizhevsky
Deirdre Quillen
1.2K
2,160
0
07 Mar 2016
Normalization Propagation: A Parametric Technique for Removing Internal Covariate Shift in Deep Networks
Devansh Arpit
Yingbo Zhou
Bhargava U. Kota
V. Govindaraju
273
131
0
04 Mar 2016
Learning Physical Intuition of Block Towers by Example
Adam Lerer
Sam Gross
Rob Fergus
PINN
270
308
0
03 Mar 2016
HyperFace: A Deep Multi-task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition
Rajeev Ranjan
Vishal M. Patel
Rama Chellappa
CVBM
3DH
380
1,271
0
03 Mar 2016
Learning Functions: When Is Deep Better Than Shallow
H. Mhaskar
Q. Liao
T. Poggio
336
148
0
03 Mar 2016
Molecular Graph Convolutions: Moving Beyond Fingerprints
S. Kearnes
Kevin McCloskey
Marc Berndl
Vijay S. Pande
Patrick F. Riley
GNN
535
1,536
0
02 Mar 2016
Cascaded Subpatch Networks for Effective CNNs
Xiaoheng Jiang
Yanwei Pang
Manli Sun
Xuelong Li
217
40
0
01 Mar 2016
Scalable and Sustainable Deep Learning via Randomized Hashing
Ryan Spring
Anshumali Shrivastava
286
135
0
26 Feb 2016
Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks
Tim Salimans
Diederik P. Kingma
ODL
579
2,053
0
25 Feb 2016
Learning values across many orders of magnitude
H. V. Hasselt
A. Guez
Matteo Hessel
Volodymyr Mnih
David Silver
250
185
0
24 Feb 2016
Group Equivariant Convolutional Networks
Taco S. Cohen
Max Welling
BDL
940
2,179
0
24 Feb 2016