ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03167
  4. Cited By
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
v1v2v3 (latest)

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

11 February 2015
Sergey Ioffe
Christian Szegedy
    OOD
ArXiv (abs)PDFHTML

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 13,238 papers shown
When Embedding Models Meet: Procrustes Bounds and Applications
When Embedding Models Meet: Procrustes Bounds and Applications
Lucas Maystre
Alvaro Ortega Gonzalez
Charles Park
Rares Dolga
Tudor Berariu
Yu Zhao
K. Ciosek
158
0
0
15 Oct 2025
Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings
Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings
Riddhish Thakare
Kingdom Mutala Akugri
DiffMSyDa
129
0
0
15 Oct 2025
Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning
Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning
Ozan K. Tonguz
Federico Taschin
88
0
0
14 Oct 2025
Learning Human Motion with Temporally Conditional Mamba
Learning Human Motion with Temporally Conditional Mamba
Quang Minh Nguyen
T. H. Le
Baoru Huang
M. Vu
Ngan Le
Thieu Vo
Anh Duc Nguyen
Mamba
213
0
0
14 Oct 2025
Behavioral Biometrics for Automatic Detection of User Familiarity in VR
Behavioral Biometrics for Automatic Detection of User Familiarity in VR
Numan Zafar
Priyo Ranjan Kundu Prosun
Shafique Ahmad Chaudhry
92
0
0
14 Oct 2025
Layer-Aware Influence for Online Data Valuation Estimation
Layer-Aware Influence for Online Data Valuation Estimation
Ziao Yang
Longbo Huang
Hongfu Liu
TDI
257
0
0
14 Oct 2025
Joint Discriminative-Generative Modeling via Dual Adversarial Training
Joint Discriminative-Generative Modeling via Dual Adversarial Training
Xuwang Yin
Claire Zhang
Julie Steele
Nir Shavit
T. T. Wang
GAN
421
0
0
13 Oct 2025
MIEO: encoding clinical data to enhance cardiovascular event prediction
MIEO: encoding clinical data to enhance cardiovascular event prediction
Davide Borghini
Davide Marchi
Angelo Nardone
Giordano Scerra
Silvia Giulia Galfrè
Alessandro Pingitore
Giuseppe Prencipe
Corrado Priami
Alina Sîrbu
36
0
0
13 Oct 2025
Self-Training with Dynamic Weighting for Robust Gradual Domain Adaptation
Self-Training with Dynamic Weighting for Robust Gradual Domain Adaptation
Zixi Wang
Yushe Cao
Yubo Huang
Jinzhu Wei
Jingzehua Xu
Shuai Zhang
Xin Lai
140
0
0
13 Oct 2025
Deep semi-supervised approach based on consistency regularization and similarity learning for weeds classification
Deep semi-supervised approach based on consistency regularization and similarity learning for weeds classification
Farouq Benchallal
A. Hafiane
Nicolas Ragot
R. Canals
144
0
0
12 Oct 2025
Understanding Self-supervised Contrastive Learning through Supervised Objectives
Understanding Self-supervised Contrastive Learning through Supervised Objectives
Byeongchan Lee
SSL
208
0
0
12 Oct 2025
On the Implicit Adversariality of Catastrophic Forgetting in Deep Continual Learning
On the Implicit Adversariality of Catastrophic Forgetting in Deep Continual Learning
Ze Peng
Jian Zhang
Jintao Guo
Lei Qi
Yang Gao
Yinghuan Shi
AAMLCLL
97
1
0
10 Oct 2025
MTMD: A Multi-Task Multi-Domain Framework for Unified Ad Lightweight Ranking at Pinterest
MTMD: A Multi-Task Multi-Domain Framework for Unified Ad Lightweight Ranking at Pinterest
Xiao Yang
Peifeng Yin
Abe Engle
Jinfeng Zhuang
Ling Leng
60
0
0
10 Oct 2025
A Unified Framework for Lifted Training and Inversion Approaches
A Unified Framework for Lifted Training and Inversion Approaches
Xiaoyu Wang
Alexandra Valavanis
Azhir Mahmood
Andreas Mang
Martin Benning
Audrey Repetti
124
0
0
10 Oct 2025
A Generic Machine Learning Framework for Radio Frequency Fingerprinting
A Generic Machine Learning Framework for Radio Frequency Fingerprinting
Alex Hiles
Bashar I. Ahmad
162
0
0
10 Oct 2025
Phase-Aware Deep Learning with Complex-Valued CNNs for Audio Signal Applications
Phase-Aware Deep Learning with Complex-Valued CNNs for Audio Signal Applications
Naman Agrawal
73
0
0
10 Oct 2025
Learning Regularizers: Learning Optimizers that can Regularize
Learning Regularizers: Learning Optimizers that can Regularize
Suraj Kumar Sahoo
Narayanan C Krishnan
143
0
0
10 Oct 2025
Deep Neural Networks Inspired by Differential Equations
Deep Neural Networks Inspired by Differential Equations
Y. Liu
Lianfang Wang
Kuilin Qin
Qinghua Zhang
Faqiang Wang
Li-min Cui
Jun Liu
Yuping Duan
T. Zeng
AI4TSAI4CE
207
0
0
09 Oct 2025
Random Window Augmentations for Deep Learning Robustness in CT and Liver Tumor Segmentation
Random Window Augmentations for Deep Learning Robustness in CT and Liver Tumor Segmentation
E. A. Østmo
Kristoffer Wickstrøm
Keyur Radiya
Michael C. Kampffmeyer
Karl Øyvind Mikalsen
Robert Jenssen
OOD
134
0
0
09 Oct 2025
Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints
Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints
Zilin Kang
Chonghua Liao
Tingqiang Xu
Huazhe Xu
216
1
0
09 Oct 2025
Demystifying Deep Learning-based Brain Tumor Segmentation with 3D UNets and Explainable AI (XAI): A Comparative Analysis
Demystifying Deep Learning-based Brain Tumor Segmentation with 3D UNets and Explainable AI (XAI): A Comparative Analysis
Ming Jie Ong
Sze Yinn Ung
Sim Kuan Goh
Jimmy Y. Zhong
102
0
0
09 Oct 2025
Noise or Signal? Deconstructing Contradictions and An Adaptive Remedy for Reversible Normalization in Time Series Forecasting
Noise or Signal? Deconstructing Contradictions and An Adaptive Remedy for Reversible Normalization in Time Series Forecasting
Fanzhe Fu
Yang Yang
AI4TS
89
0
0
06 Oct 2025
Beyond Random: Automatic Inner-loop Optimization in Dataset Distillation
Beyond Random: Automatic Inner-loop Optimization in Dataset Distillation
Muquan Li
Hang Gou
Dongyang Zhang
Shuang Liang
Xiurui Xie
Deqiang Ouyang
Ke Qin
DD
219
1
0
06 Oct 2025
How does the optimizer implicitly bias the model merging loss landscape?
How does the optimizer implicitly bias the model merging loss landscape?
Chenxiang Zhang
Alexander Theus
Damien Teney
Antonio Orvieto
Jun Pang
S. Mauw
MoMe
189
1
0
06 Oct 2025
Discretized Quadratic Integrate-and-Fire Neuron Model for Deep Spiking Neural Networks
Discretized Quadratic Integrate-and-Fire Neuron Model for Deep Spiking Neural Networks
Eric Jahns
Davi Moreno
Milan Stojkov
Michel Kinsy
116
0
0
05 Oct 2025
FHEON: A Configurable Framework for Developing Privacy-Preserving Neural Networks Using Homomorphic Encryption
FHEON: A Configurable Framework for Developing Privacy-Preserving Neural Networks Using Homomorphic Encryption
Nges Brian Njungle
Eric Jahns
Michel Kinsy
FedML
119
0
0
05 Oct 2025
On residual network depth
On residual network depth
Benoit Dherin
Michael Munn
MDE
257
0
0
03 Oct 2025
Image Enhancement Based on Pigment Representation
Image Enhancement Based on Pigment Representation
Se-Ho Lee
Keunsoo Ko
Seung Wook Kim
102
0
0
03 Oct 2025
Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale
Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal ScaleIEEE Transactions on robotics (IEEE TRO), 2025
Yongbo Chen
Yanhao Zhang
Shaifali Parashar
Bo Pan
Shoudong Huang
128
0
0
02 Oct 2025
Ensemble Threshold Calibration for Stable Sensitivity Control
Ensemble Threshold Calibration for Stable Sensitivity Control
John N. Daras
36
0
0
02 Oct 2025
Use the Online Network If You Can: Towards Fast and Stable Reinforcement Learning
Use the Online Network If You Can: Towards Fast and Stable Reinforcement Learning
Ahmed Hendawy
Henrik Metternich
Théo Vincent
Mahdi Kallel
Jan Peters
Carlo DÉramo
OffRL
159
0
0
02 Oct 2025
Interactive Training: Feedback-Driven Neural Network Optimization
Interactive Training: Feedback-Driven Neural Network Optimization
Wentao Zhang
Y. Lu
Yuntian Deng
116
1
0
02 Oct 2025
Randomized Matrix Sketching for Neural Network Training and Gradient Monitoring
Randomized Matrix Sketching for Neural Network Training and Gradient Monitoring
Harbir Antil
Deepanshu Verma
69
0
0
01 Oct 2025
On the Benefits of Weight Normalization for Overparameterized Matrix Sensing
On the Benefits of Weight Normalization for Overparameterized Matrix Sensing
Yudong Wei
Liang Zhang
Bingcong Li
Niao He
116
0
0
01 Oct 2025
Generalized Parallel Scaling with Interdependent Generations
Generalized Parallel Scaling with Interdependent Generations
Harry Dong
David Brandfonbrener
Eryk Helenowski
Yun He
Mrinal Kumar
Han Fang
Yuejie Chi
Karthik Abinav Sankararaman
LRM
144
0
0
01 Oct 2025
ProbMed: A Probabilistic Framework for Medical Multimodal Binding
ProbMed: A Probabilistic Framework for Medical Multimodal Binding
Yuan Gao
Sangwook Kim
Jianzhong You
Chris McIntosh
145
0
0
30 Sep 2025
FedMuon: Federated Learning with Bias-corrected LMO-based Optimization
FedMuon: Federated Learning with Bias-corrected LMO-based Optimization
Yuki Takezawa
Anastasia Koloskova
Xiaowen Jiang
Sebastian U. Stich
151
0
0
30 Sep 2025
Reconcile Certified Robustness and Accuracy for DNN-based Smoothed Majority Vote Classifier
Reconcile Certified Robustness and Accuracy for DNN-based Smoothed Majority Vote Classifier
Gaojie Jin
Xinping Yi
Xiaowei Huang
AAML
137
1
0
30 Sep 2025
CODED-SMOOTHING: Coding Theory Helps Generalization
CODED-SMOOTHING: Coding Theory Helps Generalization
Parsa Moradi
Tayyebeh Jahaninezhad
M. Maddah-ali
225
0
0
30 Sep 2025
Marginal Flow: a flexible and efficient framework for density estimation
Marginal Flow: a flexible and efficient framework for density estimation
M. Negri
Jonathan Aellen
Manuel Jahn
AmirEhsan Khorashadizadeh
Volker Roth
128
0
0
30 Sep 2025
Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs
Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs
Arpit Garg
Hemanth Saratchandran
Ravi Garg
Simon Lucey
MUCLL
121
1
0
29 Sep 2025
BALF: Budgeted Activation-Aware Low-Rank Factorization for Fine-Tuning-Free Model Compression
BALF: Budgeted Activation-Aware Low-Rank Factorization for Fine-Tuning-Free Model Compression
David González Martínez
169
0
0
29 Sep 2025
Neural Visibility of Point Sets
Neural Visibility of Point Sets
Jun-Hao Wang
Yi-Yang Tian
Baoquan Chen
Peng-Shuai Wang
3DPC3DV
179
0
0
29 Sep 2025
AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs
AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs
Hakan Emre Gedik
Andrew Martin
Mustafa Munir
Oguzhan Baser
R. Marculescu
Sandeep Chinchali
Alan C. Bovik
ViT
112
0
0
29 Sep 2025
ScatterAD: Temporal-Topological Scattering Mechanism for Time Series Anomaly Detection
ScatterAD: Temporal-Topological Scattering Mechanism for Time Series Anomaly Detection
Tao Yin
Xiaohong Zhang
Shaochen Fu
Zhibin Zhang
Li Huang
Yiyuan Yang
Kaixiang Yang
Meng Yan
AI4TS
273
0
0
29 Sep 2025
XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning
XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning
Daniel Palenicek
Florian Vogt
Joe Watson
Ingmar Posner
Jan Peters
157
0
0
29 Sep 2025
Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
Tianrui Wang
Haoyu Wang
Meng Ge
Cheng Gong
Chunyu Qiang
...
Xiaobao Wang
Eng Siong Chng
Xie Chen
Longbiao Wang
Jianwu Dang
151
0
0
29 Sep 2025
Spatial-Functional awareness Transformer-based graph archetype contrastive learning for Decoding Visual Neural Representations from EEG
Spatial-Functional awareness Transformer-based graph archetype contrastive learning for Decoding Visual Neural Representations from EEG
Yueming Sun
Long Yang
167
0
0
29 Sep 2025
Differentiable Sparsity via $D$-Gating: Simple and Versatile Structured Penalization
Differentiable Sparsity via DDD-Gating: Simple and Versatile Structured Penalization
Chris Kolb
Laetitia Frost
J. Herbinger
David Rügamer
383
0
0
28 Sep 2025
Deep Taxonomic Networks for Unsupervised Hierarchical Prototype Discovery
Deep Taxonomic Networks for Unsupervised Hierarchical Prototype Discovery
Zekun Wang
Ethan L. Haarer
Tianyi Zhu
Zhiyi Dai
Christopher MacLellan
BDL
147
0
0
28 Sep 2025
Previous
123456...263264265
Next