v1v2v3 (latest)

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

11 February 2015

Sergey Ioffe

Christian Szegedy

OOD

ArXiv (abs)PDF HTML

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 13,238 papers shown

When Embedding Models Meet: Procrustes Bounds and Applications

Lucas Maystre

Alvaro Ortega Gonzalez

158

15 Oct 2025

Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings

Riddhish Thakare

Kingdom Mutala Akugri

DiffM SyDa

129

15 Oct 2025

Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning

Ozan K. Tonguz

Federico Taschin

14 Oct 2025

Learning Human Motion with Temporally Conditional Mamba

213

14 Oct 2025

Behavioral Biometrics for Automatic Detection of User Familiarity in VR

Numan Zafar

Priyo Ranjan Kundu Prosun

Shafique Ahmad Chaudhry

14 Oct 2025

Layer-Aware Influence for Online Data Valuation Estimation

257

14 Oct 2025

Joint Discriminative-Generative Modeling via Dual Adversarial Training

421

13 Oct 2025

MIEO: encoding clinical data to enhance cardiovascular event prediction

13 Oct 2025

Self-Training with Dynamic Weighting for Robust Gradual Domain Adaptation

140

13 Oct 2025

Deep semi-supervised approach based on consistency regularization and similarity learning for weeds classification

144

12 Oct 2025

Understanding Self-supervised Contrastive Learning through Supervised Objectives

Byeongchan Lee

SSL

208

12 Oct 2025

On the Implicit Adversariality of Catastrophic Forgetting in Deep Continual Learning

10 Oct 2025

MTMD: A Multi-Task Multi-Domain Framework for Unified Ad Lightweight Ranking at Pinterest

10 Oct 2025

A Unified Framework for Lifted Training and Inversion Approaches

124

10 Oct 2025

A Generic Machine Learning Framework for Radio Frequency Fingerprinting

Alex Hiles

Bashar I. Ahmad

162

10 Oct 2025

Phase-Aware Deep Learning with Complex-Valued CNNs for Audio Signal Applications

Naman Agrawal

10 Oct 2025

Learning Regularizers: Learning Optimizers that can Regularize

Suraj Kumar Sahoo

Narayanan C Krishnan

143

10 Oct 2025

Deep Neural Networks Inspired by Differential Equations

207

09 Oct 2025

Random Window Augmentations for Deep Learning Robustness in CT and Liver Tumor Segmentation

E. A. Østmo

Kristoffer Wickstrøm

Keyur Radiya

Michael C. Kampffmeyer

Karl Øyvind Mikalsen

Robert Jenssen

OOD

134

09 Oct 2025

Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints

216

09 Oct 2025

Demystifying Deep Learning-based Brain Tumor Segmentation with 3D UNets and Explainable AI (XAI): A Comparative Analysis

102

09 Oct 2025

Noise or Signal? Deconstructing Contradictions and An Adaptive Remedy for Reversible Normalization in Time Series Forecasting

Fanzhe Fu

Yang Yang

AI4TS

06 Oct 2025

Beyond Random: Automatic Inner-loop Optimization in Dataset Distillation

219

06 Oct 2025

How does the optimizer implicitly bias the model merging loss landscape?

189

06 Oct 2025

Discretized Quadratic Integrate-and-Fire Neuron Model for Deep Spiking Neural Networks

116

05 Oct 2025

FHEON: A Configurable Framework for Developing Privacy-Preserving Neural Networks Using Homomorphic Encryption

119

05 Oct 2025

On residual network depth

Benoit Dherin

Michael Munn

MDE

257

03 Oct 2025

Image Enhancement Based on Pigment Representation

Se-Ho Lee

Keunsoo Ko

Seung Wook Kim

102

03 Oct 2025

Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal ScaleIEEE Transactions on robotics (IEEE TRO), 2025

128

02 Oct 2025

Ensemble Threshold Calibration for Stable Sensitivity Control

John N. Daras

02 Oct 2025

Use the Online Network If You Can: Towards Fast and Stable Reinforcement Learning

159

02 Oct 2025

Interactive Training: Feedback-Driven Neural Network Optimization

Wentao Zhang

Y. Lu

Yuntian Deng

116

02 Oct 2025

Randomized Matrix Sketching for Neural Network Training and Gradient Monitoring

Harbir Antil

Deepanshu Verma

01 Oct 2025

On the Benefits of Weight Normalization for Overparameterized Matrix Sensing

116

01 Oct 2025

Generalized Parallel Scaling with Interdependent Generations

Karthik Abinav Sankararaman

LRM

144

01 Oct 2025

ProbMed: A Probabilistic Framework for Medical Multimodal Binding

145

30 Sep 2025

FedMuon: Federated Learning with Bias-corrected LMO-based Optimization

151

30 Sep 2025

Reconcile Certified Robustness and Accuracy for DNN-based Smoothed Majority Vote Classifier

137

30 Sep 2025

CODED-SMOOTHING: Coding Theory Helps Generalization

Parsa Moradi

Tayyebeh Jahaninezhad

M. Maddah-ali

225

30 Sep 2025

Marginal Flow: a flexible and efficient framework for density estimation

M. Negri

Jonathan Aellen

Manuel Jahn

AmirEhsan Khorashadizadeh

Volker Roth

128

30 Sep 2025

Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs

Arpit Garg

Hemanth Saratchandran

Ravi Garg

Simon Lucey

MU CLL

121

29 Sep 2025

BALF: Budgeted Activation-Aware Low-Rank Factorization for Fine-Tuning-Free Model Compression

David González Martínez

169

29 Sep 2025

Neural Visibility of Point Sets

179

29 Sep 2025

AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs

112

29 Sep 2025

ScatterAD: Temporal-Topological Scattering Mechanism for Time Series Anomaly Detection

273

29 Sep 2025

XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning

157

29 Sep 2025

Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis

...

151

29 Sep 2025

Spatial-Functional awareness Transformer-based graph archetype contrastive learning for Decoding Visual Neural Representations from EEG

Yueming Sun

Long Yang

167

29 Sep 2025

Differentiable Sparsity via

D

-Gating: Simple and Versatile Structured Penalization

383

28 Sep 2025

Deep Taxonomic Networks for Unsupervised Hierarchical Prototype Discovery

Christopher MacLellan

BDL

147

28 Sep 2025