Towards Understanding the Importance of Shortcut Connections in Residual Networks

Neural Information Processing Systems (NeurIPS), 2019

10 September 2019

Papers citing "Towards Understanding the Importance of Shortcut Connections in Residual Networks"

22 / 22 papers shown

Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime

Yuqing Wang

Shangding Gu

198

30 Jun 2025

Cross-Layer Cache Aggregation for Token Reduction in Ultra-Fine-Grained Image Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Edwin Arkel Rios

Jansen Christopher Yuanda

101

03 Jan 2025

Theoretical characterisation of the Gauss-Newton conditioning in Neural Networks

Neural Information Processing Systems (NeurIPS), 2024

490

04 Nov 2024

Residual Connections and Normalization Can Provably Prevent Oversmoothing in GNNs

526

05 Jun 2024

Progressive Feedforward Collapse of ResNet Training

263

02 May 2024

JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention

International Conference on Learning Representations (ICLR), 2023

Yuandong Tian

398

01 Oct 2023

Generalization Ability of Wide Residual Networks

162

29 May 2023

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer

Neural Information Processing Systems (NeurIPS), 2023

483

25 May 2023

Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron

Annual Conference Computational Learning Theory (COLT), 2023

Weihang Xu

S. Du

308

20 Feb 2023

SML: Enhance the Network Smoothness with Skip Meta Logit for CTR Prediction

156

09 Oct 2022

Nearly Minimax Algorithms for Linear Bandits with Shared Representation

257

29 Mar 2022

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Neural Information Processing Systems (NeurIPS), 2021

275

10 Nov 2021

Augmented Shortcuts for Vision Transformers

Neural Information Processing Systems (NeurIPS), 2021

233

30 Jun 2021

SurvNAM: The machine learning survival model explanation

Neural Networks (NN), 2021

201

18 Apr 2021

Spectral Analysis of the Neural Tangent Kernel for Deep Residual Networks

Journal of Machine Learning Research (JMLR), 2021

175

07 Apr 2021

Learning Frequency Domain Approximation for Binary Neural Networks

Neural Information Processing Systems (NeurIPS), 2021

291

01 Mar 2021

Continuous-in-Depth Neural Networks

281

05 Aug 2020

Proactive Network Maintenance using Fast, Accurate Anomaly Localization and Classification on 1-D Data Series

International Conference on Prognostics and Health Management (PHM), 2020

J. Zhu

K. Sundaresan

J. Rupe

17 Jul 2020

On the Demystification of Knowledge Distillation: A Residual Network Perspective

N. Jha

Rajat Saini

Sparsh Mittal

142

30 Jun 2020

A Mean-field Analysis of Deep ResNet and Beyond: Towards Provable Optimization Via Overparameterization From Depth

International Conference on Machine Learning (ICML), 2020

Chao Ma

305

11 Mar 2020

Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? -- A Neural Tangent Kernel Perspective

Neural Information Processing Systems (NeurIPS), 2020

Kaixuan Huang

121

103

14 Feb 2020

On a Sparse Shortcut Topology of Artificial Neural Networks

IEEE Transactions on Artificial Intelligence (IEEE TAI), 2018

Dayang Wang

377

22 Nov 2018