Mean Field Residual Networks: On the Edge of Chaos

Neural Information Processing Systems (NeurIPS), 2017

24 December 2017

Greg Yang

S. Schoenholz

ArXiv (abs)PDF HTML

Papers citing "Mean Field Residual Networks: On the Edge of Chaos"

50 / 130 papers shown

When Vision Transformers Outperform ResNets without Pre-training or Strong Data AugmentationsInternational Conference on Learning Representations (ICLR), 2021

372

375

03 Jun 2021

Tensor Programs IIb: Architectural Universality of Neural Tangent Kernel Training DynamicsInternational Conference on Machine Learning (ICML), 2021

Greg Yang

Etai Littwin

173

08 May 2021

Initialization and Regularization of Factorized Neural LayersInternational Conference on Learning Representations (ICLR), 2021

420

03 May 2021

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed TrainingInternational Conference on Machine Learning (ICML), 2021

Jianfei Chen

224

29 Apr 2021

Towards Deepening Graph Neural Networks: A GNTK-based Optimization PerspectiveInternational Conference on Learning Representations (ICLR), 2021

Wei Huang

208

03 Mar 2021

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired PerspectiveInternational Conference on Learning Representations (ICLR), 2021

501

275

23 Feb 2021

Formalising the Use of the Activation Function in Neural InferenceComplex Systems (CS), 2021

D. A. R. Sakthivadivel

189

02 Feb 2021

Characterizing signal propagation to close the performance gap in unnormalized ResNetsInternational Conference on Learning Representations (ICLR), 2021

Andrew Brock

Soham De

Samuel L. Smith

434

134

21 Jan 2021

Advances in Electron Microscopy with Deep Learning

Jeffrey M. Ede

709

04 Jan 2021

Analyzing Finite Neural Networks: Can We Trust Neural Tangent Kernel Theory?

Mariia Seleznova

Gitta Kutyniok

AAML

246

08 Dec 2020

Feature Learning in Infinite-Width Neural Networks

Greg Yang

J. E. Hu

MLT

429

182

30 Nov 2020

Towards NNGP-guided Neural Architecture Search

Jascha Narain Sohl-Dickstein

BDL

184

11 Nov 2020

Stable ResNetInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

George Deligiannidis

158

24 Oct 2020

BYOL works even without batch statistics

Pierre Harvey Richemond

...

Bilal Piot

486

120

20 Oct 2020

Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit

208

14 Oct 2020

Understanding Approximate Fisher Information for Fast Convergence of Natural Gradient Descent in Wide Neural NetworksNeural Information Processing Systems (NeurIPS), 2020

Ryo Karakida

Kazuki Osawa

269

02 Oct 2020

Tensor Programs III: Neural Matrix Laws

Greg Yang

361

22 Sep 2020

Review: Deep Learning in Electron Microscopy

Jeffrey M. Ede

949

17 Sep 2020

Continuous-in-Depth Neural Networks

289

05 Aug 2020

Finite Versus Infinite Neural Networks: an Empirical StudyNeural Information Processing Systems (NeurIPS), 2020

Jascha Narain Sohl-Dickstein

310

229

31 Jul 2020

Doubly infinite residual neural networks: a diffusion process approach

Stefano Peluchetti

Stefano Favaro

124

07 Jul 2020

Beyond Signal Propagation: Is Feature Diversity Necessary in Deep Neural Network Initialization?

199

02 Jul 2020

Tensor Programs II: Neural Tangent Kernel for Any Architecture

Greg Yang

476

157

25 Jun 2020

Fractional moment-preserving initialization schemes for training deep neural networks

Mert Gurbuzbalaban

Yuanhan Hu

249

25 May 2020

Understanding the Difficulty of Training TransformersConference on Empirical Methods in Natural Language Processing (EMNLP), 2020

Xiaodong Liu

290

286

17 Apr 2020

On the Neural Tangent Kernel of Deep Networks with Orthogonal InitializationInternational Joint Conference on Artificial Intelligence (IJCAI), 2020

Wei Huang

Weitao Du

R. Xu

175

13 Apr 2020

On Infinite-Width Hypernetworks

460

27 Mar 2020

Towards a General Theory of Infinite-Width Limits of Neural ClassifiersInternational Conference on Machine Learning (ICML), 2020

Eugene Golikov

AI4CE

126

12 Mar 2020

ReZero is All You Need: Fast Convergence at Large DepthConference on Uncertainty in Artificial Intelligence (UAI), 2020

Thomas C. Bachlechner

Bodhisattwa Prasad Majumder

399

329

10 Mar 2020

Correlated Initialization for Correlated DataNeural Processing Letters (NPL), 2020

Johannes Schneider

203

09 Mar 2020

Convolutional Spectral Kernel LearningArtificial Intelligence (AIJ), 2020

Jian Li

Yong Liu

Weiping Wang

BDL

28 Feb 2020

Using a thousand optimization tasks to learn hyperparameter search strategies

Jascha Narain Sohl-Dickstein

341

27 Feb 2020

Robust Pruning at InitializationInternational Conference on Learning Representations (ICLR), 2020

189

19 Feb 2020

On the distance between two neural networks and the stability of learningNeural Information Processing Systems (NeurIPS), 2020

507

09 Feb 2020

On Random Kernels of Residual Architectures

Etai Littwin

Tomer Galanti

Lior Wolf

248

28 Jan 2020

Disentangling Trainability and Generalization in Deep Neural Networks

Lechao Xiao

Jeffrey Pennington

S. Schoenholz

204

30 Dec 2019

Towards Efficient Training for Neural Network Quantization

Qing Jin

Linjie Yang

Zhenyu A. Liao

244

21 Dec 2019

Mean field theory for deep dropout networks: digging up gradient backpropagation deeplyEuropean Conference on Artificial Intelligence (ECAI), 2019

Wei Huang

160

19 Dec 2019

Optimization for deep learning: theory and algorithms

Tian Ding

ODL

343

178

19 Dec 2019

Is Feature Diversity Necessary in Neural Network Initialization?

Yaniv Blumenfeld

D. Gilboa

Daniel Soudry

149

11 Dec 2019

Neural Tangents: Fast and Easy Infinite Neural Networks in PythonInternational Conference on Learning Representations (ICLR), 2019

Jascha Narain Sohl-Dickstein

S. Schoenholz

253

249

05 Dec 2019

Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian ProcessesNeural Information Processing Systems (NeurIPS), 2019

Greg Yang

492

221

28 Oct 2019

Pathological spectra of the Fisher information metric and its variants in deep neural networksNeural Computation (Neural Comput.), 2019

Ryo Karakida

S. Akaho

S. Amari

211

14 Oct 2019

Large Deviation Analysis of Function Sensitivity in Random Deep Neural Networks

Bo Li

D. Saad

136

13 Oct 2019

On the expected behaviour of noise regularised deep neural networks as Gaussian processesPattern Recognition Letters (PR), 2019

Arnu Pretorius

Herman Kamper

Steve Kroon

187

12 Oct 2019

The Expressivity and Training of Deep Neural Networks: toward the Edge of Chaos?

Gege Zhang

Gang-cheng Li

Ningwei Shen

Weidong Zhang

173

11 Oct 2019

Deep Learning Theory Review: An Optimal Control and Dynamical Systems Perspective

Guan-Horng Liu

Evangelos A. Theodorou

AI4CE

306

28 Aug 2019

Almost Sure Asymptotic Freeness of Neural Network Jacobian with Orthogonal Weights

Tomohiro Hayase

113

11 Aug 2019

A Fine-Grained Spectral Perspective on Neural Networks

Greg Yang

Hadi Salman

384

117

24 Jul 2019

Order and Chaos: NTK views on DNN Normalization, Checkerboard and Boundary Artifacts

149

11 Jul 2019