Path-SGD: Path-Normalized Optimization in Deep Neural Networks

Neural Information Processing Systems (NeurIPS), 2015

8 June 2015

Papers citing "Path-SGD: Path-Normalized Optimization in Deep Neural Networks"

50 / 195 papers shown

On regularization of gradient descent, layer imbalance and flat minima

Boris Ginsburg

18 Jul 2020

RIFLE: Backpropagation in Depth for Deep Transfer Learning through Re-Initializing the Fully-connected LayEr

Haoyi Xiong

127

07 Jul 2020

Using Human Psychophysics to Evaluate Generalization in Scene Text Recognition Models

30 Jun 2020

Learning compositional functions via multiplicative weight updates

246

25 Jun 2020

Shape Matters: Understanding the Implicit Bias of the Noise Covariance

615

109

15 Jun 2020

FLeet: Online Federated Learning via Staleness Awareness and Performance PredictionInternational Middleware Conference (Middleware), 2020

Francois Taiani

256

12 Jun 2020

Tangent Space Sensitivity and Distribution of Linear Regions in ReLU Networks

Balint Daroczy

AAML

100

11 Jun 2020

Neural Path Features and Neural Path Kernel : Understanding the role of gates in deep learningNeural Information Processing Systems (NeurIPS), 2020

Chandrashekar Lakshminarayanan

Amit Singh

AI4CE

183

11 Jun 2020

Banach Space Representer Theorems for Neural Networks and Ridge Splines

Rahul Parhi

Robert D. Nowak

226

10 Jun 2020

Pruning neural networks without any data by iteratively conserving synaptic flow

559

769

09 Jun 2020

Statistical Guarantees for Regularized Neural NetworksNeural Networks (NN), 2020

Mahsa Taheri

Fang Xie

Johannes Lederer

278

30 May 2020

Scaling-up Distributed Processing of Data Streams for Machine Learning

M. Nokleby

Haroon Raja

W. Bajwa

232

18 May 2020

Dropout: Explicit Forms and Capacity ControlInternational Conference on Machine Learning (ICML), 2020

213

06 Mar 2020

On the distance between two neural networks and the stability of learningNeural Information Processing Systems (NeurIPS), 2020

507

09 Feb 2020

Understanding Generalization in Deep Learning via Tensor MethodsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

Furong Huang

431

14 Jan 2020

Relative Flatness and GeneralizationNeural Information Processing Systems (NeurIPS), 2020

375

03 Jan 2020

Optimization for deep learning: theory and algorithms

Tian Ding

ODL

346

179

19 Dec 2019

A priori generalization error for two-layer ReLU neural network through minimum norm solution

184

06 Dec 2019

Fantastic Generalization Measures and Where to Find ThemInternational Conference on Learning Representations (ICLR), 2019

462

673

04 Dec 2019

Information-Theoretic Local Minima Characterization and RegularizationInternational Conference on Machine Learning (ICML), 2019

Zhiwei Jia

Hao Su

249

19 Nov 2019

On Generalization Bounds of a Family of Recurrent Neural NetworksInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2018

Minshuo Chen

Xingguo Li

T. Zhao

251

28 Oct 2019

Interpreting Basis Path Set in Neural Networks

18 Oct 2019

Student Specialization in Deep ReLU Networks With Finite Width and Input Dimension

Yuandong Tian

MLT

218

30 Sep 2019

Quantum Natural GradientQuantum (Quantum), 2019

241

479

04 Sep 2019

Gradient Descent Maximizes the Margin of Homogeneous Neural NetworksInternational Conference on Learning Representations (ICLR), 2019

Kaifeng Lyu

Jian Li

540

373

13 Jun 2019

The Implicit Bias of AdaGrad on Separable DataNeural Information Processing Systems (NeurIPS), 2019

Qian Qian

Xiaoyuan Qian

135

09 Jun 2019

Inductive Bias of Gradient Descent based Adversarial Training on Separable Data

273

07 Jun 2019

On Dropout and Nuclear Norm RegularizationInternational Conference on Machine Learning (ICML), 2019

Poorya Mianjy

R. Arora

264

28 May 2019

Quantifying the generalization error in deep learning in terms of data distribution and neural network smoothnessNeural Networks (NN), 2019

351

27 May 2019

Exploring Structural Sparsity of Deep Networks via Inverse Scale SpacesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019

261

23 May 2019

Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep ModelsInternational Conference on Machine Learning (ICML), 2019

204

17 May 2019

Implicit Regularization of Discrete Gradient Dynamics in Linear Neural NetworksNeural Information Processing Systems (NeurIPS), 2019

204

170

30 Apr 2019

Iterative Normalization: Beyond Standardization towards Efficient Whitening

196

177

06 Apr 2019

Positively Scale-Invariant Flatness of ReLU Neural Networks

174

06 Mar 2019

A Priori Estimates of the Population Risk for Residual Networks

E. Weinan

Chao Ma

Qingcan Wang

UQCV

222

06 Mar 2019

Equi-normalization of Neural NetworksInternational Conference on Learning Representations (ICLR), 2019

Pierre Stock

Benjamin Graham

Rémi Gribonval

Edouard Grave

ODL

144

27 Feb 2019

A Scale Invariant Flatness Measure for Deep Network Minima

163

06 Feb 2019

Are All Layers Created Equal?

Chiyuan Zhang

Samy Bengio

Y. Singer

337

158

06 Feb 2019

Trajectory Normalized Gradients for Distributed Optimization

132

24 Jan 2019

A Theoretical Analysis of Deep Q-Learning

604

711

01 Jan 2019

A Differential Topological View of Challenges in Learning with Feedforward Neural Networks

Hao Shen

AAML AI4CE

148

26 Nov 2018

Deep Frank-Wolfe For Neural Network OptimizationInternational Conference on Learning Representations (ICLR), 2018

204

19 Nov 2018

A Bayesian Perspective of Convolutional Neural Networks through a Deconvolutional Generative Model

Richard G. Baraniuk

250

01 Nov 2018

The loss surface of deep linear networks viewed through the algebraic geometry lens

233

17 Oct 2018

Regularization Matters: Generalization and Optimization of Neural Nets v.s. their Induced Kernel

735

263

12 Oct 2018

Capacity Control of ReLU Neural Networks by Basis-path Norm

159

19 Sep 2018

Approximation and Estimation for High-Dimensional Deep Learning Networks

Andrew R. Barron

Jason M. Klusowski

222

10 Sep 2018

Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex

Hongyang R. Zhang

Junru Shao

Ruslan Salakhutdinov

285

06 Jun 2018

Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced

465

264

04 Jun 2018

Implicit Bias of Gradient Descent on Linear Convolutional Networks

468

444

01 Jun 2018