ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.02617
  4. Cited By
Path-SGD: Path-Normalized Optimization in Deep Neural Networks

Path-SGD: Path-Normalized Optimization in Deep Neural Networks

Neural Information Processing Systems (NeurIPS), 2015
8 June 2015
Behnam Neyshabur
Ruslan Salakhutdinov
Nathan Srebro
    ODL
ArXiv (abs)PDFHTML

Papers citing "Path-SGD: Path-Normalized Optimization in Deep Neural Networks"

50 / 195 papers shown
Vision Transformers provably learn spatial structure
Vision Transformers provably learn spatial structureNeural Information Processing Systems (NeurIPS), 2022
Samy Jelassi
Michael E. Sander
Yuan-Fang Li
ViTMLT
225
102
0
13 Oct 2022
PathProx: A Proximal Gradient Algorithm for Weight Decay Regularized
  Deep Neural Networks
PathProx: A Proximal Gradient Algorithm for Weight Decay Regularized Deep Neural Networks
Liu Yang
Jifan Zhang
Joseph Shenouda
Dimitris Papailiopoulos
Kangwook Lee
Robert D. Nowak
354
2
0
06 Oct 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent
  Kernel
Scale-invariant Bayesian Neural Networks with Connectivity Tangent KernelInternational Conference on Learning Representations (ICLR), 2022
Sungyub Kim
Si-hun Park
Kyungsu Kim
Eunho Yang
BDL
183
7
0
30 Sep 2022
Small Transformers Compute Universal Metric Embeddings
Small Transformers Compute Universal Metric EmbeddingsJournal of machine learning research (JMLR), 2022
Anastasis Kratsios
Valentin Debarnot
Ivan Dokmanić
317
14
0
14 Sep 2022
Quiver neural networks
Quiver neural networks
I. Ganev
Robin Walters
147
4
0
26 Jul 2022
Towards understanding how momentum improves generalization in deep
  learning
Towards understanding how momentum improves generalization in deep learningInternational Conference on Machine Learning (ICML), 2022
Samy Jelassi
Yuanzhi Li
ODLMLTAI4CE
201
46
0
13 Jul 2022
Predicting Out-of-Domain Generalization with Neighborhood Invariance
Predicting Out-of-Domain Generalization with Neighborhood Invariance
Nathan Ng
Neha Hulkund
Dong Wang
Marzyeh Ghassemi
OOD
371
6
0
05 Jul 2022
Local Identifiability of Deep ReLU Neural Networks: the Theory
Local Identifiability of Deep ReLU Neural Networks: the TheoryNeural Information Processing Systems (NeurIPS), 2022
Joachim Bona-Pellissier
Franccois Malgouyres
François Bachoc
FAtt
296
11
0
15 Jun 2022
Symmetry Teleportation for Accelerated Optimization
Symmetry Teleportation for Accelerated OptimizationNeural Information Processing Systems (NeurIPS), 2022
B. Zhao
Nima Dehmamy
Robin Walters
Rose Yu
ODL
443
28
0
21 May 2022
Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep
  Neural Network, a Survey
Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a SurveyArtificial Intelligence Review (Artif Intell Rev), 2022
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
DD
336
33
0
17 May 2022
Cracking White-box DNN Watermarks via Invariant Neuron Transforms
Cracking White-box DNN Watermarks via Invariant Neuron TransformsKnowledge Discovery and Data Mining (KDD), 2022
Yifan Yan
Xudong Pan
Yining Wang
Mi Zhang
Min Yang
AAML
148
20
0
30 Apr 2022
Higher-Order Generalization Bounds: Learning Deep Probabilistic Programs
  via PAC-Bayes Objectives
Higher-Order Generalization Bounds: Learning Deep Probabilistic Programs via PAC-Bayes Objectives
J. Warrell
M. Gerstein
GP
183
1
0
30 Mar 2022
A Local Convergence Theory for the Stochastic Gradient Descent Method in
  Non-Convex Optimization With Non-isolated Local Minima
A Local Convergence Theory for the Stochastic Gradient Descent Method in Non-Convex Optimization With Non-isolated Local Minima
Tae-Eon Ko
Xiantao Li
232
2
0
21 Mar 2022
Penalizing Gradient Norm for Efficiently Improving Generalization in
  Deep Learning
Penalizing Gradient Norm for Efficiently Improving Generalization in Deep LearningInternational Conference on Machine Learning (ICML), 2022
Yang Zhao
Hao Zhang
Xiuyuan Hu
519
153
0
08 Feb 2022
Training invariances and the low-rank phenomenon: beyond linear networks
Training invariances and the low-rank phenomenon: beyond linear networksInternational Conference on Learning Representations (ICLR), 2022
Thien Le
Stefanie Jegelka
254
36
0
28 Jan 2022
Approximation bounds for norm constrained neural networks with
  applications to regression and GANs
Approximation bounds for norm constrained neural networks with applications to regression and GANsApplied and Computational Harmonic Analysis (ACHA), 2022
Yuling Jiao
Yang Wang
Yunfei Yang
232
26
0
24 Jan 2022
Weight Expansion: A New Perspective on Dropout and Generalization
Weight Expansion: A New Perspective on Dropout and Generalization
Gao Jin
Xinping Yi
Pengfei Yang
Lijun Zhang
S. Schewe
Xiaowei Huang
283
6
0
23 Jan 2022
Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning
  Optimization Landscape
Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization LandscapeInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Devansh Bisla
Jing Wang
A. Choromańska
331
45
0
20 Jan 2022
Measuring Complexity of Learning Schemes Using Hessian-Schatten Total
  Variation
Measuring Complexity of Learning Schemes Using Hessian-Schatten Total Variation
Shayan Aziznejad
Joaquim Campos
M. Unser
249
12
0
12 Dec 2021
GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both
  Homophily and Heterophily
GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and HeterophilyThe Web Conference (WWW), 2021
Lun Du
Xiaozhou Shi
Qiang Fu
Xiaojun Ma
Hengyu Liu
Shi Han
Dongmei Zhang
496
143
0
29 Oct 2021
In Search of Probeable Generalization Measures
In Search of Probeable Generalization MeasuresInternational Conference on Machine Learning and Applications (ICMLA), 2021
Jonathan Jaegerman
Khalil Damouni
M. M. Ankaralı
Konstantinos N. Plataniotis
158
2
0
23 Oct 2021
Path Regularization: A Convexity and Sparsity Inducing Regularization
  for Parallel ReLU Networks
Path Regularization: A Convexity and Sparsity Inducing Regularization for Parallel ReLU Networks
Tolga Ergen
Mert Pilanci
441
18
0
18 Oct 2021
The Role of Permutation Invariance in Linear Mode Connectivity of Neural
  Networks
The Role of Permutation Invariance in Linear Mode Connectivity of Neural NetworksInternational Conference on Learning Representations (ICLR), 2021
R. Entezari
Hanie Sedghi
O. Saukh
Behnam Neyshabur
MoMe
605
273
0
12 Oct 2021
Perturbated Gradients Updating within Unit Space for Deep Learning
Perturbated Gradients Updating within Unit Space for Deep Learning
Ching-Hsun Tseng
Liu Cheng
Shin-Jye Lee
Xiaojun Zeng
378
5
0
01 Oct 2021
Understanding neural networks with reproducing kernel Banach spaces
Understanding neural networks with reproducing kernel Banach spaces
Francesca Bartolucci
Ernesto De Vito
Lorenzo Rosasco
Stefano Vigogna
278
59
0
20 Sep 2021
Near-Minimax Optimal Estimation With Shallow ReLU Neural Networks
Near-Minimax Optimal Estimation With Shallow ReLU Neural Networks
Rahul Parhi
Robert D. Nowak
356
43
0
18 Sep 2021
Batch Normalization Preconditioning for Neural Network Training
Batch Normalization Preconditioning for Neural Network Training
Susanna Lange
Kyle E. Helfrich
Qiang Ye
210
14
0
02 Aug 2021
An Embedding of ReLU Networks and an Analysis of their Identifiability
An Embedding of ReLU Networks and an Analysis of their IdentifiabilityConstructive approximation (Constr. Approx.), 2021
Pierre Stock
Rémi Gribonval
283
24
0
20 Jul 2021
Universal approximation and model compression for radial neural networks
Universal approximation and model compression for radial neural networks
I. Ganev
Twan van Laarhoven
Robin Walters
303
10
0
06 Jul 2021
Practical Assessment of Generalization Performance Robustness for Deep
  Networks via Contrastive Examples
Practical Assessment of Generalization Performance Robustness for Deep Networks via Contrastive Examples
Xuanyu Wu
Xuhong Li
Haoyi Xiong
Xiao Zhang
Siyu Huang
Dejing Dou
116
1
0
20 Jun 2021
Solving hybrid machine learning tasks by traversing weight space
  geodesics
Solving hybrid machine learning tasks by traversing weight space geodesics
G. Raghavan
Matt Thomson
91
0
0
05 Jun 2021
What Kinds of Functions do Deep Neural Networks Learn? Insights from
  Variational Spline Theory
What Kinds of Functions do Deep Neural Networks Learn? Insights from Variational Spline TheorySIAM Journal on Mathematics of Data Science (SIMODS), 2021
Rahul Parhi
Robert D. Nowak
MLT
412
75
0
07 May 2021
Noether's Learning Dynamics: Role of Symmetry Breaking in Neural
  Networks
Noether's Learning Dynamics: Role of Symmetry Breaking in Neural NetworksNeural Information Processing Systems (NeurIPS), 2021
Hidenori Tanaka
D. Kunin
359
39
0
06 May 2021
SGD Implicitly Regularizes Generalization Error
SGD Implicitly Regularizes Generalization Error
Daniel A. Roberts
MLT
127
17
0
10 Apr 2021
Quantitative Performance Assessment of CNN Units via Topological Entropy
  Calculation
Quantitative Performance Assessment of CNN Units via Topological Entropy CalculationInternational Conference on Learning Representations (ICLR), 2021
Yang Zhao
Hao Zhang
253
8
0
17 Mar 2021
Weight Rescaling: Effective and Robust Regularization for Deep Neural
  Networks with Batch Normalization
Weight Rescaling: Effective and Robust Regularization for Deep Neural Networks with Batch Normalization
Ziquan Liu
Yufei Cui
Jia Wan
Yushun Mao
Antoni B. Chan
249
2
0
06 Feb 2021
Accelerating Training of Batch Normalization: A Manifold Perspective
Accelerating Training of Batch Normalization: A Manifold PerspectiveConference on Uncertainty in Artificial Intelligence (UAI), 2021
Mingyang Yi
285
3
0
08 Jan 2021
The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous
  Neural Networks
The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural NetworksInternational Conference on Machine Learning (ICML), 2020
Bohan Wang
Qi Meng
Wei Chen
Tie-Yan Liu
281
41
0
11 Dec 2020
Neural Mechanics: Symmetry and Broken Conservation Laws in Deep Learning
  Dynamics
Neural Mechanics: Symmetry and Broken Conservation Laws in Deep Learning Dynamics
D. Kunin
Javier Sagastuy-Breña
Surya Ganguli
Daniel L. K. Yamins
Hidenori Tanaka
346
89
0
08 Dec 2020
Characterization of Excess Risk for Locally Strongly Convex Population
  Risk
Characterization of Excess Risk for Locally Strongly Convex Population RiskNeural Information Processing Systems (NeurIPS), 2020
Mingyang Yi
Ruoyu Wang
Zhi-Ming Ma
340
3
0
04 Dec 2020
Neural Teleportation
Neural Teleportation
M. Armenta
Thierry Judge
Nathan Painchaud
Youssef Skandarani
Carl Lemaire
Gabriel Gibeau Sanchez
Philippe Spino
Pierre-Marc Jodoin
338
20
0
02 Dec 2020
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the
  Hessian
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the HessianNeural Information Processing Systems (NeurIPS), 2020
Jack Parker-Holder
Luke Metz
Cinjon Resnick
Hengyuan Hu
Adam Lerer
Alistair Letcher
A. Peysakhovich
Aldo Pacchiano
Jakob N. Foerster
188
25
0
12 Nov 2020
An Information-Geometric Distance on the Space of Tasks
An Information-Geometric Distance on the Space of TasksInternational Conference on Machine Learning (ICML), 2020
Yansong Gao
Pratik Chaudhari
204
22
0
01 Nov 2020
The power of quantum neural networks
The power of quantum neural networksNature Computational Science (NCS), 2020
Amira Abbas
David Sutter
Christa Zoufal
Aurelien Lucchi
Alessio Figalli
Stefan Woerner
528
948
0
30 Oct 2020
Effective Regularization Through Loss-Function Metalearning
Effective Regularization Through Loss-Function MetalearningIEEE Congress on Evolutionary Computation (CEC), 2020
Santiago Gonzalez
Xin Qiu
Risto Miikkulainen
522
5
0
02 Oct 2020
Learning Optimal Representations with the Decodable Information
  Bottleneck
Learning Optimal Representations with the Decodable Information BottleneckNeural Information Processing Systems (NeurIPS), 2020
Yann Dubois
Douwe Kiela
D. Schwab
Ramakrishna Vedantam
247
50
0
27 Sep 2020
MSR-DARTS: Minimum Stable Rank of Differentiable Architecture Search
MSR-DARTS: Minimum Stable Rank of Differentiable Architecture Search
Kengo Machida
Kuniaki Uto
Koichi Shinoda
Taiji Suzuki
150
0
0
19 Sep 2020
Extreme Memorization via Scale of Initialization
Extreme Memorization via Scale of InitializationInternational Conference on Learning Representations (ICLR), 2020
Harsh Mehta
Ashok Cutkosky
Behnam Neyshabur
165
21
0
31 Aug 2020
Shallow Univariate ReLu Networks as Splines: Initialization, Loss
  Surface, Hessian, & Gradient Flow Dynamics
Shallow Univariate ReLu Networks as Splines: Initialization, Loss Surface, Hessian, & Gradient Flow Dynamics
Justin Sahs
Ryan Pyle
Aneel Damaraju
J. O. Caro
Onur Tavaslioglu
Andy Lu
Ankit B. Patel
208
20
0
04 Aug 2020
The Representation Theory of Neural Networks
The Representation Theory of Neural Networks
M. Armenta
Pierre-Marc Jodoin
338
35
0
23 Jul 2020
Previous
1234
Next
Page 2 of 4