v1v2v3v4 (latest)

Explorations on high dimensional landscapes

International Conference on Learning Representations (ICLR), 2014

20 December 2014

Papers citing "Explorations on high dimensional landscapes"

41 / 41 papers shown

CopRA: A Progressive LoRA Training Strategy

Xiequn Wang

Yu Zhang

242

30 Oct 2024

The Persistence of Neural Collapse Despite Low-Rank Bias

Connall Garrod

Jonathan P. Keating

302

30 Oct 2024

A survey of deep learning optimizers -- first and second order methods

Rohan Kashyap

ODL

229

28 Nov 2022

A Local Optima Network Analysis of the Feedforward Neural Architecture SpaceIEEE International Joint Conference on Neural Network (IJCNN), 2022

Isak Potgieter

C. Cleghorn

Anna Sergeevna Bosman

106

02 Jun 2022

Universal characteristics of deep neural network loss surfaces from random matrix theory

Nicholas P. Baskerville

196

17 May 2022

Exponentially Many Local Minima in Quantum Neural Networks

Xuchen You

Xiaodi Wu

305

06 Oct 2021

Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and InvariancesInternational Conference on Machine Learning (ICML), 2021

297

119

25 May 2021

Appearance of Random Matrix Theory in Deep Learning

Nicholas P. Baskerville

Diego Granziol

J. Keating

382

12 Feb 2021

Algebraically-Informed Deep Networks (AIDN): A Deep Learning Approach to Represent Algebraic Structures

249

02 Dec 2020

Optimizing Mode Connectivity via Neuron AlignmentNeural Information Processing Systems (NeurIPS), 2020

681

05 Sep 2020

A Topological Framework for Deep Learning

Pavlo Vasylenko

Kyle Istvan

854

31 Aug 2020

Error Estimation and Correction from within Neural Network Differential Equation Solvers

Akshunna S. Dogra

162

09 Jul 2020

The Loss Surfaces of Neural Networks with General Activation FunctionsJournal of Statistical Mechanics: Theory and Experiment (JSTAT), 2020

Nicholas P. Baskerville

361

08 Apr 2020

On the Heavy-Tailed Theory of Stochastic Gradient Descent for Deep Neural Networks

309

29 Nov 2019

Who is Afraid of Big Bad Minima? Analysis of Gradient-Flow in a Spiked Matrix-Tensor ModelNeural Information Processing Systems (NeurIPS), 2019

Stefano Sarao Mannelli

Giulio Biroli

C. Cammarota

Florent Krzakala

Lenka Zdeborová

175

18 Jul 2019

Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape

287

05 Jul 2019

The Difficulty of Training Sparse Neural Networks

336

106

25 Jun 2019

Loss Surface Modality of Feed-Forward Neural Network ArchitecturesIEEE International Joint Conference on Neural Network (IJCNN), 2019

Anna Sergeevna Bosman

A. Engelbrecht

Mardé Helbig

188

24 May 2019

A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks

Umut Simsekli

Levent Sagun

Mert Gurbuzbalaban

488

288

18 Jan 2019

Marvels and Pitfalls of the Langevin Algorithm in Noisy High-dimensional Inference

Stefano Sarao Mannelli

Lenka Zdeborová

422

21 Dec 2018

Non-attracting Regions of Local Minima in Deep and Wide Neural Networks

Henning Petzka

C. Sminchisescu

247

16 Dec 2018

Intrinsic Geometric Vulnerability of High-Dimensional Artificial Intelligence

Luca Bortolussi

G. Sanguinetti

AAML

211

08 Nov 2018

The loss surface of deep linear networks viewed through the algebraic geometry lens

233

17 Oct 2018

Implicit Self-Regularization in Deep Neural Networks: Evidence from Random Matrix Theory and Implications for Learning

Charles H. Martin

Michael W. Mahoney

AI4CE

369

234

02 Oct 2018

Trust-Region Algorithms for Training Responses: Machine Learning Methods Using Indefinite Hessian Approximations

333

01 Jul 2018

The committee machine: Computational to statistical gaps in learning a two-layers neural network

Lenka Zdeborová

267

112

14 Jun 2018

Input and Weight Space Smoothing for Semi-supervised Learning

Safa Cicek

Stefano Soatto

115

23 May 2018

Trainability and Accuracy of Neural Networks: An Interacting Particle System Approach

Grant M. Rotskoff

Eric Vanden-Eijnden

335

140

02 May 2018

The Loss Surface of XOR Artificial Neural Networks

256

06 Apr 2018

Comparing Dynamics: Deep Neural Networks versus Glassy Systems

318

124

19 Mar 2018

Rethinking generalization requires revisiting old ideas: statistical mechanics approaches and complex learning behavior

Charles H. Martin

Michael W. Mahoney

AI4CE

182

26 Oct 2017

Deep Learning applied to Road Traffic Speed forecasting

173

02 Oct 2017

Empirical Analysis of the Hessian of Over-Parametrized Neural NetworksInternational Conference on Learning Representations (ICLR), 2017

338

444

14 Jun 2017

Sharp Minima Can Generalize For Deep Nets

414

832

15 Mar 2017

Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond

283

257

22 Nov 2016

Local minima in training of neural networks

G. Swirszcz

Wojciech M. Czarnecki

Razvan Pascanu

ODL

190

19 Nov 2016

Topology and Geometry of Half-Rectified Network Optimization

C. Freeman

Joan Bruna

735

240

04 Nov 2016

On the Modeling of Error Functions as High Dimensional Landscapes for Weight Initialization in Learning Networks

20 Jul 2016

AdaNet: Adaptive Structural Learning of Artificial Neural NetworksInternational Conference on Machine Learning (ICML), 2016

335

294

05 Jul 2016

On the energy landscape of deep networks

Pratik Chaudhari

Stefano Soatto

ODL

323

20 Nov 2015

Universal halting times in optimization and machine learning

138

19 Nov 2015