Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions

Algebraic Approach to Ridge-Regularized Mean Squared Error Minimization in Minimal ReLU Neural Network

116

24 Sep 2025

Ryoya Fukasaku

Y. Kabata

Akifumi Okuno

234

25 Aug 2025

Sliced-Wasserstein Distance-based Data Selection

Julien Pallage

Antoine Lesage-Landry

283

17 Apr 2025

Generative Feature Training of Thin 2-Layer Networks

J. Hertrich

Sebastian Neumayer

GAN

500

11 Nov 2024

CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural NetworksNeural Information Processing Systems (NeurIPS), 2024

465

02 Nov 2024

Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization

Prateek Varshney

Randomized Geometric Algebra Methods for Convex Neural Networks

465

09 Oct 2024

365

04 Jun 2024

Learning with Norm Constrained, Over-parameterized, Two-layer Neural Networks

Fanghui Liu

L. Dadi

Volkan Cevher

482

29 Apr 2024

The Real Tropical Geometry of Neural Networks

Marie-Charlotte Brandenburg

Georg Loho

Guido Montúfar

453

18 Mar 2024

Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial TimeInternational Conference on Machine Learning (ICML), 2024

Sungyoon Kim

Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models

586

06 Feb 2024

Fangzhao Zhang

Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization

AI4CE

520

04 Feb 2024

Fangzhao Zhang

DiffM

363

03 Feb 2024

Fixing the NTK: From Neural Network Linearizations to Exact Convex ProgramsNeural Information Processing Systems (NeurIPS), 2023

Rajat Vadiraj Dwaraknath

On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

337

26 Sep 2023

296

18 Jun 2023

Optimal Sets and Solution Paths of ReLU NetworksInternational Conference on Machine Learning (ICML), 2023

Aaron Mishkin

Convexifying Transformers: Improving optimization and understanding of transformer networks

385

31 May 2023

Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network CompressionJournal of machine learning research (JMLR), 2023

388

25 May 2023

Convex Dual Theory Analysis of Two-Layer Convolutional Neural Networks with Soft-ThresholdingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

380

14 Apr 2023

Training a Two Layer ReLU Network AnalyticallyItalian National Conference on Sensors (INS), 2023

Adrian Barbu

317

06 Apr 2023

Efficient displacement convex optimization with particle gradient descentInternational Conference on Machine Learning (ICML), 2023

Hadi Daneshmand

Jason D. Lee

Chi Jin

321

09 Feb 2023

259

20 Nov 2022

On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network ParametrizationInternational Conference on Machine Learning (ICML), 2022

456

14 Nov 2022

PathProx: A Proximal Gradient Algorithm for Weight Decay Regularized Deep Neural Networks

Liu Yang

Jifan Zhang

Joseph Shenouda

Dimitris Papailiopoulos

Kangwook Lee

Robert D. Nowak

436

06 Oct 2022

Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision TransformersInternational Conference on Machine Learning (ICML), 2022

John M. Pauly

Morteza Mardani

Parallel Deep Neural Networks Have Zero Duality Gap

352

17 May 2022

Efficient Global Optimization of Two-Layer ReLU Networks: Quadratic-Time Algorithms and Adversarial TrainingSIAM Journal on Mathematics of Data Science (SIMODS), 2022

381

06 Jan 2022

Yifei Wang

478

13 Oct 2021

Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex ProgramsInternational Conference on Machine Learning (ICML), 2021

Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions

OffRL MLT

315

11 Oct 2021

420

12 Jul 2021

Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit RegularizationInternational Conference on Learning Representations (ICLR), 2021

John M. Pauly

Morteza Mardani

On the Reproducibility of Neural Network Predictions

461

02 Mar 2021

Srinadh Bhojanapalli

Sanjiv Kumar

345

05 Feb 2021

Vector-output ReLU Neural Network Problems are Copositive Programs: Convex Analysis of Two Layer Networks and Polynomial-time AlgorithmsInternational Conference on Learning Representations (ICLR), 2020

568

24 Dec 2020

Convex Regularization Behind Neural ReconstructionInternational Conference on Learning Representations (ICLR), 2020

Morteza Mardani

Nonparametric Learning of Two-Layer ReLU Residual Units

John M. Pauly

271

09 Dec 2020

556

17 Aug 2020

The Hidden Convex Optimization Landscape of Two-Layer ReLU Neural Networks: an Exact Characterization of the Optimal SolutionsInternational Conference on Learning Representations (ICLR), 2020

431

10 Jun 2020

The Curious Case of Convex Neural Networks

482

09 Jun 2020

A Brief Prehistory of Double DescentProceedings of the National Academy of Sciences of the United States of America (PNAS), 2020

186

07 Apr 2020

Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-layer NetworksInternational Conference on Machine Learning (ICML), 2020

393

140

24 Feb 2020

Revealing the Structure of Deep Neural Networks via Convex DualityInternational Conference on Machine Learning (ICML), 2020